Inferring the finest pattern of mutual independence from data (2306.12984v1)
Abstract: For a random variable $X$, we are interested in the blind extraction of its finest mutual independence pattern $\mu ( X )$. We introduce a specific kind of independence that we call dichotomic. If $\Delta ( X )$ stands for the set of all patterns of dichotomic independence that hold for $X$, we show that $\mu ( X )$ can be obtained as the intersection of all elements of $\Delta ( X )$. We then propose a method to estimate $\Delta ( X )$ when the data are independent and identically distributed (i.i.d.) realizations of a multivariate normal distribution. If $\hat{\Delta} ( X )$ is the estimated set of valid patterns of dichotomic independence, we estimate $\mu ( X )$ as the intersection of all patterns of $\hat{\Delta} ( X )$. The method is tested on simulated data, which shows its advantages and limitations. We also consider an application to a toy example as well as to experimental data.
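The intersection step described in the abstract can be illustrated with a small sketch. It assumes that a pattern of dichotomic independence is a partition of the variables into two blocks, and that the "intersection" of two patterns is their common refinement (the nonempty pairwise intersections of their blocks). Both readings are assumptions made here for illustration. The function names `dichotomy`, `intersect_partitions`, and `finest_pattern` are hypothetical and do not come from the paper, and the set of valid dichotomies is supplied by hand rather than estimated from data.

```python
def dichotomy(variables, subset):
    """Two-block partition {subset, complement} of the variables (assumed encoding)."""
    block = frozenset(subset)
    return {block, frozenset(variables) - block}


def intersect_partitions(p, q):
    """Common refinement of two partitions: all nonempty pairwise block intersections."""
    return {a & b for a in p for b in q if a & b}


def finest_pattern(variables, dichotomies):
    """Intersect every dichotomic pattern, starting from the one-block partition."""
    pattern = {frozenset(variables)}
    for d in dichotomies:
        pattern = intersect_partitions(pattern, d)
    return pattern


# Illustrative input: five variables whose assumed finest pattern is {1,2} | {3} | {4,5}.
variables = {1, 2, 3, 4, 5}
valid_dichotomies = [
    dichotomy(variables, {1, 2}),  # {1,2} | {3,4,5}
    dichotomy(variables, {3}),     # {3} | {1,2,4,5}
    dichotomy(variables, {4, 5}),  # {4,5} | {1,2,3}
]
pattern = finest_pattern(variables, valid_dichotomies)
print(sorted(sorted(block) for block in pattern))
```

With an empty list of dichotomies the sketch returns the single block containing all variables, which corresponds to no independence being detected.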