
Inferring the finest pattern of mutual independence from data (2306.12984v1)

Published 22 Jun 2023 in stat.ML, cs.LG, math.ST, stat.ME, and stat.TH

Abstract: For a random variable $X$, we are interested in the blind extraction of its finest mutual independence pattern $\mu ( X )$. We introduce a specific kind of independence that we call dichotomic. If $\Delta ( X )$ stands for the set of all patterns of dichotomic independence that hold for $X$, we show that $\mu ( X )$ can be obtained as the intersection of all elements of $\Delta ( X )$. We then propose a method to estimate $\Delta ( X )$ when the data are independent and identically distributed (i.i.d.) realizations of a multivariate normal distribution. If $\hat{\Delta} ( X )$ is the estimated set of valid patterns of dichotomic independence, we estimate $\mu ( X )$ as the intersection of all patterns of $\hat{\Delta} ( X )$. The method is tested on simulated data, showing its advantages and limits. We also consider an application to a toy example as well as to experimental data.
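
The intersection step described in the abstract can be illustrated with a minimal sketch. It assumes that a pattern of dichotomic independence is a partition of the variable indices into two blocks, and that the "intersection" of partitions is their common refinement (all non-empty pairwise intersections of blocks); both readings are assumptions made for illustration, and the function names `intersect_partitions` and `finest_pattern` are hypothetical, not taken from the paper. The sketch does not cover the estimation of $\hat{\Delta} ( X )$ itself.

```python
from itertools import product


def intersect_partitions(p, q):
    """Common refinement of two partitions (assumed meaning of 'intersection').

    Each partition is a set of frozensets; the result contains every
    non-empty intersection of a block of p with a block of q.
    """
    return {a & b for a, b in product(p, q) if a & b}


def finest_pattern(variables, dichotomies):
    """Intersect all given two-block patterns, starting from the trivial
    one-block partition of the variables."""
    result = {frozenset(variables)}
    for d in dichotomies:
        result = intersect_partitions(result, d)
    return result


# Illustrative input: four variables whose (assumed) valid dichotomic
# patterns are 12|34, 123|4 and 124|3.
variables = {1, 2, 3, 4}
dichotomies = [
    {frozenset({1, 2}), frozenset({3, 4})},
    {frozenset({1, 2, 3}), frozenset({4})},
    {frozenset({1, 2, 4}), frozenset({3})},
]

pattern = finest_pattern(variables, dichotomies)
print(sorted(sorted(block) for block in pattern))
```

With these three patterns, the intersection separates variables 3 and 4 from each other and from the pair {1, 2}, which stays together because no listed dichotomy splits it.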

