
Inference for Heterogeneous Graphical Models using Doubly High-Dimensional Linear-Mixed Models (2403.10034v1)

Published 15 Mar 2024 in stat.ME and stat.AP

Abstract: Motivated by the problem of inferring the graph structure of functional connectivity networks from multi-level functional magnetic resonance imaging data, we develop a valid inference framework for high-dimensional graphical models that accounts for group-level heterogeneity. We introduce a neighborhood-based method to learn the graph structure and reframe the problem as that of inferring fixed effect parameters in a doubly high-dimensional linear mixed model. Specifically, we propose a LASSO-based estimator and a de-biased LASSO-based inference framework for the fixed effect parameters in the doubly high-dimensional linear mixed model, leveraging random matrix theory to deal with challenges induced by the identical fixed and random effect design matrices arising in our setting. Moreover, we introduce consistent estimators for the variance components to identify subject-specific edges in the inferred graph. To illustrate the generality of the proposed approach, we also adapt our method to account for serial correlation by learning heterogeneous graphs in the setting of a vector autoregressive model. We demonstrate the performance of the proposed framework using real data and benchmark simulation studies.
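
As a rough illustration of the neighborhood-based idea mentioned in the abstract, the sketch below regresses each variable on all the others with a LASSO penalty and reads edges off the nonzero coefficients. This is the generic single-population neighborhood selection procedure, not the paper's estimator: it ignores group-level heterogeneity, random effects, de-biasing, and serial correlation. The function names (`soft_threshold`, `lasso_cd`, `neighborhood_selection`), the penalty `lam`, and the "OR" symmetrization rule are assumptions made for illustration only.

```python
import numpy as np

def soft_threshold(z, t):
    # Scalar soft-thresholding operator used by the LASSO coordinate update.
    return np.sign(z) * max(abs(z) - t, 0.0)

def lasso_cd(X, y, lam, n_iter=200):
    """Minimize (1/(2n)) * ||y - X b||^2 + lam * ||b||_1 by coordinate descent."""
    n, p = X.shape
    b = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n   # per-column scale x_j'x_j / n
    r = y.copy()                        # running residual y - X b
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0:
                continue
            r += X[:, j] * b[j]         # remove column j's current contribution
            rho = X[:, j] @ r / n       # correlation of column j with partial residual
            b[j] = soft_threshold(rho, lam) / col_sq[j]
            r -= X[:, j] * b[j]         # add the updated contribution back
    return b

def neighborhood_selection(Y, lam):
    """Estimate an undirected graph from an (n x p) data matrix Y.

    Illustrative only: one LASSO regression per node, edges joined with
    the "OR" rule (an edge is kept if either regression selects it).
    """
    n, p = Y.shape
    Y = Y - Y.mean(axis=0)              # center columns; no intercept is fitted
    adj = np.zeros((p, p), dtype=bool)
    for j in range(p):
        others = [k for k in range(p) if k != j]
        b = lasso_cd(Y[:, others], Y[:, j], lam)
        adj[j, others] = b != 0
    return adj | adj.T
```

Calling `neighborhood_selection(Y, lam)` on a data matrix with one row per observation returns a symmetric boolean adjacency matrix. In the paper's setting, each nodewise regression would instead be a linear mixed model whose fixed effects are estimated and then de-biased for inference.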
