Network reconstruction via the minimum description length principle (2405.01015v3)
Abstract: A fundamental problem associated with the task of network reconstruction from dynamical or behavioral data consists in determining the most appropriate model complexity in a manner that prevents overfitting, and produces an inferred network with a statistically justifiable number of edges. The status quo in this context is based on $L_{1}$ regularization combined with cross-validation. However, besides its high computational cost, this commonplace approach unnecessarily ties the promotion of sparsity with weight "shrinkage". This combination forces a trade-off between the bias introduced by shrinkage and the network sparsity, which often results in substantial overfitting even after cross-validation. In this work, we propose an alternative nonparametric regularization scheme based on hierarchical Bayesian inference and weight quantization, which does not rely on weight shrinkage to promote sparsity. Our approach follows the minimum description length (MDL) principle, and uncovers the weight distribution that allows for the most compression of the data, thus avoiding overfitting without requiring cross-validation. The latter property renders our approach substantially faster to employ, as it requires a single fit to the complete data. As a result, we have a principled and efficient inference scheme that can be used with a large variety of generative models, without requiring the number of edges to be known in advance. We also demonstrate that our scheme yields systematically increased accuracy in the reconstruction of both artificial and empirical networks. We highlight the use of our method with the reconstruction of interaction networks between microbial communities from large-scale abundance samples involving in the order of $10{4}$ to $10{5}$ species, and demonstrate how the inferred model can be used to predict the outcome of interventions in the system.
- T. Bury, A statistical physics perspective on criticality in financial markets, Journal of Statistical Mechanics: Theory and Experiment 2013, P11004 (2013), arxiv:1310.2446 [physics, q-fin] .
- P. D’haeseleer, S. Liang, and R. Somogyi, Genetic network inference: From co-expression clustering to reverse engineering, Bioinformatics 16, 707 (2000).
- G. Stolovitzky, D. Monroe, and A. Califano, Dialogue on Reverse-Engineering Assessment and Methods, Annals of the New York Academy of Sciences 1115, 1 (2007).
- Braunstein Alfredo, Ingrosso Alessandro, and Muntoni Anna Paola, Network reconstruction from infection cascades, Journal of The Royal Society Interface 16, 20180844 (2019).
- L. Peel, T. P. Peixoto, and M. De Domenico, Statistical inference links data and theory in network science, Nature Communications 13, 6794 (2022).
- J. Sun, D. Taylor, and E. M. Bollt, Causal Network Inference by Optimal Causation Entropy, SIAM Journal on Applied Dynamical Systems 10.1137/140956166 (2015).
- E. Bullmore and O. Sporns, Complex brain networks: Graph theoretical analysis of structural and functional systems, Nature Reviews Neuroscience 10, 186 (2009).
- R. Tibshirani, Regression Shrinkage and Selection Via the Lasso, Journal of the Royal Statistical Society: Series B (Methodological) 58, 267 (1996).
- N. Meinshausen and P. Bühlmann, High-dimensional graphs and variable selection with the Lasso, The Annals of Statistics 34, 1436 (2006).
- M. Yuan and Y. Lin, Model selection and estimation in the Gaussian graphical model, Biometrika 94, 19 (2007).
- O. Banerjee, L. E. Ghaoui, and A. d’Aspremont, Model Selection Through Sparse Maximum Likelihood Estimation for Multivariate Gaussian or Binary Data, Journal of Machine Learning Research 9, 485 (2008).
- L. Augugliaro, A. Abbruzzo, and V. Vinciotti, ℓℓ\ellroman_ℓ1-Penalized censored Gaussian graphical model, Biostatistics 21, e1 (2020).
- J. Friedman, T. Hastie, and R. Tibshirani, Sparse inverse covariance estimation with the graphical lasso, Biostatistics 9, 432 (2008).
- P. Ravikumar, M. J. Wainwright, and J. D. Lafferty, High-dimensional Ising model selection using ℓℓ\ellroman_ℓ1-regularized logistic regression, The Annals of Statistics 38, 1287 (2010).
- H. C. Nguyen, R. Zecchina, and J. Berg, Inverse statistical problems: From the inverse Ising problem to data science, Advances in Physics 66, 197 (2017).
- E. Aurell and M. Ekeberg, Inverse Ising Inference Using All the Data, Physical Review Letters 108, 090201 (2012).
- A. Decelle and F. Ricci-Tersenghi, Pseudolikelihood Decimation Algorithm Improving the Inference of the Interaction Network in a General Class of Ising Models, Physical Review Letters 112, 070603 (2014).
- R. Foygel and M. Drton, Extended Bayesian Information Criteria for Gaussian Graphical Models, in Advances in Neural Information Processing Systems, Vol. 23 (Curran Associates, Inc., 2010).
- H. Wang, Bayesian Graphical Lasso Models and Efficient Posterior Computation, Bayesian Analysis 7, 867 (2012).
- A. Mohammadi and E. C. Wit, Bayesian Structure Learning in Sparse Gaussian Graphical Models, Bayesian Analysis 10, 109 (2015).
- A. Dobra, A. Lenkoski, and A. Rodriguez, Bayesian Inference for General Gaussian Graphical Models With Application to Multivariate Lattice Data, Journal of the American Statistical Association 106, 1418 (2011).
- H. Wang, Scaling It Up: Stochastic Search Structure Learning in Graphical Models, Bayesian Analysis 10, 351 (2015).
- Z. Richard Li, T. H. McCormick, and S. J. Clark, Bayesian Joint Spike-and-Slab Graphical Lasso, Proceedings of machine learning research 97, 3877 (2019).
- C. M. Carvalho, N. G. Polson, and J. G. Scott, Handling Sparsity via the Horseshoe, in Proceedings of the Twelth International Conference on Artificial Intelligence and Statistics (PMLR, 2009) pp. 73–80.
- Y. Li, B. A. Craig, and A. Bhadra, The Graphical Horseshoe Estimator for Inverse Covariance Matrices, Journal of Computational and Graphical Statistics 28, 747 (2019).
- Z. R. Li, T. H. McComick, and S. J. Clark, Using Bayesian Latent Gaussian Graphical Models to Infer Symptom Associations in Verbal Autopsies, Bayesian analysis 15, 781 (2020).
- J. Rissanen, Information and Complexity in Statistical Modeling, 1st ed. (Springer, 2010).
- P. D. Grünwald, The Minimum Description Length Principle (The MIT Press, 2007).
- T. P. Peixoto, Scalable network reconstruction in subquadratic time (2024), arxiv:2401.01404 [physics, stat] .
- A. N. Tikhonov, On the stability of inverse problems, in Dokl. Akad. Nauk Sssr, Vol. 39 (1943) pp. 195–198.
- W. W. Zachary, An Information Flow Model for Conflict and Fission in Small Groups, Journal of Anthropological Research 33, 452 (1977), 3629752 .
- S. J. Wright, Coordinate descent algorithms, Mathematical Programming 151, 3 (2015).
- T. P. Peixoto, Merge-split Markov chain Monte Carlo for community detection, Physical Review E 102, 012305 (2020).
- T. P. Peixoto, The graph-tool python library, figshare 10.6084/m9.figshare.1164194 (2014a), available at https://graph-tool.skewed.de.
- J. Moody, Peer influence groups: Identifying dense clusters in large networks, Social Networks 23, 261 (2001).
- M. Girvan and M. E. J. Newman, Community structure in social and biological networks, Proceedings of the National Academy of Sciences 99, 7821 (2002).
- T. P. Peixoto, Network Reconstruction and Community Detection from Dynamics, Physical Review Letters 123, 128301 (2019a).
- B. Karrer and M. E. J. Newman, Stochastic blockmodels and community structure in networks, Physical Review E 83, 016107 (2011).
- T. P. Peixoto, Nonparametric Bayesian inference of the microcanonical stochastic block model, Physical Review E 95, 012317 (2017).
- T. P. Peixoto, Hierarchical Block Structures and High-Resolution Model Selection in Large Networks, Physical Review X 4, 011047 (2014b).
- N. Connor, A. Barberán, and A. Clauset, Using null models to infer microbial co-occurrence networks, PLOS ONE 12, e0176751 (2017).
- M. Layeghifard, D. M. Hwang, and D. S. Guttman, Disentangling Interactions in the Microbiome: A Network Perspective, Trends in Microbiology 25, 217 (2017).
- J. A. Gilbert, F. Meyer, J. Jansson, J. Gordon, N. Pace, J. Tiedje, R. Ley, N. Fierer, D. Field, N. Kyrpides, F.-O. Glöckner, H.-P. Klenk, K. E. Wommack, E. Glass, K. Docherty, R. Gallery, R. Stevens, and R. Knight, The Earth Microbiome Project: Meeting report of the “1st EMP meeting on sample selection and acquisition” at Argonne National Laboratory October 6th 2010, Standards in Genomic Sciences 3, 249 (2010).
- R. T. Paine, A Note on Trophic Complexity and Community Stability, The American Naturalist 103, 91 (1969).
- L. S. Mills, M. E. Soulé, and D. F. Doak, The Keystone-Species Concept in Ecology and Conservation, BioScience 43, 219 (1993), 1312122 .
- V. Vinciotti, E. Wit, and F. Richter, Random graphical model of microbiome interactions in related environments (2023), arxiv:2304.01956 [stat] .
- T. P. Peixoto, Bayesian Stochastic Blockmodeling, in Advances in Network Clustering and Blockmodeling (John Wiley & Sons, Ltd, 2019) pp. 289–332.
- IEEE Standard for Floating-Point Arithmetic, IEEE Std 754-2008 , 1 (2008).
- J. Besag, Spatial Interaction and the Statistical Analysis of Lattice Systems, Journal of the Royal Statistical Society: Series B (Methodological) 36, 192 (1974).
- A. Mozeika, O. Dikmen, and J. Piili, Consistent inference of a general model using the pseudolikelihood method, Physical Review E 90, 010101 (2014).