Estimating mixed memberships in multi-layer networks (2404.03916v2)
Abstract: Community detection in multi-layer networks has emerged as a crucial area of modern network analysis. However, conventional approaches often assume that nodes belong exclusively to a single community, which fails to capture the complex structure of real-world networks where nodes may belong to multiple communities simultaneously. To address this limitation, we propose novel spectral methods to estimate the common mixed memberships in the multi-layer mixed membership stochastic block model. The proposed methods leverage the eigen-decomposition of three aggregate matrices: the sum of adjacency matrices, the debiased sum of squared adjacency matrices, and the sum of squared adjacency matrices. We establish rigorous theoretical guarantees for the consistency of our methods. Specifically, we derive per-node error rates under mild conditions on network sparsity, demonstrating their consistency as the number of nodes and/or layers increases under the multi-layer mixed membership stochastic block model. Our theoretical results reveal that the method leveraging the sum of adjacency matrices generally performs poorer than the other two methods for mixed membership estimation in multi-layer networks. We conduct extensive numerical experiments to empirically validate our theoretical findings. For real-world multi-layer networks with unknown community information, we introduce two novel modularity metrics to quantify the quality of mixed membership community detection. Finally, we demonstrate the practical applications of our algorithms and modularity metrics by applying them to real-world multi-layer networks, demonstrating their effectiveness in extracting meaningful community structures.
- Mixed membership stochastic blockmodels. Journal of Machine Learning Research, 9, 1981–2014.
- A tensor approach to learning mixed membership community models. The Journal of Machine Learning Research, 15, 2239–2312.
- The successive projections algorithm for variable selection in spectroscopic multicomponent analysis. Chemometrics and intelligent laboratory systems, 57, 65–73.
- A comprehensive transcriptional map of primate brain development. Nature, 535, 367–375.
- The structure and dynamics of multilayer networks. Physics reports, 544, 1–122.
- The two-to-infinity norm and singular subspace geometry with applications to high-dimensional statistics. Annals of Statistics, 47, 2405–2439.
- Wiring optimization can relate neuronal structure and function. Proceedings of the National Academy of Sciences, 103, 4723–4728.
- Spectral methods for data science: A statistical perspective. Foundations and Trends® in Machine Learning, 14, 566–806.
- Community detection for multilayer weighted networks. Information Sciences, 595, 119–141.
- Structural reducibility of multilayer networks. Nature communications, 6, 6864.
- Fortunato, S. (2010). Community detection in graphs. Physics reports, 486, 75–174.
- Community detection in networks: A user guide. Physics reports, 659, 1–44.
- Fast and robust recursive algorithmsfor separable nonnegative matrix factorization. IEEE transactions on pattern analysis and machine intelligence, 36, 698–714.
- Semidefinite programming based preconditioning for more robust near-separable nonnegative matrix factorization. SIAM Journal on Optimization, 25, 677–698.
- Efficient discovery of overlapping communities in massive networks. Proceedings of the National Academy of Sciences of the United States of America, 110, 14534–14539.
- Consistent estimation of dynamic and multi-layer block models. In International Conference on Machine Learning (pp. 1511–1520). PMLR.
- Stochastic blockmodels: First steps. Social networks, 5, 109–137.
- A survey of community detection methods in multilayer networks. Data Mining and Knowledge Discovery, 35, 1–45.
- Community detection in networks: A multidisciplinary review. Journal of Network and Computer Applications, 108, 87–111.
- Mixed membership estimation for social networks. Journal of Econometrics, 239, 105369.
- Community detection on mixture multilayer networks via regularized tensor decomposition. The Annals of Statistics, 49, 3181–3205.
- Using svd for topic modeling. Journal of the American Statistical Association, 119, 434–449.
- Community detection in multi-layer graphs: A survey. ACM SIGMOD Record, 44, 37–48.
- Multilayer networks. Journal of complex networks, 2, 203–271.
- Lazega, E. (2001). The collegial phenomenon: The social mechanisms of cooperation among peers in a corporate law partnership. Oxford University Press, USA.
- Consistent community detection in multi-layer network data. Biometrika, 107, 61–73.
- Bias-adjusted spectral clustering in multi-layer stochastic block models. Journal of the American Statistical Association, 118, 2433–2445.
- Combinatorial analysis of multiple networks. arXiv preprint arXiv:1303.4986, .
- Estimating mixed memberships with sharp eigenvector deviations. Journal of the American Statistical Association, 116, 1928–1940.
- Community structure in time-dependent, multiscale, and multiplex networks. science, 328, 876–878.
- Simultaneous clustering of multiple gene expression and physical interaction datasets. PLoS computational biology, 6, e1000742.
- Fuzzy communities and the concept of bridgeness in complex networks. Physical Review E, 77, 016107.
- Newman, M. E. (2002). Assortative mixing in networks. Physical Review Letters, 89, 208701.
- Newman, M. E. (2003a). Mixing patterns in networks. Physical Review E, 67, 026126.
- Newman, M. E. (2003b). The structure and function of complex networks. SIAM review, 45, 167–256.
- Finding and evaluating community structure in networks. Physical review E, 69, 026113.
- Spectral and matrix factorization methods for consistent community detection in multi-layer networks. The Annals of Statistics, 48, 230 – 250.
- Null models and community detection in multi-layer networks. Sankhya A, (pp. 1–55).
- Spectral clustering in the dynamic stochastic block model. Electronic Journal of Statistics, 13, 678 – 709.
- Overlapping community detection using bayesian non-negative matrix factorization. Physical Review E, 83, 066114.
- Regularized spectral clustering under the mixed membership stochastic block model. Neurocomputing, 550, 126490.
- Bipartite mixed membership distribution-free model. a novel model for community detection in overlapping bipartite weighted networks. Expert Systems with Applications, 235, 121088.
- New specifications for exponential random graph models. Sociological methodology, 36, 99–153.
- Spectral co-clustering in rank-deficient multi-layer stochastic co-block models. arXiv preprint arXiv:2307.10572, .
- Tropp, J. A. (2012). User-friendly tail bounds for sums of random matrices. Foundations of computational mathematics, 12, 389–434.
- Community discovery using nonnegative matrix factorization. Data Mining and Knowledge Discovery, 22, 493–521.
- Finding common modules in a time-varying network with application to the drosophila melanogaster gene regulation network. Journal of the American Statistical Association, 112, 994–1008.