Community detection on directed networks with missing edges (2410.19651v1)
Abstract: Identifying significant community structures in networks with incomplete data is a challenging task, as the reliability of solutions diminishes with increasing levels of missing information. However, in many empirical contexts, some information about the uncertainty in the network measurements can be estimated. In this work, we extend the recently developed Flow Stability framework, originally designed for detecting communities in time-varying networks, to address the problem of community detection in weighted, directed networks with missing links. Our approach leverages known uncertainty levels in nodes' out-degrees to enhance the robustness of community detection. Through comparisons on synthetic networks and a real-world network of messaging channels on the Telegram platform, we demonstrate that our method delivers more reliable community structures, even when a significant portion of data is missing.