Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
156 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Flow Divergence: Comparing Maps of Flows with Relative Entropy (2401.09052v1)

Published 17 Jan 2024 in cs.SI and physics.soc-ph

Abstract: Networks represent how the entities of a system are connected and can be partitioned differently, prompting ways to compare partitions. Common approaches for comparing network partitions include information-theoretic measures based on mutual information and set-theoretic measures such as the Jaccard index. These measures are often based on computing the agreement in terms of overlap between different partitions of the same set. However, they ignore link patterns which are essential for the organisation of networks. We propose flow divergence, an information-theoretic divergence measure for comparing network partitions, inspired by the ideas behind the Kullback-Leibler divergence and the map equation for community detection. Similar to the Kullback-Leibler divergence, flow divergence adopts a coding perspective and compares two network partitions $\mathsf{M}_a$ and $\mathsf{M}_b$ by considering the expected extra number of bits required to describe a random walk on a network using $\mathsf{M}_b$ relative to reference partition $\mathsf{M}_a$. Because flow divergence is based on random walks, it can be used to compare partitions with arbitrary and different depths. We show that flow divergence distinguishes between partitions that traditional measures consider to be equally good when compared to a reference partition. Applied to real networks, we use flow divergence to estimate the cost of overfitting in incomplete networks and to visualise the solution landscape of network partitions.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (31)
  1. Santo Fortunato. Community detection in graphs. Physics Reports, 486(3):75–174, 2010. ISSN 0370-1573. doi: 10.1016/j.physrep.2009.11.002.
  2. Hanneke van der Hoef and Matthijs J. Warrens. Understanding information theoretic measures for comparing clusterings. Behaviormetrika, 46(2):353–370, Oct 2019. ISSN 1349-6964. doi: 10.1007/s41237-018-0075-7.
  3. Paul Jaccard. The distribution of the flora in the alpine zone. New Phytologist, 11(2):37–50, 1912. doi: https://doi.org/10.1111/j.1469-8137.1912.tb05611.x.
  4. Elements of information theory, second edition. John Wiley & Sons, 2006. ISBN 978-0-471-24195-9.
  5. A bi-directional approach to comparing the modular structure of networks. EPJ Data Science, 10(1):13, Mar 2021. ISSN 2193-1127. doi: 10.1140/epjds/s13688-021-00269-8.
  6. A network community structure similarity index for weighted networks. PLOS ONE, 18(11):1–17, 11 2023. doi: 10.1371/journal.pone.0292018.
  7. Standardized mutual information for clustering comparisons: One step further in adjustment for chance. In Eric P. Xing and Tony Jebara, editors, Proceedings of the 31st International Conference on Machine Learning, volume 32 of Proceedings of Machine Learning Research, pages 1143–1151, Bejing, China, 22–24 Jun 2014. PMLR.
  8. Hierarchical mutual information for the comparison of hierarchical community structures in complex networks. Phys. Rev. E, 92:062825, Dec 2015. doi: 10.1103/PhysRevE.92.062825.
  9. Tracking network dynamics: a survey of distances and similarity metrics, 2018.
  10. Mapping robust multiscale communities in chromosome contact networks. Scientific Reports, 13(1):12979, Aug 2023. ISSN 2045-2322. doi: 10.1038/s41598-023-39522-7.
  11. Comparing community structure to characteristics in online collegiate social networks. SIAM Review, 53(3):526–543, 2011. doi: 10.1137/080734315.
  12. Comparing network covers using mutual information, 2012.
  13. Maps of random walks on complex networks reveal community structure. Proceedings of the National Academy of Sciences, 105(4):1118–1123, 2008. doi: 10.1073/pnas.0706851105.
  14. Similarity-based link prediction from modular compression of network flows. In Bastian Rieck and Razvan Pascanu, editors, Proceedings of the First Learning on Graphs Conference, volume 198 of Proceedings of Machine Learning Research, pages 52:1–52:18. PMLR, 09–12 Dec 2022.
  15. M. E. J. Newman. Modularity and community structure in networks. Proceedings of the National Academy of Sciences, 103(23):8577–8582, 2006. doi: 10.1073/pnas.0601602103.
  16. C. E. Shannon. A mathematical theory of communication. Bell Syst. Tech. J., 27:379–423, 1948.
  17. David A Huffman. A method for the construction of minimum-redundancy codes. Proceedings of the IRE, 40(9):1098–1101, 1952. doi: 10.1109/JRPROC.1952.273898.
  18. David F. Gleich. Pagerank beyond the web. SIAM Review, 57(3):321–363, 2015. doi: 10.1137/140976649.
  19. R. Lambiotte and M. Rosvall. Ranking and clustering of nodes in networks with smart teleportation. Phys. Rev. E, 85:056107, May 2012. doi: 10.1103/PhysRevE.85.056107.
  20. Community detection with the map equation and infomap: Theory and applications. arXiv preprint arXiv:2311.04036, 2023.
  21. M. Rosvall and C. T. Bergstrom. Multilevel Compression of Random Walks on Networks Reveals Hierarchical Organization in Large Integrated Systems. PLoS One, 6:e18209, 2011.
  22. Mapping flows on weighted and directed networks with incomplete observations. Journal of Complex Networks, 9(6), 2021. doi: 10.1093/comnet/cnab044.
  23. Mapping Higher-Order Network Flows in Memory and Multilayer Networks with Infomap. Algorithms, 10:112, 2017.
  24. The MapEquation software package. https://mapequation.org.
  25. M. Girvan and M. E. J. Newman. Community structure in social and biological networks. Proceedings of the National Academy of Sciences, 99(12):7821–7826, 2002. doi: 10.1073/pnas.122653799.
  26. Community structure in jazz. Advances in Complex Systems, 06(04):565–573, 2003. doi: 10.1142/S0219525903001067.
  27. Interaction data from the copenhagen networks study. Scientific Data, 6(1):315, Dec 2019. ISSN 2052-4463. doi: 10.1038/s41597-019-0325-x.
  28. Organization mining using online social networks. Networks and Spatial Economics, 16(2):545–578, Jun 2016. ISSN 1572-9427. doi: 10.1007/s11067-015-9288-4.
  29. Umap: Uniform manifold approximation and projection for dimension reduction, 2020.
  30. Density-based clustering based on hierarchical density estimates. In Jian Pei, Vincent S. Tseng, Longbing Cao, Hiroshi Motoda, and Guandong Xu, editors, Advances in Knowledge Discovery and Data Mining, pages 160–172, Berlin, Heidelberg, 2013. Springer Berlin Heidelberg. ISBN 978-3-642-37456-2.
  31. Evaluating overfit and underfit in models of network community structure. IEEE Transactions on Knowledge and Data Engineering, 32(9):1722–1735, 2020. doi: 10.1109/TKDE.2019.2911585.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com