
FastEnsemble: scalable ensemble clustering on large networks

Published 3 Sep 2024 in cs.SI | (2409.02077v2)

Abstract: Many community detection algorithms are inherently stochastic, so their output varies with input parameters and random seeds. This variability makes the result of any single run less reliable. Moreover, different clustering algorithms, optimization criteria (e.g., modularity, the Constant Potts model), and resolution values can produce substantially different partitions of the same network. Consensus clustering methods, such as ECG and FastConsensus, have been proposed to reduce the instability of non-deterministic algorithms and improve their accuracy by combining a set of partitions obtained from multiple runs of a clustering algorithm. In this work, we introduce FastEnsemble, a new consensus clustering method. Our results on a wide range of synthetic networks show that FastEnsemble produces more accurate clusterings than two other consensus clustering methods, ECG and FastConsensus, for many model conditions. Furthermore, FastEnsemble is fast enough to be used on networks with more than 3 million nodes, and so improves on the speed and scalability of FastConsensus. Finally, we showcase the utility of consensus clustering methods in mitigating the effect of the resolution limit and in clustering networks that are only partially covered by communities.
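To make the idea of "combining a set of partitions" concrete, here is a minimal Python sketch of one generic way to build a consensus from several runs. It is not taken from the paper: the abstract does not describe FastEnsemble's procedure, so the function names (edge_support, consensus_components), the 0.8 threshold, and the final connected-components step are all illustrative assumptions. The sketch scores each edge by the fraction of input partitions that place its endpoints in the same cluster, drops weakly supported edges, and reads clusters off the remaining graph.

```python
from collections import defaultdict


def edge_support(edges, partitions):
    """For each edge, return the fraction of partitions whose two endpoints share a cluster."""
    support = {}
    for u, v in edges:
        agree = sum(1 for p in partitions if p[u] == p[v])
        support[(u, v)] = agree / len(partitions)
    return support


def consensus_components(nodes, edges, partitions, threshold=0.8):
    """Hypothetical consensus step (an assumption, not the paper's method):
    keep edges with support >= threshold, then label connected components."""
    support = edge_support(edges, partitions)
    adj = defaultdict(set)
    for (u, v), s in support.items():
        if s >= threshold:
            adj[u].add(v)
            adj[v].add(u)

    labels = {}
    next_label = 0
    for start in nodes:
        if start in labels:
            continue
        # Depth-first traversal over the thresholded graph.
        labels[start] = next_label
        stack = [start]
        while stack:
            x = stack.pop()
            for y in adj[x]:
                if y not in labels:
                    labels[y] = next_label
                    stack.append(y)
        next_label += 1
    return labels


# Toy network: two triangles joined by a single bridge edge (2, 3).
nodes = [0, 1, 2, 3, 4, 5]
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]

# Three made-up "runs": two separate the triangles, one merges everything.
partitions = [
    {0: "a", 1: "a", 2: "a", 3: "b", 4: "b", 5: "b"},
    {0: "x", 1: "x", 2: "x", 3: "y", 4: "y", 5: "y"},
    {0: "m", 1: "m", 2: "m", 3: "m", 4: "m", 5: "m"},
]

support = edge_support(edges, partitions)
labels = consensus_components(nodes, edges, partitions, threshold=0.8)
```

In this toy input the bridge edge is supported by only one of the three runs, so it falls below the threshold and the two triangles end up as separate consensus clusters. A practical method would likely re-run a clustering algorithm on the support-weighted graph instead of taking connected components; that choice is left open here.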

