Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
157 tokens/sec
GPT-4o
43 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Maximum Likelihood Estimation on Stochastic Blockmodels for Directed Graph Clustering (2403.19516v1)

Published 28 Mar 2024 in stat.ML, cs.LG, cs.SI, math.ST, and stat.TH

Abstract: This paper studies the directed graph clustering problem through the lens of statistics, where we formulate clustering as estimating underlying communities in the directed stochastic block model (DSBM). We conduct the maximum likelihood estimation (MLE) on the DSBM and thereby ascertain the most probable community assignment given the observed graph structure. In addition to the statistical point of view, we further establish the equivalence between this MLE formulation and a novel flow optimization heuristic, which jointly considers two important directed graph statistics: edge density and edge orientation. Building on this new formulation of directed clustering, we introduce two efficient and interpretable directed clustering algorithms, a spectral clustering algorithm and a semidefinite programming based clustering algorithm. We provide a theoretical upper bound on the number of misclustered vertices of the spectral clustering algorithm using tools from matrix perturbation theory. We compare, both quantitatively and qualitatively, our proposed algorithms with existing directed clustering methods on both synthetic and real-world data, thus providing further ground to our theoretical contributions.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (58)
  1. Exact recovery in the stochastic block model. IEEE Transactions on information theory, 62(1):471–487, 2015.
  2. Systemic risk and stability in financial networks. American Economic Review, 105(2):564–608, 2015.
  3. The political blogosphere and the 2004 us election: divided they blog. In Proceedings of the 3rd international workshop on Link discovery, pages 36–43, 2005.
  4. On semidefinite relaxations for the block model. 2018.
  5. Pseudo-likelihood methods for community detection in large sparse networks. 2013.
  6. Characterizing and mining the citation graph of the computer science literature. Knowledge and Information Systems, 6:664–678, 2004.
  7. k-means++: The advantages of careful seeding. In Soda, volume 7, pages 1027–1035, 2007.
  8. Community recovery in non-binary and temporal stochastic block models. arXiv preprint arXiv:2008.04790, 2020.
  9. Lead–lag detection and network clustering for multivariate time series with an application to the us equity market. Machine Learning, 111(12):4497–4538, 2022.
  10. Nicolas Boumal. An introduction to optimization on smooth manifolds. Cambridge University Press, 2023.
  11. The non-convex burer-monteiro approach works on smooth semidefinite programs. Advances in Neural Information Processing Systems, 29, 2016.
  12. Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends® in Machine learning, 3(1):1–122, 2011.
  13. A nonlinear programming algorithm for solving semidefinite programs via low-rank factorization. Mathematical programming, 95(2):329–357, 2003.
  14. Improved graph clustering. IEEE Transactions on Information Theory, 60(10):6440–6455, 2014.
  15. Spectral methods for data science: A statistical perspective. Foundations and Trends® in Machine Learning, 14(5):566–806, 2021.
  16. Simple and scalable constrained clustering: a generalized spectral method. In Artificial Intelligence and Statistics, pages 445–454. PMLR, 2016.
  17. Hermitian matrices for clustering directed graphs: insights and applications. In International Conference on Artificial Intelligence and Statistics, pages 983–992. PMLR, 2020. URL http://128.84.4.34/pdf/1908.02096.
  18. The rotation of eigenvectors by a perturbation. iii. SIAM Journal on Numerical Analysis, 7(1):1–46, 1970.
  19. Asymptotic analysis of the stochastic block model for modular networks and its algorithmic applications. Physical review E, 84(6):066106, 2011.
  20. Magnetic eigenmaps for community detection in directed networks. Physical Review E, 95(2):022302, 2017.
  21. Magnetic eigenmaps for the visualization of directed networks. Applied and Computational Harmonic Analysis, 44(1):189–199, 2018.
  22. Introduction to random graphs. Cambridge University Press, 2016.
  23. The impact of random models on clustering similarity. arXiv preprint arXiv:1701.06508, 2017.
  24. Achieving exact cluster recovery threshold via semidefinite programming: Extensions. IEEE Transactions on Information Theory, 62(10):5918–5937, 2016.
  25. Skew-symmetric adjacency matrices for clustering directed graphs. arXiv preprint arXiv:2203.01388, 2022. URL https://arxiv.org/pdf/2203.01388.pdf.
  26. MSGNN: A Spectral Graph Neural Network Based on a Novel Magnetic Signed Laplacian. In Bastian Rieck and Razvan Pascanu, editors, Proceedings of the First Learning on Graphs Conference, volume 198 of Proceedings of Machine Learning Research, pages 40:1–40:39. PMLR, 09–12 Dec 2022. URL https://proceedings.mlr.press/v198/he22c.html.
  27. Stochastic blockmodels: First steps. Social networks, 5(2):109–137, 1983.
  28. Wolfram Research, Inc. Mathematica, Version 14.0. URL https://www.wolfram.com/mathematica. Champaign, IL, 2024.
  29. Impact of regularization on spectral clustering. 2016.
  30. Stochastic blockmodels and community structure in networks. Physical review E, 83(1):016107, 2011.
  31. Maxwell Mirton Kessler. Bibliographic coupling between scientific papers. American documentation, 14(1):10–25, 1963.
  32. Steinar Laenen. Directed graph clustering using hermitian laplacians. Master’s thesis, 2019.
  33. Steinar Laenen and He Sun. Higher-order spectral clustering of directed graphs. Advances in neural information processing systems, 33:941–951, 2020. URL https://proceedings.neurips.cc/paper/2020/hash/0a5052334511e344f15ae0bfafd47a67-Abstract.html.
  34. Concentration and regularization of random graphs. Random Structures & Algorithms, 51(3):538–561, 2017.
  35. Consistency of spectral clustering in stochastic block models. 2015.
  36. Graph evolution: Densification and shrinking diameters. ACM transactions on Knowledge Discovery from Data (TKDD), 1(1):2–es, 2007.
  37. Convex relaxation methods for community detection. 2021.
  38. Clustering and community detection in directed networks: A survey. Physics reports, 533(4):95–142, 2013.
  39. Statistical clustering of temporal networks through a dynamic stochastic block model. Journal of the Royal Statistical Society Series B, 79(4):1119–1141, 2017. URL https://EconPapers.repec.org/RePEc:bla:jorssb:v:79:y:2017:i:4:p:1119-1141.
  40. Directed random geometric graphs. Journal of Complex Networks, 7(5):792–816, 2019.
  41. Semidefinite programs on sparse random graphs and their application to community detection. In Proceedings of the forty-eighth annual ACM symposium on Theory of Computing, pages 814–827, 2016.
  42. Mark EJ Newman. Community detection and graph partitioning. Europhysics Letters, 103(2):28003, 2013.
  43. Mark EJ Newman. Equivalence between modularity optimization and maximum likelihood methods for community detection. Physical Review E, 94(5):052315, 2016.
  44. An overview of social network analysis. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 2(2):99–115, 2012.
  45. The logic of representing dependencies by directed graphs. In Proceedings of the sixth National conference on Artificial intelligence-Volume 1, pages 374–379, 1987.
  46. Co-clustering for directed graphs: the stochastic co-blockmodel and spectral algorithm di-sim. arXiv preprint arXiv:1204.2296, 2012. URL https://arxiv.org/pdf/1204.2296.pdf.
  47. Co-clustering directed graphs to discover asymmetries and directional communities. Proceedings of the National Academy of Sciences, 113(45):12679–12684, 2016. URL https://www.pnas.org/doi/full/10.1073/pnas.1525793113.
  48. Symmetrizations for clustering directed graphs. In Proceedings of the 14th international conference on extending database technology, pages 343–354, 2011. URL https://dl.acm.org/doi/pdf/10.1145/1951365.1951407?casa_token=IOameE8wvjAAAAAA:s7sr2Ue_mPhbftdLN325dSh6nMdbCjiZFGianaPjY6IqvC8D8S46uVgjv0nJdQvI1kTErZtOkRU.
  49. Normalized cuts and image segmentation. IEEE Transactions on pattern analysis and machine intelligence, 22(8):888–905, 2000.
  50. MA Shubin. Discrete magnetic laplacian. Communications in mathematical physics, 164(2):259–275, 1994.
  51. Henry Small. Co-citation in the scientific literature: A new measure of the relationship between two documents. Journal of the American Society for information Science, 24(4):265–269, 1973.
  52. Yu Tian and Renaud Lambiotte. Structural balance and random walks on complex networks with complex weights. arXiv preprint arXiv:2307.01813, 2023.
  53. On the implementation and usage of sdpt3–a matlab software package for semidefinite-quadratic-linear programming, version 4.0. Handbook on semidefinite, conic and polynomial optimization, pages 715–754, 2012.
  54. Joel A Tropp et al. An introduction to matrix concentration inequalities. Foundations and Trends® in Machine Learning, 8(1-2):1–230, 2015.
  55. Ulrike Von Luxburg. A tutorial on spectral clustering. Statistics and computing, 17:395–416, 2007.
  56. Fantope projection and selection: A near-optimal convex relaxation of sparse pca. Advances in neural information processing systems, 26, 2013.
  57. Hermann Weyl. Das asymptotische verteilungsgesetz der eigenwerte linearer partieller differentialgleichungen (mit einer anwendung auf die theorie der hohlraumstrahlung). Mathematische Annalen, 71(4):441–479, 1912.
  58. A useful variant of the davis–kahan theorem for statisticians. Biometrika, 102(2):315–323, 2015.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets