Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
153 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Balanced Multi-Relational Graph Clustering (2407.16863v1)

Published 23 Jul 2024 in cs.LG, cs.AI, and cs.SI

Abstract: Multi-relational graph clustering has demonstrated remarkable success in uncovering underlying patterns in complex networks. Representative methods manage to align different views motivated by advances in contrastive learning. Our empirical study finds the pervasive presence of imbalance in real-world graphs, which is in principle contradictory to the motivation of alignment. In this paper, we first propose a novel metric, the Aggregation Class Distance, to empirically quantify structural disparities among different graphs. To address the challenge of view imbalance, we propose Balanced Multi-Relational Graph Clustering (BMGC), comprising unsupervised dominant view mining and dual signals guided representation learning. It dynamically mines the dominant view throughout the training process, synergistically improving clustering performance with representation learning. Theoretical analysis ensures the effectiveness of dominant view mining. Extensive experiments and in-depth analysis on real-world and synthetic datasets showcase that BMGC achieves state-of-the-art performance, underscoring its superiority in addressing the view imbalance inherent in multi-relational graphs. The source code and datasets are available at https://github.com/zxlearningdeep/BMGC.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (42)
  1. Emmanuel Abbe. 2018. Community detection and stochastic block models: recent developments. Journal of Machine Learning Research 18, 177 (2018), 1–86.
  2. Effects of Graph Convolutions in Multi-layer Networks. In The Eleventh International Conference on Learning Representations.
  3. Sudhanshu Chanpuriya and Cameron Musco. 2022. Simplified graph convolution with heterophily. Advances in Neural Information Processing Systems 35 (2022), 27184–27197.
  4. Adaptive Universal Generalized PageRank Graph Neural Network. In International Conference on Learning Representations.
  5. One2multi graph autoencoder for multi-view graph clustering. In proceedings of the web conference 2020. 3070–3076.
  6. PMR: Prototypical Modal Rebalance for Multimodal Learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 20029–20038.
  7. Magnn: Metapath aggregated graph neural network for heterogeneous graph embedding. In Proceedings of The Web Conference 2020. 2331–2341.
  8. Trusted multi-view classification with dynamic evidential fusion. IEEE transactions on pattern analysis and machine intelligence 45, 2 (2022), 2551–2566.
  9. Ruining He and Julian McAuley. 2016. Ups and downs: Modeling the visual evolution of fashion trends with one-class collaborative filtering. In proceedings of the 25th international conference on world wide web. 507–517.
  10. What makes multi-modal learning better than single (provably). Advances in Neural Information Processing Systems 34 (2021), 10944–10956.
  11. Hdmi: High-order deep multiplex infomax. In Proceedings of the Web Conference 2021. 2414–2424.
  12. Thomas N Kipf and Max Welling. 2016. Variational graph auto-encoders. arXiv preprint arXiv:1611.07308 (2016).
  13. Multi-view attributed graph clustering. IEEE Transactions on knowledge and data engineering 35, 2 (2023), 1872–1880.
  14. Dual label-guided graph refinement for multi-view graph clustering. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 37. 8791–8798.
  15. Multilayer graph contrastive clustering network. Information Sciences 613 (2022), 256–267.
  16. Late fusion incomplete multi-view clustering. IEEE transactions on pattern analysis and machine intelligence 41, 10 (2018), 2410–2423.
  17. Deep graph clustering via dual correlation reduction. In Proceedings of the AAAI conference on artificial intelligence, Vol. 36. 7603–7611.
  18. Relation structure-aware heterogeneous information network embedding. In Proceedings of the AAAI conference on artificial intelligence, Vol. 33. 4456–4463.
  19. Revisiting heterophily for graph neural networks. Advances in neural information processing systems 35 (2022), 1362–1375.
  20. Multiplex Graph Representation Learning Via Dual Correlation Reduction. IEEE Transactions on Knowledge and Data Engineering (2023).
  21. Disentangled multiplex graph representation learning. In International Conference on Machine Learning. PMLR, 24983–25005.
  22. Erlin Pan and Zhao Kang. 2021. Multi-view contrastive graph clustering. Advances in neural information processing systems 34 (2021), 2148–2159.
  23. Erlin Pan and Zhao Kang. 2023a. Beyond homophily: Reconstructing structure for graph-agnostic clustering. In International Conference on Machine Learning. PMLR, 26868–26877.
  24. Erlin Pan and Zhao Kang. 2023b. High-order multi-view clustering for generic data. Information Fusion 100 (2023), 101947.
  25. Unsupervised attributed multiplex network embedding. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 5371–5378.
  26. Unsupervised Multiplex Graph Learning with Complementary and Consistent Information. In Proceedings of the 31st ACM International Conference on Multimedia. 454–462.
  27. Balanced multimodal learning via on-the-fly gradient modulation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8238–8247.
  28. Upper Bounding Barlow Twins: A Novel Filter for Multi-Relational Clustering. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 38. 14660–14668.
  29. An attention-based collaboration framework for multi-view network representation learning. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. 1767–1776.
  30. Semi-Supervised Embedding of Attributed Multiplex Networks. In Proceedings of the ACM Web Conference 2023. 578–587.
  31. Deep fusion clustering network. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 9978–9987.
  32. Deep Graph Infomax. In International Conference on Learning Representations.
  33. Microsoft academic graph: When experts are not enough. Quantitative Science Studies 1, 1 (2020), 396–413.
  34. What makes training multi-modal classification networks hard?. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 12695–12705.
  35. Heterogeneous graph attention network. In The world wide web conference. 2022–2032.
  36. Heterogeneous graph contrastive multi-view learning. In Proceedings of the 2023 SIAM International Conference on Data Mining (SDM). SIAM, 136–144.
  37. Simplifying graph convolutional networks. In International conference on machine learning. PMLR, 6861–6871.
  38. Graph neural networks for natural language processing: A survey. Foundations and Trends® in Machine Learning 16, 2 (2023), 119–328.
  39. Flexible multi-view unsupervised graph embedding. IEEE Transactions on Image Processing 30 (2021), 4143–4156.
  40. Flexible multi-view dimensionality co-reduction. IEEE Transactions on Image Processing 26, 2 (2016), 648–659.
  41. Network schema preserving heterogeneous information network embedding. In International joint conference on artificial intelligence (IJCAI).
  42. Graph neural networks with heterophily. In Proceedings of the AAAI conference on artificial intelligence, Vol. 35. 11168–11176.
Citations (2)

Summary

  • The paper introduces BMGC, a framework that addresses view imbalances in multi-relational graphs using unsupervised dominant view mining and co-aligned representation learning.
  • It proposes the Aggregation Class Distance metric to evaluate structural disparities, yielding superior clustering performance across metrics like NMI, ARI, ACC, and F1.
  • Extensive experiments on synthetic and real-world data validate BMGC's state-of-the-art results, highlighting its potential in recommendation systems and network analysis.

Balanced Multi-Relational Graph Clustering

The proliferation of multi-relational graphs in real-world applications, such as citation and social networks, presents unique clustering challenges. This academic paper addresses these challenges with a particular focus on the imbalance across different views within these graphs. It introduces a novel approach termed Balanced Multi-Relational Graph Clustering (BMGC), which effectively tackles the inherent view imbalance through unsupervised dominant view mining and co-aligned representation learning.

Key Contributions

  1. View Imbalance in Multi-Relational Graphs: The paper identifies that differing views in multi-relational graphs possess varying degrees of relevance to clustering tasks. Traditional methods, which aim to align all views equally, may overlook this discrepancy, leading to suboptimal performance. BMGC, conversely, incorporates mechanisms to dynamically determine the dominant view, harnessing its strengths to guide representation learning.
  2. Aggregation Class Distance (ACD): ACD is proposed as a metric to evaluate structural disparities between views. It provides a more comprehensive analysis by accounting for node aggregation processes and node class distributions, outperforming traditional metrics like homophily ratios, especially in scenarios where graph structure homogeneity doesn't directly translate to better task performance.
  3. Balanced Multi-Relational Graph Clustering (BMGC): The BMGC framework is designed to dynamically mine the dominant view of the graph and synergizes this information with representation learning processes. BMGC leverages dual signal guidance from both dominant views and original node features, aligning their representations through contrastive learning. It further enhances clustering quality with dominant assignments.
  4. Theoretical Validation: The efficacy of dominant view mining is supported by theoretical analysis, indicating a strong correlation between minimized similarity loss in aggregated features and optimal clustering partitions.
  5. Experimental Validation: Extensive experiments on both synthetic and real-world datasets demonstrate BMGC's superiority. Notably, BMGC achieves state-of-the-art results across various metrics like NMI, ARI, ACC, and F1, outperforming traditional and modern graph clustering methods.

Implications and Future Directions

The development of BMGC showcases a significant advancement in handling view imbalances in multi-relational graphs, with implications that extend beyond clustering. Practically, this approach can improve the accuracy of network analysis in domains such as recommendation systems and biochemical networks, where understanding multiple relational perspectives is crucial.

Theoretically, BMGC opens avenues for future exploration such as further refining ACD to include additional graph properties or extending the framework to semi-supervised scenarios. Further exploration into the scalability of BMGC with respect to ultra-large datasets will also be essential, potentially leveraging distributed computing resources.

In conclusion, BMGC offers a well-founded, empirically validated framework for multi-relational graph clustering, setting a new benchmark in the domain. It highlights the importance of considering view imbalances, providing a roadmap for future research to build upon its foundational concepts.

Github Logo Streamline Icon: https://streamlinehq.com
X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets

Youtube Logo Streamline Icon: https://streamlinehq.com