Multi-view Contrastive Graph Clustering (2110.11842v1)

Published 22 Oct 2021 in cs.LG, cs.AI, cs.CV, and cs.SI

Abstract: With the explosive growth of information technology, multi-view graph data have become increasingly prevalent and valuable. Most existing multi-view clustering techniques either focus on the scenario of multiple graphs or multi-view attributes. In this paper, we propose a generic framework to cluster multi-view attributed graph data. Specifically, inspired by the success of contrastive learning, we propose multi-view contrastive graph clustering (MCGC) method to learn a consensus graph since the original graph could be noisy or incomplete and is not directly applicable. Our method composes of two key steps: we first filter out the undesirable high-frequency noise while preserving the graph geometric features via graph filtering and obtain a smooth representation of nodes; we then learn a consensus graph regularized by graph contrastive loss. Results on several benchmark datasets show the superiority of our method with respect to state-of-the-art approaches. In particular, our simple approach outperforms existing deep learning-based methods.

Citations (163)

View on Semantic Scholar

Summary

The paper introduces Multi-view Contrastive Graph Clustering (MCGC), a novel framework that uses contrastive learning at the graph level to create a consensus graph for multi-view attributed data.
MCGC incorporates graph filtering and adaptive weighting mechanisms to integrate diverse views and improve the quality of graph representations for clustering.
Experimental results show MCGC consistently outperforms existing methods on benchmark datasets, achieving performance gains, such as up to 19% accuracy improvement over GAE on the IMDB dataset.

Multi-view Contrastive Graph Clustering

This paper presents a novel framework termed Multi-view Contrastive Graph Clustering (MCGC) to address the challenges inherent in clustering multi-view attributed graph data. The authors seek to improve upon existing methodologies by incorporating contrastive learning principles to create a consensus graph that integrates both attribute content and structural information across diverse views. This approach provides an insightful contribution to the field of graph-based clustering where the need to reconcile noisy and incomplete data from multiple views is increasingly pertinent.

Technical Contributions

Contrastive Loss at Graph Level: The key innovation in this paper is the introduction of a contrastive loss function at the graph level. Unlike usual instance-level contrastive learning, this method focuses on pulling similar nodes closer together and pushing dissimilar nodes apart within the graph representation. Such operations enhance the clustering-friendly nature of the consensus graph learned from multiple views.
Graph Filtering: MCGC employs a graph filtering technique designed to reduce high-frequency noise while preserving the geometric integrity of the graph's features. This pre-processing step is critical to obtaining a smooth and reliable representation that serves as the foundation for subsequent clustering tasks.
Adaptive Weighting: The authors utilize an adaptive weighting mechanism to integrate information from various views into the consensus graph. This approach allows for the alignment of differing view contributions, thereby stabilizing the clustering process and enhancing accuracy.

Experimental Evaluation

The efficacy of MCGC is validated on five benchmark datasets: ACM, DBLP, IMDB, Amazon Photos, and Amazon Computers. These datasets vary in node features and graph structures, presenting a comprehensive challenge for multi-view graph clustering solutions.

Performance Gains: MCGC consistently outperforms both shallow methods like LINE and deep learning approaches such as GAE. For instance, improvements in accuracy of up to 19% were seen compared to GAE in the IMDB dataset. This demonstrates the benefit of leveraging multi-view data effectively through contrastive learning principles.
Comparison with Advanced Methods: Even when pitted against existing advanced techniques designed for multi-view data like O2MA and MAGCN, MCGC demonstrates superior performance, highlighting the superiority of its methodology in producing discriminative representations suitable for clustering.

Implications and Future Directions

The results indicate that MCGC is a competitive method that can scale effectively across different datasets. The paper showcases how contrastive learning can be adapted effectively for graph-level data, opening avenues for further research into contrastive techniques within graph-based machine learning frameworks.

Future work could explore the extension of MCGC to handle more complex data structures or higher-dimensional datasets. Furthermore, deeper investigation into the adaptation of this model for large-scale data, where the memory footprint may become a concern, could be valuable. The researchers hint at this limitation as a key area for development.

In conclusion, MCGC represents a meaningful advancement in multi-view graph clustering, particularly in cases where graph representation quality can significantly impact clustering outcomes. The insights provided are a valuable contribution to the ongoing development of robust clustering methodologies in increasingly data-rich environments.