Graph Contrastive Learning with Adaptive Augmentation (2010.14945v3)

Published 27 Oct 2020 in cs.LG

Abstract: Recently, contrastive learning (CL) has emerged as a successful method for unsupervised graph representation learning. Most graph CL methods first perform stochastic augmentation on the input graph to obtain two graph views and maximize the agreement of representations in the two views. Despite the prosperous development of graph CL methods, the design of graph augmentation schemes -- a crucial component in CL -- remains rarely explored. We argue that the data augmentation schemes should preserve intrinsic structures and attributes of graphs, which will force the model to learn representations that are insensitive to perturbation on unimportant nodes and edges. However, most existing methods adopt uniform data augmentation schemes, like uniformly dropping edges and uniformly shuffling features, leading to suboptimal performance. In this paper, we propose a novel graph contrastive representation learning method with adaptive augmentation that incorporates various priors for topological and semantic aspects of the graph. Specifically, on the topology level, we design augmentation schemes based on node centrality measures to highlight important connective structures. On the node attribute level, we corrupt node features by adding more noise to unimportant node features, to enforce the model to recognize underlying semantic information. We perform extensive experiments of node classification on a variety of real-world datasets. Experimental results demonstrate that our proposed method consistently outperforms existing state-of-the-art baselines and even surpasses some supervised counterparts, which validates the effectiveness of the proposed contrastive framework with adaptive augmentation.

View on arXiv

Authors (6)

Yanqiao Zhu (45 papers)
Yichen Xu (40 papers)
Feng Yu (58 papers)
Qiang Liu (405 papers)
Shu Wu (109 papers)
Liang Wang (512 papers)

Citations (954)

View on Semantic Scholar

Summary

An Academic Overview of "Graph Contrastive Learning with Adaptive Augmentation"

The paper "Graph Contrastive Learning with Adaptive Augmentation" presents a novel framework for unsupervised graph representation learning using contrastive learning (CL) techniques. The proposed method, named GCA (Graph Contrastive learning with Adaptive augmentation), aims to enhance the process of learning node representations by introducing adaptive data augmentation strategies. This effort addresses a significant gap in the current literature where augmentation strategies for CL in the context of graphs have been scarcely explored, despite their proven importance in other domains such as image processing.

Technical Contribution and Methodology

The major contribution of the paper is rooted in the adaptive augmentation strategy that is split into two main components: topology and node attribute augmentation. The authors argue that the augmentation scheme should prioritize preserving intrinsic graph structures and attributes, thus compelling the model to learn robust representations that are not disturbed by perturbations in non-essential nodes and edges.

Topology-Level Augmentation: This involves altering the graph structure by strategically removing edges. The removal process is guided by node centrality measures such as degree centrality, eigenvector centrality, and PageRank. Central to this approach is the idea that edges connected to highly influential nodes should be preserved more rigorously compared to those connected to less critical nodes.
Node-Attribute-Level Augmentation: This involves corrupting node features with noise, where the disruption is more prevalent on unimportant features. The importance of each feature is measured based on its frequency and magnitude in influential nodes determined by centrality measures.

The GCA model generates two graph views using these augmentation schemes and then employs a shared Graph Neural Network (GNN) to learn representations from these views. The training objective is a contrastive loss that maximizes the agreement between the same node representations across the two views while minimizing their similarity with other nodes, effectively distinguishing positive node pairs from negative ones.

Key Findings and Numerical Results

The experimental results reflect the effectiveness of the GCA method. On several benchmark datasets—Wiki-CS, Amazon-Computers, Amazon-Photo, Coauthor-CS, and Coauthor-Physics—the proposed method not only consistently outperforms state-of-the-art unsupervised graph representation learning approaches but also occasionally surpasses supervised methods.

For instance, GCA demonstrated superior performance on the Amazon-Photo dataset with an accuracy rate of 92.53%, significantly higher than the best unsupervised baseline (MVGRL at 91.74%) and competitive with the supervised GAT and GCN models. Similar results were observed across the other datasets, solidifying the claim that the adaptive augmentation enhances the representation learning process.

Implications and Future Directions

Practically, the adaptive augmentation strategy in GCA can be transplanted to existing graph-based applications, such as recommendation systems, social network analysis, and bioinformatics, to improve their accuracy and robustness without extensive labeled data. Theoretically, this approach paves the way for exploring more sophisticated augmentation methods that could leverage additional graph properties, further increasing the versatility and applicability of unsupervised learning on graph structures.

The findings suggest several promising directions for future work:

Exploring Different Centrality Measures: While degree, eigenvector, and PageRank centralities were effective, other measures potentially tailored to specific graph types could yield even better results.
Dynamic Augmentation Strategies: Developing more dynamic strategies that adapt in real-time as the model learns could further improve robustness and performance.
Extending to Heterogeneous Graphs: Adapting the GCA framework to handle heterogeneous graphs where nodes and edges are of different types and feature sets.

In conclusion, the paper makes a substantial contribution to the field of unsupervised learning on graph-structured data through its innovative use of adaptive augmentation, achieving notable empirical success and opening avenues for further research and applications in a variety of domains.

PDF Markdown

Related Papers

Find Related Papers