Identity-aware Graph Neural Networks (2101.10320v2)

Published 25 Jan 2021 in cs.LG, cs.AI, and cs.SI

Abstract: Message passing Graph Neural Networks (GNNs) provide a powerful modeling framework for relational data. However, the expressive power of existing GNNs is upper-bounded by the 1-Weisfeiler-Lehman (1-WL) graph isomorphism test, which means that GNNs are not able to predict node clustering coefficients and shortest path distances, and cannot differentiate between different d-regular graphs. Here we develop a class of message passing GNNs, named Identity-aware Graph Neural Networks (ID-GNNs), with greater expressive power than the 1-WL test. ID-GNN offers a minimal but powerful solution to limitations of existing GNNs. ID-GNN extends existing GNN architectures by inductively considering nodes' identities during message passing. To embed a given node, ID-GNN first extracts the ego network centered at the node, then conducts rounds of heterogeneous message passing, where different sets of parameters are applied to the center node than to other surrounding nodes in the ego network. We further propose a simplified but faster version of ID-GNN that injects node identity information as augmented node features. Altogether, both versions of ID-GNN represent general extensions of message passing GNNs, where experiments show that transforming existing GNNs to ID-GNNs yields on average 40% accuracy improvement on challenging node, edge, and graph property prediction tasks; 3% accuracy improvement on node and graph classification benchmarks; and 15% ROC AUC improvement on real-world link prediction tasks. Additionally, ID-GNNs demonstrate improved or comparable performance over other task-specific graph networks.

Authors (4)

Jiaxuan You
Jonathan Gomes-Selman
Rex Ying
Jure Leskovec

Summary

Overview of "Identity-aware Graph Neural Networks"

The paper "Identity-aware Graph Neural Networks" addresses the limitations of traditional message passing Graph Neural Networks (GNNs) by introducing a class of GNNs termed Identity-aware Graph Neural Networks (ID-GNNs). The authors target the ceiling on expressive power that the 1-Weisfeiler-Lehman (1-WL) graph isomorphism test imposes on existing GNN architectures. This ceiling prevents a GNN from predicting certain graph properties, such as node clustering coefficients and shortest path distances, and from differentiating between different $d$-regular graphs, because nodes whose local neighborhoods look alike under message passing receive identical embeddings.
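To make the 1-WL limitation concrete, the following is a minimal sketch of 1-WL color refinement applied to two 2-regular graphs on six nodes: a single hexagon and two disjoint triangles. The helper names (`cycle_graph`, `wl_color_histogram`) and the choice of three refinement rounds are illustrative assumptions, not taken from the paper.

```python
from collections import Counter


def cycle_graph(n, offset=0):
    """Adjacency list of an n-node cycle with node ids offset..offset+n-1."""
    return {
        offset + i: [offset + (i - 1) % n, offset + (i + 1) % n]
        for i in range(n)
    }


def wl_color_histogram(adj, rounds=3):
    """Run 1-WL color refinement and return the multiset of node colors.

    A node's new color is the pair (own color, sorted neighbor colors).
    The nested tuple itself serves as the color, so colors are directly
    comparable across different graphs.
    """
    colors = {v: 0 for v in adj}  # every node starts with the same color
    for _ in range(rounds):
        colors = {
            v: (colors[v], tuple(sorted(colors[u] for u in adj[v])))
            for v in adj
        }
    return Counter(colors.values())


hexagon = cycle_graph(6)
two_triangles = {**cycle_graph(3), **cycle_graph(3, offset=3)}

# Both graphs are 2-regular, so every node keeps the same color in every
# round and the two color histograms coincide.
same = wl_color_histogram(hexagon) == wl_color_histogram(two_triangles)
print(same)
```

Because the two histograms are equal, a test (or a GNN) bounded by 1-WL cannot tell these non-isomorphic graphs apart.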

Key Contributions

The research proposes ID-GNNs to enhance the expressiveness of standard GNNs beyond the 1-WL test's limitations. The core innovation involves incorporating node identities during the message passing phase, thereby enabling the model to discriminate between structurally indistinguishable nodes. This is achieved through two primary methods:

Heterogeneous Message Passing: To embed a node, ID-GNN extracts the ego network centered at that node and, in each layer, applies a different set of weight parameters to the center node than to the surrounding nodes. This is analogous to giving the center node an extra color or identity tag during computation, which lets the model tell apart nodes that would otherwise have identical computation graphs.
Augmented Node Features: As an alternative, the authors present a simplified, faster version of ID-GNN that injects identity information as augmented node features, here cycle counts appended to the original node features.
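The heterogeneous message passing idea can be sketched in a few lines. This is a deliberately simplified illustration, not the authors' implementation: node states are scalars, aggregation is a plain sum with no nonlinearity, and the weights `w_center` and `w_other` as well as the function names are hypothetical.

```python
def cycle_graph(n, offset=0):
    """Adjacency list of an n-node cycle with node ids offset..offset+n-1."""
    return {
        offset + i: [offset + (i - 1) % n, offset + (i + 1) % n]
        for i in range(n)
    }


def k_hop_nodes(adj, center, k):
    """Nodes within k hops of center (the node set of its ego network)."""
    frontier, seen = {center}, {center}
    for _ in range(k):
        frontier = {u for v in frontier for u in adj[v]} - seen
        seen |= frontier
    return seen


def id_gnn_embed(adj, center, num_layers, w_other=1.0, w_center=2.0):
    """Toy identity-aware embedding of `center` (assumed scalar states).

    Messages sent by the center node are scaled by w_center; messages
    sent by any other node are scaled by w_other.
    """
    ego = k_hop_nodes(adj, center, num_layers)
    h = {u: 1.0 for u in ego}  # identical initial features everywhere
    for _ in range(num_layers):
        new_h = {}
        for u in ego:
            total = h[u]
            for s in adj[u]:
                if s in ego:
                    weight = w_center if s == center else w_other
                    total += weight * h[s]
            new_h[u] = total
        h = new_h
    return h[center]


hexagon = cycle_graph(6)
triangle = cycle_graph(3)

# Identical weights reduce to ordinary message passing: same embedding.
plain = (id_gnn_embed(hexagon, 0, 3, 1.0, 1.0),
         id_gnn_embed(triangle, 0, 3, 1.0, 1.0))
# Distinct weights for the center node separate the two cases.
aware = (id_gnn_embed(hexagon, 0, 3), id_gnn_embed(triangle, 0, 3))
print(plain, aware)
```

With equal weights the hexagon node and the triangle node get the same value, while a separate weight for the center node yields different values, because the center's own message returns to it after three steps only in the triangle.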

Empirical Results

The evaluation on multiple datasets shows that transforming existing GNNs into their ID-GNN variants improves performance across a range of tasks. On average, the reported gains are 40% in accuracy on challenging node, edge, and graph property prediction tasks, 3% in accuracy on node and graph classification benchmarks, and 15% in ROC AUC on real-world link prediction tasks. ID-GNNs also show improved or comparable performance relative to existing task-specific graph networks, which points to their general applicability.

Theoretical Implications

Theoretically, the paper asserts that ID-GNNs have greater expressive power than the 1-WL test that bounds traditional GNNs. This is evidenced by their ability to differentiate graph structures that 1-WL-bounded models cannot, such as certain $d$-regular graphs. Furthermore, ID-GNNs can use cycle counts to capture structural features that are predictive of node clustering coefficients, a concrete consequence of the added expressiveness.
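The link between cycle counts and clustering coefficients can be illustrated with a small sketch. It assumes a simple undirected graph without self-loops and uses closed-walk counts (walks of length $k$ that start and end at the same node, i.e. the diagonal of the $k$-th power of the adjacency matrix) as the cycle-count feature; the function names are hypothetical. Under these assumptions the number of closed walks of length 2 equals the degree $d_v$, the number of length 3 equals twice the number of triangles through the node, and the clustering coefficient is the length-3 count divided by $d_v(d_v - 1)$.

```python
def cycle_graph(n):
    """Adjacency list of an n-node cycle with node ids 0..n-1."""
    return {i: [(i - 1) % n, (i + 1) % n] for i in range(n)}


def closed_walk_counts(adj, v, max_len):
    """counts[k-1] = number of length-k walks that start and end at v."""
    counts = []
    walks = {u: 0 for u in adj}
    walks[v] = 1  # one walk of length 0, sitting at v
    for _ in range(max_len):
        nxt = {u: 0 for u in adj}
        for u, c in walks.items():
            for w in adj[u]:
                nxt[w] += c  # extend every walk ending at u by edge (u, w)
        walks = nxt
        counts.append(walks[v])
    return counts


def clustering_coefficient(adj, v):
    """Local clustering coefficient recovered from closed-walk counts."""
    _, degree, closed3 = closed_walk_counts(adj, v, 3)
    if degree < 2:
        return 0.0
    return closed3 / (degree * (degree - 1))


triangle = cycle_graph(3)
hexagon = cycle_graph(6)

print(closed_walk_counts(triangle, 0, 3), clustering_coefficient(triangle, 0))
print(closed_walk_counts(hexagon, 0, 3), clustering_coefficient(hexagon, 0))
```

The two nodes have the same degree but different length-3 counts, so a feature vector of such counts separates them and determines their clustering coefficients, which plain 1-WL-bounded message passing cannot do for these graphs.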

Future Directions in AI

This paper opens up several promising directions for advancing graph neural network research. One potential area involves exploring variations in node identity encodings and their impact on model expressiveness and efficiency. Moreover, expanding ID-GNNs to more complex or dynamic graphs could enhance their application scope in various domains, such as bioinformatics, network analysis, and social network dynamics. Understanding the trade-offs between computational complexity and expressive power will also be crucial for deploying these models in large-scale, real-world scenarios.

In conclusion, the introduction of ID-GNNs represents a significant step forward in overcoming the expressive limitations inherent in traditional GNN architectures. By harnessing the inductive biases provided by node identity information, ID-GNNs achieve more nuanced graph representations, enhancing their performance across a wide range of tasks while maintaining computational efficiency. This work lays a robust foundation for future investigations into more expressive and scalable graph learning methodologies.
