An Overview of GNNExplainer: Generating Explanations for Graph Neural Networks
Graph Neural Networks (GNNs) have emerged as a powerful approach to machine learning on graph-structured data. They excel at combining node feature information with graph structure via recursive neural message passing along graph edges. However, the same entangled message passing that makes GNNs effective also makes their predictions difficult to explain. This paper introduces GNNExplainer, a model-agnostic framework for generating interpretable explanations of GNN predictions across a range of graph-based machine learning tasks.
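To keep the later discussion concrete, the sketch below shows what a single round of neural message passing looks like in PyTorch: a GCN-style layer in which each node averages its neighbors' features and applies a shared transformation. The dense-adjacency formulation and names are illustrative and not tied to any specific GNN used in the paper.

```python
# A minimal sketch of one round of neural message passing (GCN-style mean
# aggregation) on a dense adjacency matrix; names and shapes are illustrative.
import torch

def message_passing_layer(x, adj, weight):
    # x: [num_nodes, in_dim] node features
    # adj: [num_nodes, num_nodes] adjacency matrix (1.0 where an edge exists)
    # weight: [in_dim, out_dim] shared linear transformation
    deg = adj.sum(dim=1, keepdim=True).clamp(min=1)  # node degrees, avoiding division by zero
    aggregated = adj @ x / deg                       # each node averages its neighbors' features
    return torch.relu(aggregated @ weight)           # shared transform + nonlinearity
```

Stacking several such layers means a node's prediction depends on a multi-hop neighborhood, which is exactly the computation graph that an explanation method has to untangle.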
Core Proposal
GNNExplainer explains a GNN's prediction by isolating a compact subgraph of the input graph and a small subset of node features that are most influential for that prediction. Unlike previous methods tailored to non-graph neural networks or to specific GNN architectures, GNNExplainer applies to any GNN-based model without modifying the architecture or retraining.
The crux of GNNExplainer is its formulation as an optimization task: maximize the mutual information (MI) between the GNN's prediction and the distribution of possible explanatory subgraphs. This keeps explanations compact and faithful to the model, and the same framework extends from single predictions to explaining an entire class of instances.
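In the paper's notation, where Y is the model's prediction, G_S is a candidate explanatory subgraph of the target node's computation graph, and X_S are the features restricted to G_S, the single-instance objective is:

$$\max_{G_S} \; \mathrm{MI}\big(Y, (G_S, X_S)\big) \;=\; H(Y) \;-\; H\big(Y \mid G = G_S,\; X = X_S\big)$$

Because the entropy H(Y) is fixed once the GNN is trained, maximizing MI amounts to minimizing the conditional entropy H(Y | G = G_S, X = X_S), i.e. finding the small subgraph and feature subset under which the model's prediction is most certain.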
Methodology
- Single-instance Explanations: GNNExplainer explains an individual prediction by identifying the critical subgraph and node features, chosen to maximize mutual information with the prediction so that the selected structure genuinely drives the output. To make the search over subgraphs tractable, the method uses a mean-field variational approximation and a continuous relaxation of the subgraph (a learnable mask over edges and features), so the objective can be optimized with gradient descent; see the sketch after this list.
- Multi-instance Explanations: To generate explanations for an entire class of predictions, GNNExplainer aggregates individual instance explanations into prototypes by aligning and summarizing them. This relies on graph alignment techniques, making it possible to distill common structures influential across multiple instances.
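As a rough illustration of the single-instance procedure, the sketch below learns soft edge and feature masks for one node's prediction using plain PyTorch. It assumes a trained model whose forward pass accepts (x, edge_index, edge_weight) and returns per-node log-probabilities; the function name, hyperparameters, and regularizer weights are illustrative choices, not the paper's reference implementation.

```python
# Minimal sketch of GNNExplainer-style single-instance explanation.
# Assumption: `model` is a trained GNN whose forward pass is
# model(x, edge_index, edge_weight=None) and returns per-node log-probabilities.
import torch

def explain_node(model, x, edge_index, node_idx, epochs=100, lr=0.01,
                 sparsity_coeff=0.005, entropy_coeff=1.0):
    model.eval()
    num_edges = edge_index.size(1)
    num_feats = x.size(1)

    # Continuous relaxation: a learnable mask per edge and per feature dimension.
    edge_mask = torch.nn.Parameter(torch.randn(num_edges) * 0.1)
    feat_mask = torch.nn.Parameter(torch.randn(num_feats) * 0.1)
    optimizer = torch.optim.Adam([edge_mask, feat_mask], lr=lr)

    # The label to explain: the model's own prediction for the target node.
    with torch.no_grad():
        target = model(x, edge_index).argmax(dim=-1)[node_idx]

    for _ in range(epochs):
        optimizer.zero_grad()
        e = torch.sigmoid(edge_mask)   # soft edge selection in [0, 1]
        f = torch.sigmoid(feat_mask)   # soft feature selection in [0, 1]
        log_probs = model(x * f, edge_index, edge_weight=e)

        # With the predicted label fixed, maximizing mutual information reduces to
        # minimizing conditional entropy, i.e. the negative log-likelihood of that label.
        pred_loss = -log_probs[node_idx, target]

        # Regularizers encourage small, near-discrete masks (compact explanations).
        sparsity = e.sum() + f.sum()
        mask_entropy = -(e * torch.log(e + 1e-12)
                         + (1 - e) * torch.log(1 - e + 1e-12)).mean()

        loss = pred_loss + sparsity_coeff * sparsity + entropy_coeff * mask_entropy
        loss.backward()
        optimizer.step()

    return torch.sigmoid(edge_mask).detach(), torch.sigmoid(feat_mask).detach()
```

After optimization, thresholding the edge mask yields the compact explanatory subgraph, and the highest-scoring dimensions of the feature mask give the feature explanation.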
Experimental Validation
The paper evaluates GNNExplainer on both synthetic and real-world datasets:
Synthetic Data
Extensive experiments on four synthetic datasets, BA-Shapes, BA-Community, Tree-Cycles, and Tree-Grid, show that GNNExplainer outperforms gradient-based saliency maps and attention weights from graph attention networks (GATs); a rough sketch of how such a benchmark can be assembled follows the list:
- BA-Shapes: Houses attached to a Barabási-Albert (BA) graph.
- BA-Community: Two interconnected BA-Shapes graphs.
- Tree-Cycles: Cycles attached to a balanced binary tree.
- Tree-Grid: Grids replacing cycles in Tree-Cycles.
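For concreteness, here is a rough sketch of a BA-Shapes-style generator in networkx: a Barabási-Albert base graph with five-node "house" motifs attached at random base nodes, and node labels given by structural role. The sizes, motif wiring, and label scheme are illustrative approximations, not the paper's exact generator.

```python
# Rough sketch of a BA-Shapes-style synthetic graph: a Barabási-Albert base graph
# with five-node "house" motifs attached at random nodes. Parameters are illustrative.
import random
import networkx as nx

def ba_shapes(num_base_nodes=300, num_houses=80, seed=0):
    random.seed(seed)
    g = nx.barabasi_albert_graph(num_base_nodes, m=5, seed=seed)
    labels = {n: 0 for n in g.nodes}  # label 0 = base-graph node

    for _ in range(num_houses):
        offset = g.number_of_nodes()
        # A house motif: a 4-cycle (the "walls" and "floor") plus a roof node.
        house_edges = [(0, 1), (1, 2), (2, 3), (3, 0), (0, 4), (1, 4)]
        g.add_edges_from((offset + u, offset + v) for u, v in house_edges)
        for i, role in enumerate([1, 1, 2, 2, 3]):  # roles: top / bottom / roof of house
            labels[offset + i] = role
        # Attach the motif to a random node of the base graph.
        g.add_edge(offset, random.choice(range(num_base_nodes)))

    return g, labels
```

The ground-truth explanation for any node inside a motif is the motif itself, which is what makes these datasets useful for measuring explanation accuracy.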
GNNExplainer consistently outperforms these baselines, improving explanation accuracy by up to 43.0% when recovering the ground-truth motifs, demonstrating its robustness in generating high-fidelity explanations.
Real-world Data
Real-world datasets such as Mutag (molecular graphs) and Reddit-Binary (online discussion threads) further validate GNNExplainer's practical utility:
- Mutag: GNNExplainer accurately identifies crucial chemical substructures (e.g., NO2, carbon rings) associated with mutagenic effects.
- Reddit-Binary: It identifies interaction patterns indicative of Question-Answer or Online-Discussion threads, whereas the baselines produce incomplete explanations.
Implications and Future Work
Practically, GNNExplainer aids in:
- Model Transparency: Enhancing trust in models by providing a clear rationale behind GNN predictions.
- Error Analysis: Diagnosing and debugging faulty GNNs by highlighting where and why the model goes astray.
- Domain Insights: Revealing domain-specific structures that influence predictions, beneficial in fields such as chemoinformatics and social network analysis.
Theoretically, GNNExplainer is a foundational step towards universally applicable GNN interpretability tools. Future work could explore scaling GNNExplainer to larger graphs, integrating more sophisticated graph alignment techniques, and developing explanations that account for graph structures that evolve over time.
Conclusion
GNNExplainer represents a significant stride toward interpretable GNN predictions, applicable across graph-based machine learning tasks without sacrificing generality or performance. Its design and effectiveness should help build the trust needed to deploy GNNs in critical applications.