- The paper introduces a novel GNN model that dynamically coarsens graphs to retain structural features while reducing memory overhead.
- It interleaves convolution and pooling layers to build a scalable hierarchical architecture that surpasses traditional global pooling methods.
- Empirical results on benchmarks such as Enzymes, Proteins, D&D, and Collab show competitive accuracy with significantly lower memory costs, which scale linearly rather than quadratically in the number of nodes.
Sparse Hierarchical Graph Classification
The paper "Towards Sparse Hierarchical Graph Classifiers" proposes a novel approach to hierarchical graph classification utilizing graph neural networks (GNNs). This research addresses a significant limitation of existing methods that primarily center around node classification and link prediction, integrating the graph classification problem with an explicit focus on maintaining computational efficiency and scalability.
Overview
Graph classification involves predicting a label for an entire graph structure. Traditional approaches either rely on global pooling of node features, which discards structural information, or on fixed, precomputed coarsening schemes that cannot adapt to varying graph topologies. In contrast, this work incorporates differentiable graph coarsening akin to image downsampling in CNNs: the graph is progressively reduced in size while essential structural and feature information is retained, giving it a substantial advantage over both families of methods.
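Concretely, the differentiable coarsening step builds on the top-k pooling of Gao & Ji's Graph U-Net. A sketch of that operation, with X the node feature matrix, A the adjacency matrix, p a learnable projection vector, N the node count, and k in (0, 1] the pooling ratio:

```latex
% Score every node, keep the top ceil(kN), and gate the kept features
% with tanh(y) so the projection vector p receives gradients.
\begin{aligned}
\vec{y} &= X\vec{p} \,/\, \lVert \vec{p} \rVert, &
\vec{i} &= \operatorname{top-k}\!\left(\vec{y},\, \lceil kN \rceil\right),\\
X' &= \left(X \odot \tanh(\vec{y})\right)_{\vec{i}}, &
A' &= A_{\vec{i},\vec{i}}.
\end{aligned}
```

The tanh gating is what makes node selection trainable end to end; dropping it would leave the projection vector without a gradient signal.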
Model Design
The proposed architecture interleaves graph convolutional and pooling layers in a manner inspired by classical CNNs but tailored for graph-structured data.
- Convolutional Layer: a simple GCN-style propagation rule that combines a self-loop-augmented adjacency matrix with a learnable linear transformation of node features, letting the layer handle graphs of arbitrary structure and size.
- Pooling Layer: rather than forming soft clusters as DiffPool does, which requires a dense cluster-assignment matrix and hence memory quadratic in the number of nodes, the model adopts a node-dropping strategy (the top-k pooling described above). A learnable projection scores each node, only the highest-scoring nodes are retained, and the coarsened graph is the subgraph they induce, downsizing the graph while respecting its intrinsic structure.
- Readout Layer: after every convolution-pooling block, the node features are summarized by concatenating their max- and average-pooled values; the per-block summaries are then aggregated into a single graph representation that feeds the final classifier (a code sketch of all three components follows this list).
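To make the three components concrete, here is a minimal PyTorch sketch of the architecture: a GCN-style convolution, the top-k pooling, and the mean/max readout, wired into three blocks whose summaries are summed. This is a sketch under assumptions, not the authors' code: class names, the dense-adjacency formulation, single-graph (unbatched) inputs, the exact normalization, and all hyperparameters are illustrative choices; a practical implementation would use sparse operations and batching.

```python
import math

import torch
import torch.nn as nn
import torch.nn.functional as F


class GraphConv(nn.Module):
    """GCN-style propagation: relu(D^-1/2 (A + I) D^-1/2 X W).
    Dense-adjacency version for clarity."""

    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.lin = nn.Linear(in_dim, out_dim)

    def forward(self, x, adj):
        adj_hat = adj + torch.eye(adj.size(0), device=adj.device)  # add self-loops
        deg_inv_sqrt = adj_hat.sum(dim=-1).clamp(min=1).pow(-0.5)  # guard isolated nodes
        norm_adj = deg_inv_sqrt.unsqueeze(-1) * adj_hat * deg_inv_sqrt.unsqueeze(0)
        return F.relu(norm_adj @ self.lin(x))


class TopKPool(nn.Module):
    """Top-k pooling: score nodes with a learnable vector p, keep the
    ceil(ratio * N) highest scorers, gate kept features with tanh(y)
    so p receives gradients, and take the induced subgraph."""

    def __init__(self, dim, ratio=0.8):
        super().__init__()
        self.p = nn.Parameter(torch.randn(dim))
        self.ratio = ratio

    def forward(self, x, adj):
        y = x @ self.p / self.p.norm()              # projection scores, shape (N,)
        k = math.ceil(self.ratio * x.size(0))       # number of nodes to keep
        idx = torch.topk(y, k).indices
        x = x[idx] * torch.tanh(y[idx]).unsqueeze(-1)  # gated features of kept nodes
        adj = adj[idx][:, idx]                         # induced subgraph
        return x, adj


def readout(x):
    """Per-block summary: concatenation of mean- and max-pooled node features."""
    return torch.cat([x.mean(dim=0), x.max(dim=0).values], dim=-1)


class SparseHierarchicalClassifier(nn.Module):
    """Three conv-pool blocks; per-block summaries are summed, then classified.
    A single linear classifier stands in for the final MLP for brevity."""

    def __init__(self, in_dim, hidden, n_classes, ratio=0.8):
        super().__init__()
        dims = [in_dim, hidden, hidden, hidden]
        self.convs = nn.ModuleList(GraphConv(i, o) for i, o in zip(dims, dims[1:]))
        self.pools = nn.ModuleList(TopKPool(hidden, ratio) for _ in range(3))
        self.classifier = nn.Linear(2 * hidden, n_classes)

    def forward(self, x, adj):
        summary = 0
        for conv, pool in zip(self.convs, self.pools):
            x = conv(x, adj)
            x, adj = pool(x, adj)
            summary = summary + readout(x)
        return self.classifier(summary)
```

A hypothetical usage on a random graph (all sizes are illustrative):

```python
model = SparseHierarchicalClassifier(in_dim=18, hidden=64, n_classes=6)
x = torch.randn(30, 18)                   # 30 nodes, 18 features each
adj = (torch.rand(30, 30) > 0.8).float()
adj = ((adj + adj.t()) > 0).float()       # symmetrize the adjacency matrix
logits = model(x, adj)                    # shape: (6,)
```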
Empirical Evaluation
The model's efficacy is demonstrated on standard benchmark datasets: Enzymes, Proteins, D&D, and Collab. The reported results show that the architecture outperforms flat aggregation baselines such as GraphSAGE with global pooling, and is competitive with DiffPool without incurring the latter's quadratic memory demands.
Implications and Future Directions
The approach has practical implications for scalability in graph classification, particularly for the large graphs common in biological data analysis and social network studies. It also lays a foundation for further work on sparse yet expressive graph classifiers, paving the way for more efficient algorithms in graph-based learning.
Future directions might extend the method to dynamically evolving graph structures or incorporate richer node features to improve classification accuracy. There is also potential in adapting the framework to unsupervised or semi-supervised settings, broadening its applicability across domains.
By maintaining sparsity and leveraging adaptive pooling, this research offers a memory-efficient alternative for hierarchical graph classification, contributing to the broader development of GNNs and their applications in machine learning.