Covariant Compositional Networks For Learning Graphs (1801.02144v1)

Published 7 Jan 2018 in cs.LG

Abstract: Most existing neural networks for learning graphs address permutation invariance by conceiving of the network as a message passing scheme, where each node sums the feature vectors coming from its neighbors. We argue that this imposes a limitation on their representation power, and instead propose a new general architecture for representing objects consisting of a hierarchy of parts, which we call Covariant Compositional Networks (CCNs). Here, covariance means that the activation of each neuron must transform in a specific way under permutations, similarly to steerability in CNNs. We achieve covariance by making each activation transform according to a tensor representation of the permutation group, and derive the corresponding tensor aggregation rules that each neuron must implement. Experiments show that CCNs can outperform competing methods on standard graph learning benchmarks.

Citations (160)

Summary

  • The paper introduces CCNs, a novel architecture whose neuron activations transform covariantly under node permutations.
  • It employs tensor representations to capture structural details, outperforming traditional message-passing methods on key graph benchmarks.
  • CCNs demonstrate enhanced flexibility in modeling complex graphs, paving the way for future research in non-Euclidean learning.

Covariant Compositional Networks for Learning Graphs

The paper introduces Covariant Compositional Networks (CCNs) as a novel architecture for learning from graph-structured data. The authors address a critical limitation of existing neural network approaches to graph learning, namely their inability to capture permutation covariance. Rather than employing traditional message-passing schemes such as Message Passing Neural Networks (MPNNs), which aim only for permutation invariance, the paper obtains greater representational power by ensuring that the activations of neurons transform covariantly under permutations.

Summary of Approach

At the core of CCNs is the concept of covariant transformation, similar in principle to steerability in Convolutional Neural Networks (CNNs). In graph learning, permutation of nodes is a common transformation, and most existing architectures deal with this by seeking permutation invariance, typically achieved by summing feature vectors from neighboring nodes. However, this approach discards potentially valuable information about the graph's structure.
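
As a point of reference, the sum-pooling step described above can be written in a few lines. The sketch below is our own illustration (names such as `sum_aggregate` and `invariant_readout` are assumptions, not code from the paper); it shows that summing neighbor features and then pooling over nodes yields a representation that is unchanged when the nodes are relabeled, which is precisely why structural information is averaged out.

```python
import numpy as np

def sum_aggregate(adjacency, features):
    """One message-passing step: each node sums its neighbors' feature vectors."""
    return adjacency @ features  # (n, n) @ (n, d) -> (n, d)

def invariant_readout(adjacency, features):
    """Graph-level representation obtained by summing over all nodes."""
    return sum_aggregate(adjacency, features).sum(axis=0)

# Toy graph: 4 nodes with a random symmetric adjacency matrix and random features.
rng = np.random.default_rng(0)
n, d = 4, 3
A = rng.integers(0, 2, size=(n, n))
A = np.triu(A, 1); A = A + A.T            # symmetric, no self-loops
X = rng.normal(size=(n, d))

# Relabel the nodes with a random permutation matrix P.
perm = rng.permutation(n)
P = np.eye(n)[perm]
A_perm = P @ A @ P.T
X_perm = P @ X

# The pooled representation is identical under relabeling (permutation invariance).
assert np.allclose(invariant_readout(A, X), invariant_readout(A_perm, X_perm))
```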

CCNs instead propose a hierarchical model of parts and employ tensor representations of the permutation group $\mathbb{S}_m$. Each neuron's activation is a tensor of a particular order that must transform according to the corresponding representation, and the authors derive tensor aggregation rules that guarantee this behavior is preserved from layer to layer. Importantly, CCNs outperform traditional methods on graph learning benchmarks, illustrating their potential as a more expressive and powerful alternative to current architectures.
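
To make the covariance condition concrete, the following toy sketch (our own illustration, not code from the paper; the activation $F[i,j] = A[i,j]\,\langle x_i, x_j\rangle$ is an assumed example) checks that a simple second-order, matrix-valued activation transforms as $F \mapsto P F P^\top$ when the nodes are permuted, which is the behavior a second-order covariant neuron is required to exhibit.

```python
import numpy as np

def second_order_activation(adjacency, features):
    """A toy second-order (matrix-valued) activation:
    F[i, j] = A[i, j] * <x_i, x_j>, i.e. edge-gated feature similarity."""
    gram = features @ features.T            # (n, n) matrix of inner products
    return adjacency * gram

rng = np.random.default_rng(1)
n, d = 5, 3
A = rng.integers(0, 2, size=(n, n))
A = np.triu(A, 1); A = A + A.T              # symmetric, no self-loops
X = rng.normal(size=(n, d))

# Relabel the nodes with a random permutation matrix P.
perm = rng.permutation(n)
P = np.eye(n)[perm]

F = second_order_activation(A, X)
F_perm = second_order_activation(P @ A @ P.T, P @ X)

# Covariance: permuting the input permutes both tensor indices of the activation.
assert np.allclose(F_perm, P @ F @ P.T)
```

Summing all the entries of such an activation would collapse it back to a permutation-invariant scalar; the point of the paper's aggregation rules is to keep higher-order, covariant activations intact through the layers and only contract to an invariant at the very end.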

Key Results

The experiments demonstrate that CCNs achieve superior performance over traditional methods on several well-established graph datasets, including the Harvard Clean Energy Project dataset and MUTAG. In particular, CCNs show a distinct advantage on larger and more complex datasets, where constructing robust node representations requires leveraging permutation-covariant structure.

Implications and Future Directions

This work aligns with the broader goal of generalizing neural network models to non-Euclidean domains, leveraging more of the structure inherent in graph data rather than averaging it out. The introduction of covariant representations extends the flexibility and the power of graph neural networks, providing a fertile area for future innovations.

In future developments, CCNs could be expanded or refined to handle dynamic and multi-scale graph data even more effectively. This is particularly relevant in applications like chemoinformatics and bioinformatics, where understanding the interplay between local structural details and global properties is crucial. Additionally, exploring higher-order tensor formulations and integrating CCNs with other graph-based optimization tasks could further enhance their applicability.

The paper marks a significant step toward more expressive graph learning architectures, suggesting a productive direction for future research in graph convolutional networks and their applications in diverse scientific and real-world problems.
