Spectral Networks and Locally Connected Networks on Graphs (1312.6203v3)

Published 21 Dec 2013 in cs.LG, cs.CV, and cs.NE

Abstract: Convolutional Neural Networks are extremely efficient architectures in image and audio recognition tasks, thanks to their ability to exploit the local translational invariance of signal classes over their domain. In this paper we consider possible generalizations of CNNs to signals defined on more general domains without the action of a translation group. In particular, we propose two constructions, one based upon a hierarchical clustering of the domain, and another based on the spectrum of the graph Laplacian. We show through experiments that for low-dimensional graphs it is possible to learn convolutional layers with a number of parameters independent of the input size, resulting in efficient deep architectures.

Citations (4,670)

View on Semantic Scholar

Summary

The paper introduces novel graph convolution methods using spatial and spectral constructions to adapt CNNs to non-Euclidean domains.
It leverages the graph Laplacian eigenbasis for spectral filters, reducing parameter complexity while ensuring spatial localization.
Experimental results on subsampled MNIST and MNIST on the sphere demonstrate enhanced efficiency and robustness for complex structured data.

Spectral Networks and Deep Locally Connected Networks on Graphs: An Overview

The paper "Spectral Networks and Deep Locally Connected Networks on Graphs," authored by Joan Bruna, Wojciech Zaremba, and several collaborators, investigates the extension of Convolutional Neural Networks (CNNs) to domains represented by graphs, thereby addressing a significant gap in adapting CNNs to non-Euclidean spaces. This work is pivotal for advancing machine learning applications to structured data like social networks, 3D meshes, and sensor networks, where conventional CNNs fall short due to the absence of grid-like structures and translational invariance.

Key Contributions

Graph-Based Convolutional Layers: The paper introduces two primary constructions for CNNs on graphs—spatial and spectral constructions. The spatial construction generalizes the notion of locality and pooling from regular grids to arbitrary graphs. It employs multiscale clustering, enabling the definition of locally connected networks with parameters scaling linearly with the number of nodes.
Spectral Construction: This approach exploits the graph Laplacian's spectral properties, analogous to the Fourier transform in Euclidean spaces. This construction allows the definition of convolutional layers in the frequency domain, reducing the parameter count per feature map to be independent of the input size.

Spatial Construction

The spatial construction's essence lies in defining filters over local neighborhoods determined by the graph's edge weights. These filters are compactly supported, significantly reducing the number of parameters compared to fully connected layers. Hierarchical clustering of the graph facilitates the creation of multiresolution representations, analogous to pooling in conventional CNNs. Notably, this construction is efficient with a parameter complexity of $O(S \cdot n)$ , where $S$ is the average size of a node's neighborhood.

Spectral Construction

The spectral construction leverages the graph Laplacian's eigenbasis. Here, filters are defined as multipliers in the Laplacian's frequency domain, akin to traditional Fourier-based convolutions. By utilizing the Laplacian's eigenvectors, convolution operations become diagonal in the spectral domain, permitting a reduction in parameter complexity. The smooth spectral multipliers further enhance this by ensuring spatial localization of the filters.

Numerical Experiments and Results

The proposed architectures were validated on variations of the MNIST dataset, including subsampled versions and representations on a 3D sphere:

Subsampled MNIST: In this experiment, a 28x28 grid was subsampled to 400 coordinates. Spatially-aware architectures outperformed fully connected networks, achieving an error rate as low as 1.3% with significantly fewer parameters. The smooth spectral construction, which introduced parameter efficiency, also performed competitively.
MNIST on the Sphere: Distortions introduced by random projections of MNIST digits onto a 3D unit sphere demonstrated the robustness of the graph-based CNNs. The spectral and smooth spectral constructions showed promising performance, particularly when handling rotations and achieving spatial localization through spectral smoothness, despite the inherently non-Euclidean nature of the data.

Implications and Future Directions

The presented methodologies underscore the potential for graph-based architectures to generalize traditional CNNs to non-Euclidean spaces effectively. These constructions can leverage the intrinsic geometry of the data, providing powerful tools for applications ranging from social network analysis to 3D object recognition.

Several open research avenues emerge from this work:

Enhanced Multiscale Clustering: Developing clustering methods that respect the graph's Laplacian structure could bridge the spatial and spectral constructions, enabling more efficient and interpretable models.
Extension to Dynamic Graphs: Applying these methods to dynamic settings where the graph structure evolves over time presents a challenging yet impactful direction.
Broader Applications: Exploring these techniques in recommendation systems, biological networks, and other domains having irregular structures can significantly impact practical machine learning applications.

This paper sets a foundation for future work in extending convolutional operations to graph-based structures, providing a robust framework for handling complex, structured data beyond the capabilities of conventional CNNs. The methodologies presented promise significant advancements in both theoretical insights and practical efficiency for graph-based deep learning.

PDF Markdown