Local Spectral Graph Convolution for Point Set Feature Learning
The paper "Local Spectral Graph Convolution for Point Set Feature Learning" by Chu Wang et al. introduces an innovative approach to feature learning on 3D point clouds using spectral graph convolution. This work addresses the challenge of learning from unstructured 3D point clouds by integrating spectral graph convolution with a novel pooling strategy to better capture structural information in local neighborhoods of points. The authors propose a method that combines local graph convolution on a nearest-neighbor graph with recursive cluster pooling, yielding significant improvements in feature learning for both classification and segmentation tasks on 3D point data.
The paper critiques existing methods such as PointNet++ for processing the points in a local neighborhood independently, thereby ignoring their mutual relationships and spatial distribution. In response, it employs spectral graph convolution, which learns features jointly from all points in a neighborhood while accounting for their geometric and topological layout. A nearest-neighbor graph is constructed around each point, and convolution is carried out in the spectral domain, where neighborhood features are blended in the eigenspace of the graph Laplacian. Unlike prior spectral approaches, which require a pre-computed Laplacian and coarsening hierarchy for a single fixed graph, here the Laplacian and pooling hierarchy are computed on the fly for each local neighborhood, making end-to-end training practical on large point-cloud datasets.
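To make the local spectral filtering step concrete, the following is a minimal NumPy sketch rather than the authors' implementation: it builds a Gaussian-affinity graph over one point's k nearest neighbors, eigendecomposes the resulting graph Laplacian, and modulates the neighborhood features per frequency before transforming back. The kernel width `sigma` and the per-frequency weights `spectral_weights` are hypothetical stand-ins for quantities that would be learned in practice.

```python
import numpy as np

def local_spectral_conv(points, features, spectral_weights, sigma=1.0):
    """Filter neighborhood features in the eigenspace of the graph Laplacian.

    points:           (k, 3) xyz coordinates of one point's k nearest neighbors
    features:         (k, c) input features of those neighbors
    spectral_weights: (k,)   per-frequency multipliers (assumed to be learned)
    """
    # Gaussian-affinity adjacency matrix from pairwise squared distances.
    d2 = np.sum((points[:, None, :] - points[None, :, :]) ** 2, axis=-1)
    W = np.exp(-d2 / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)

    # Unnormalized graph Laplacian L = D - W and its eigendecomposition.
    D = np.diag(W.sum(axis=1))
    L = D - W
    _, U = np.linalg.eigh(L)          # columns of U form the spectral basis

    # Graph Fourier transform, per-frequency modulation, inverse transform.
    f_hat = U.T @ features            # (k, c) spectral coefficients
    f_hat = spectral_weights[:, None] * f_hat
    return U @ f_hat                  # filtered features back in the vertex domain
```

In a full network this modulation would be a learned filter applied batch-wise across many neighborhoods; the sketch only isolates the transform-filter-inverse pattern for a single neighborhood.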
A key contribution of the paper is its recursive cluster pooling method, which aggregates information by repeatedly clustering the spectral coordinates of points within a local neighborhood and pooling within each cluster. This contrasts with conventional max pooling, which collapses a neighborhood to a single feature and can discard fine-grained local structure. By using the Fiedler vector to drive the spectral clustering, the pooling step keeps features from distinct components of a neighborhood separate for longer, preserving information that a single global max would lose.
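A hedged sketch of this idea follows, again in NumPy and reusing the Gaussian-affinity Laplacian from the convolution sketch above. The two-way split on the sign of the Fiedler vector, the fixed recursion cutoff `min_size`, and the final mean over sub-cluster features are simplifying assumptions, not the paper's exact alternation of pooling operations across the cluster hierarchy.

```python
import numpy as np

def fiedler_cluster_pool(points, features, min_size=2, sigma=1.0):
    """Recursively split a neighborhood via the Fiedler vector, pooling per cluster.

    Returns a single (c,) feature vector aggregating the whole neighborhood.
    """
    k = points.shape[0]
    if k <= min_size:
        return features.max(axis=0)   # base case: pool the small cluster directly

    # Graph Laplacian of the neighborhood (same construction as the conv sketch).
    d2 = np.sum((points[:, None, :] - points[None, :, :]) ** 2, axis=-1)
    W = np.exp(-d2 / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)
    L = np.diag(W.sum(axis=1)) - W

    # The Fiedler vector (second-smallest eigenvector) gives a two-way spectral cut.
    _, U = np.linalg.eigh(L)
    fiedler = U[:, 1]
    left = fiedler < 0
    right = ~left
    if left.all() or right.all():     # degenerate cut: stop recursing
        return features.max(axis=0)

    # Pool each sub-cluster recursively, then aggregate the two halves.
    pooled = np.stack([
        fiedler_cluster_pool(points[left], features[left], min_size, sigma),
        fiedler_cluster_pool(points[right], features[right], min_size, sigma),
    ])
    return pooled.mean(axis=0)
```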
The empirical results underscore the efficacy of the proposed method. On the ModelNet40, McGill Shape Benchmark, and MNIST datasets, the authors report improved accuracy and state-of-the-art results for recognition tasks. In segmentation on the ShapeNet and ScanNet datasets, the method likewise performs strongly, capturing intricate local geometric detail. Runtime comparable to PointNet++ further underscores the method's practicality.
The approach holds significant implications for the field of 3D computer vision, particularly in contexts where depth sensors produce unorganized point clouds. Not only does this method advance the state-of-the-art in point set feature learning, but it also suggests potential for further exploration into spectral-based techniques on graph representations of data. As depth sensing technology becomes ubiquitous, the practical benefits of such methods for real-world applications will continue to grow, necessitating ongoing research into enhancing their scalability and robustness. Future developments may explore learning architectures that more deeply integrate spectral methods, perhaps extending them to other domains such as video or temporal point cloud data, thereby broadening the applicability of spectral graph convolution in deep learning paradigms.