Linking Points With Labels in 3D: A Review of Point Cloud Semantic Segmentation (1908.08854v3)

Published 23 Aug 2019 in cs.CV, cs.LG, and eess.IV

Abstract: 3D Point Cloud Semantic Segmentation (PCSS) is attracting increasing interest, due to its applicability in remote sensing, computer vision and robotics, and due to the new possibilities offered by deep learning techniques. In order to provide a needed up-to-date review of recent developments in PCSS, this article summarizes existing studies on this topic. Firstly, we outline the acquisition and evolution of the 3D point cloud from the perspective of remote sensing and computer vision, as well as the published benchmarks for PCSS studies. Then, traditional and advanced techniques used for Point Cloud Segmentation (PCS) and PCSS are reviewed and compared. Finally, important issues and open questions in PCSS studies are discussed.

Citations (207)

View on Semantic Scholar

Summary

The paper comprehensively reviews state-of-the-art techniques in 3D point cloud semantic segmentation, emphasizing both deep learning and traditional methods.
It details a range of segmentation approaches—including unsupervised, supervised, and hybrid models—and addresses challenges like noise robustness.
The review underscores the significance of benchmark datasets and outlines future directions to enhance model generalization in diverse applications.

Linking Points With Labels in 3D: A Review of Point Cloud Semantic Segmentation

The paper "Linking Points With Labels in 3D: A Review of Point Cloud Semantic Segmentation" presents a comprehensive review of the advances in the field of Point Cloud Semantic Segmentation (PCSS). This work addresses the growing interest in PCSS driven by applications in remote sensing, computer vision, and robotics, alongside the promise of utilizing deep learning techniques to enhance these applications. The paper summarizes the state-of-the-art techniques and challenges in PCSS, providing a critical resource for researchers engaged in this domain.

PCSS capitalizes on the potential of 3D point clouds, which can be directly acquired from sensors with distance measurability, or generated from multi-view imagery. Point clouds provide a means to bridge the virtual and real world by offering a structural representation of 3D spaces vital for 3D applications such as urban planning, BIM, SLAM, and automated driving. Despite broad applicability, PCSS presents unique challenges within remote sensing, computer vision, and other fields due to the 3D nature of data.

Acquisition and Benchmarks of 3D Point Clouds

Point clouds can be acquired through various methods including Image-derived methods, LiDAR systems, RGB-D cameras, and Synthetic Aperture Radar (SAR) systems. The differences in data characteristics, such as point density and accuracy, ensuing from these methods, have been thoroughly accounted for in the paper. The review also addresses notable benchmark datasets, such as Semantic3D.net and S3DIS, which play a critical role in improving and evaluating PCSS methodologies.

Segmentation Techniques in Point Clouds

The methodologies undertaken in point cloud segmentation can be categorized into unsupervised techniques—edge-based, region-growing, model-fitting, and clustering-based models. These approaches have been primarily directed at grouping spatially coherent points into surfaces or structures without explicit semantic information. The paper elaborates on how different techniques leverage geometric constraints or statistical models to partition point clouds into meaningful regions.

Edge-based segmentation utilizes abrupt intensity changes, while region-growing methods rely on spatial proximity and surface characteristics to merge similar nearby points. Model fitting techniques like RANSAC focus on fitting geometric shapes to point cloud data. Clustering-based methods, including k-means, fuzzy clustering, and mean-shift clustering, segment data by grouping points with shared features, effectively dealing with more irregular data structures.

Supervised and Deep Learning Approaches

The rise of supervised learning techniques, including support vector machines and random forests, adjusted to extract and classify each point cloud into semantic entities, reflects their vital role in PCSS. Moreover, regularization using methods like CRFs has been shown to significantly ameliorate noise and smoothen semantic labeling.

Deep learning methods have introduced a paradigm shift in dealing with 3D data through point-based networks such as PointNet, voxel-based approaches, and multiview frameworks. These networks learn rich hierarchical feature representations directly from raw point cloud data. The flexibility of directly processing point clouds using architectures such as PointNet reflects a significant advancement—yet challenges remain in developing networks resilient to noise and in the generalization to diverse datasets.

Future Directions and Challenges

While deep learning has greatly enhanced the accuracy and efficiency of PCSS tasks, challenges persist. The complexity and unique noise characteristics of various point cloud datasets necessitate continued research into noise-robust and context-aware algorithms. Moreover, expanding benchmark datasets to include greater diversity in geographical and structural features could enhance model robustness across different remote sensing applications. The development of hybrid models that integrate both supervised learning and geometric constraints might offer a beneficial path forward in this complex field.

In conclusion, this paper provides a thorough appraisal of existing techniques for point cloud segmentation and semantic labeling. Diehard research efforts in this area continue to yield promising improvements, yet the demand for domain-specific advancements and greater interpretability of deep learning models remains crucial for real-world applicability.

PDF Markdown