Scalable 3D Panoptic Segmentation As Superpoint Graph Clustering (2401.06704v2)
Abstract: We introduce a highly efficient method for panoptic segmentation of large 3D point clouds by redefining this task as a scalable graph clustering problem. This approach can be trained using only local auxiliary tasks, thereby eliminating the resource-intensive instance-matching step during training. Moreover, our formulation can easily be adapted to the superpoint paradigm, further increasing its efficiency. This allows our model to process scenes with millions of points and thousands of objects in a single inference. Our method, called SuperCluster, achieves a new state-of-the-art panoptic segmentation performance for two indoor scanning datasets: $50.1$ PQ ($+7.8$) for S3DIS Area~5, and $58.7$ PQ ($+25.2$) for ScanNetV2. We also set the first state-of-the-art for two large-scale mobile mapping benchmarks: KITTI-360 and DALES. With only $209$k parameters, our model is over $30$ times smaller than the best-competing method and trains up to $15$ times faster. Our code and pretrained models are available at https://github.com/drprojects/superpoint_transformer.
- SLIC superpixels compared to state-of-the-art superpixel methods. TPAMI, 2012.
- A survey on deep-learning-based LiDAR 3D object detection for autonomous driving. Sensors, 2022.
- 3D semantic parsing of large-scale indoor spaces. CVPR, 2016.
- 3D Scene Graph: A structure for unified semantics, 3D space, and camera. ICCV, 2019.
- 4D panoptic LiDAR segmentation. CVPR, 2021.
- Alexandre Boulch. ConvPoint: Continuous convolutions for point cloud processing. Computers & Graphics, 2020.
- Fast approximate energy minimization via graph cuts. TPAMI, 2001.
- End-to-end object detection with transformers. ECCV, 2020.
- Torch-Points3D: A modular multi-task framework for reproducible deep learning on 3D point clouds. 3DV, 2020.
- Hierarchical aggregation for 3D instance segmentation. ICCV, 2021.
- Masked-attention mask transformer for universal image segmentation. CVPR, 2022.
- 4D spatio-temporal convnets: Minkowski convolutional neural networks. CVPR, 2019.
- ScanNet: Richly-annotated 3D reconstructions of indoor scenes. CVPR, 2017.
- 3D-MPA: Multi-proposal aggregation for 3D semantic instance segmentation. CVPR, 2020.
- Panoptic nuScenes: A large-scale benchmark for LiDAR panoptic segmentation and tracking. Robotics and Automation Letters, 2022.
- Deep learning for 3D point clouds: A survey. TPAMI, 2020.
- OccuSeg: Occupancy-aware 3D instance segmentation. CVPR, 2020.
- Large scale mapping of forest attributes using heterogeneous sets of airborne laser scanning and national forest inventory data. Forest Ecosystems, 2021.
- DyCo3D: Robust instance segmentation of 3D point clouds through dynamic convolution. CVPR, 2021a.
- Deep learning based 3D segmentation: A survey. arXiv preprint arXiv:2103.05423, 2021b.
- Superpoint network for point cloud oversegmentation. ICCV, 2021.
- DETRs with hybrid matching. CVPR, 2023.
- PointGroup: Dual-set point grouping for 3D instance segmentation. CVPR, 2020.
- Industrial applications of digital twins. Philosophical Transactions of the Royal Society A, 2021.
- Region-enhanced feature learning for scene semantic segmentation. arXiv preprint arXiv:2304.07486, 2023.
- Panoptic segmentation. CVPR, 2019.
- What energy functions can be minimized via graph cuts? TPAMI, 2004.
- Robust and efficient surface reconstruction from range data. Computer Graphics Forum, 2009.
- Creating large-scale city models from 3D-point clouds: A robust approach with hybrid representation. ICCV, 2012.
- 3D instance segmentation via multi-task metric learning. ICCV, 2019.
- Stratified transformer for 3D point cloud segmentation. CVPR, 2022.
- Point cloud oversegmentation with graph-structured deep metric learning. CVPR, 2019.
- Cut Pursuit: Fast algorithms to learn piecewise constant functions. AISTATS, 2016.
- Cut Pursuit: Fast algorithms to learn piecewise constant functions on general weighted graphs. SIAM Journal on Imaging Sciences, 2017.
- Large-scale point cloud semantic segmentation with superpoint graphs. CVPR, 2018.
- A structured regularization framework for spatially smoothing semantic labelings of 3D point clouds. ISPRS Journal of Photogrammetry and Remote Sensing, 2017.
- Yvan G Leclerc. Constructing simple stable descriptions for image partitioning. IJCV, 1989.
- Instance segmentation in 3D scenes using semantic superpoint tree networks. CVPR, 2021.
- KITTI-360: A novel dataset and benchmarks for urban scene understanding in 2D and 3D. TPAMI, 2022.
- Toward better boundary preserved supervoxel segmentation for 3D point clouds. ISPRS Journal of Photogrammetry and Remote Sensing, 2018.
- Online segmentation of LiDAR sequences: Dataset and algorithm. ECCV, 2022.
- Rectifier nonlinearities improve neural network acoustic models. ICML, 2013.
- EfficientPS: Efficient panoptic segmentation. IJCV, 2021.
- Optimal approximations by piecewise smooth functions and associated variational problems. Communications on Pure and Applied Mathematics, 1989.
- PanopticFusion: Online volumetric semantic mapping at the level of stuff and things. IROS, 2019.
- ISNBET: A 3D point cloud instance segmentation network with instance-aware sampling and box-aware dynamic convolution. CVPR, 2023.
- Scaling digital twins from the artisanal to the industrial. Nature Computational Science, 2021.
- A socio-technical perspective on urban analytics: The case of city-scale digital twins. Journal of Urban Technology, 2021.
- Voxel cloud connectivity segmentation-supervoxels for point clouds. CVPR, 2013.
- Renfrey Burnard Potts. Some generalized order-disorder transformations. Mathematical Proceedings of the Cambridge Philosophical Society, 1952.
- PointNet++: Deep hierarchical feature learning on point sets in a metric space. NeurIPS, 2017.
- PointNeXt: Revisiting PointNet++ with improved training and scaling strategies. NeurIPS, 2022.
- Panoptic segmentation in industrial environments using synthetic and real data. ICIAP, 2022.
- Parallel cut pursuit for minimization of the graph total variation. ICML Workshop on Graph Reasoning, 2019.
- Learning multi-view aggregation in the wild for large-scale 3D semantic segmentation. CVPR, 2022.
- Efficient 3D semantic segmentation with superpoint transformer. ICCV, 2023.
- Mask3D: Mask transformer for 3D semantic instance segmentation. ICRA, 2023.
- DALES Objects: A large scale benchmark dataset for instance segmentation in aerial LiDAR. IEEE Access, 2021.
- Large-scale classification of water areas using airborne topographic LiDAR data. Remote sensing of environment, 2013.
- Superpoint transformer for 3D scene instance segmentation. AAAI, 2023.
- KPConv: Flexible and deformable convolution for point clouds. ICCV, 2019.
- Segment-Fusion: Hierarchical context fusion for robust 3D semantic segmentation. CVPR, 2022.
- DALES: A large-scale aerial LiDAR data set for semantic segmentation. CVPR Workshops, 2020.
- SoftGroup for 3D instance segmentation on point clouds. CVPR, 2022.
- Learning 3D semantic scene graphs from 3D indoor reconstructions. CVPR, 2020.
- Urban 3D modeling with mobile laser scanning: A review. Virtual Reality & Intelligent Hardware, 2020.
- Peng-Shuai Wang. OctFormer: Octree-based transformers for 3D point clouds. SIGGRAPH, 2023.
- Window normalization: Enhancing point cloud understanding by unifying inconsistent point densities. 2022.
- SceneGraphFusion: Incremental 3D scene graph prediction from RGB-D sequences. CVPR, 2021.
- Point Transformer V2: Grouped vector attention and partition-based pooling. NeurIPS, 2022.
- A review of panoptic segmentation for mobile mapping point clouds. arXiv preprint arXiv:2304.13980, 2023.
- LiDAR applications to estimate forest biomass at individual tree scale: Opportunities, challenges and future perspectives. Forests, 2021.
- From LiDAR point cloud towards digital twin city: Clustering city objects based on gestalt principles. ISPRS Journal of Photogrammetry and Remote Sensing, 2020.
- Urban land cover classification using airborne LiDAR data: A review. Remote Sensing of Environment, 2015.
- Learning object bounding boxes for 3D instance segmentation on point clouds. NeurIPS, 2019.
- A comprehensive survey of LiDAR-Based 3D Object detection methods with deep learning for autonomous driving. Computers & Graphics, 2021.
- PolarNet: An improved grid representation for online LiDAR point clouds semantic segmentation. CVPR, 2020.
- Point Transformer. ICCV, 2021a.
- A technical survey and evaluation of traditional point cloud clustering methods for LiDAR panoptic segmentation. ICCV Workshop, 2021b.
- Joint 3D instance segmentation and object detection for autonomous driving. CVPR, 2020.
- Panoptic-PolarNet: Proposal-free LiDAR point cloud panoptic segmentation. CVPR, 2021.
- Cylindrical and asymmetrical 3D convolution networks for LiDAR segmentation. CVPR, 2021.