Divide and Conquer: 3D Point Cloud Instance Segmentation With Point-Wise Binarization (2207.11209v4)
Abstract: Instance segmentation on point clouds is crucially important for 3D scene understanding. Most SOTAs adopt distance clustering, which is typically effective but does not perform well in segmenting adjacent objects with the same semantic label (especially when they share neighboring points). Due to the uneven distribution of offset points, these existing methods can hardly cluster all instance points. To this end, we design a novel divide-and-conquer strategy named PBNet that binarizes each point and clusters them separately to segment instances. Our binary clustering divides offset instance points into two categories: high and low density points (HPs vs. LPs). Adjacent objects can be clearly separated by removing LPs, and then be completed and refined by assigning LPs via a neighbor voting method. To suppress potential over-segmentation, we propose to construct local scenes with the weight mask for each instance. As a plug-in, the proposed binary clustering can replace traditional distance clustering and lead to consistent performance gains on many mainstream baselines. A series of experiments on ScanNetV2 and S3DIS datasets indicate the superiority of our model. In particular, PBNet ranks first on the ScanNetV2 official benchmark challenge, achieving the highest mAP. Code will be available publicly at https://github.com/weiguangzhao/PBNet.
- 3d semantic parsing of large-scale indoor spaces. In CVPR, pages 1534–1543, 2016.
- Hybrid task cascade for instance segmentation. In CVPR, pages 4974–4983, 2019.
- Hierarchical aggregation for 3d instance segmentation. In ICCV, pages 15467–15476, 2021.
- Fully convolutional geometric features. In ICCV, pages 8958–8966, 2019.
- Scannet: Richly-annotated 3d reconstructions of indoor scenes. In CVPR, pages 5828–5839, 2017.
- 3dmv: Joint 3d-multi-view prediction for 3d semantic scene segmentation. In ECCV, pages 452–468, 2018.
- Bundlefusion: Real-time globally gonsistent 3d reconstruction using on-the-fly surface reintegration. ACM TOG, 36(4):1, 2017.
- Learning regional purity for instance segmentation on 3d point clouds. In ECCV, pages 56–72. Springer, 2022.
- 3d-mpa: Multi-proposal aggregation for 3d semantic instance segmentation. In CVPR, pages 9031–9040, 2020.
- Density-based spatial clustering of applications with noise. In KDD, volume 240, page 6, 1996.
- Deep learning for 3d point clouds: A survey. PAMI, 43(12):4338–4364, 2020.
- Occuseg: Occupancy-aware 3d instance segmentation. In CVPR, pages 2940–2949, 2020.
- Mask r-cnn. In ICCV, pages 2961–2969, 2017.
- Dyco3d: Robust instance segmentation of 3d point clouds through dynamic convolution. In CVPR, pages 354–363, 2021.
- Pointinst3d: Segmenting 3d instances by points. In ECCV, pages 286–302. Springer, 2022.
- 3d-sis: 3d semantic instance segmentation of rgb-d scans. In CVPR, pages 4421–4430, 2019.
- Bidirectional projection network for cross dimensional scene understanding. In CVPR, pages 14373–14382, 2021.
- Mask scoring r-cnn. In CVPR, pages 6409–6418, 2019.
- Spatial transformer networks. In NeurIPS, pages 2017–2025, 2015.
- Acquisition of localization confidence for accurate object detection. In ECCV, pages 784–799, 2018.
- Pointgroup: Dual-set point grouping for 3d instance segmentation. In CVPR, pages 4867–4876, 2020.
- Adam: A method for stochastic optimization. In ICLR, page n.pag, 2015.
- Virtual multi-view fusion for 3f semantic segmentation. In ECCV, pages 518–535, 2020.
- 3d instance segmentation via multi-task metric learning. In ICCV, pages 9256–9266, 2019.
- Gs3d: An efficient 3d object detection framework for autonomous driving. In CVPR, pages 1019–1028, 2019.
- Instance segmentation in 3d scenes using semantic superpoint tree networks. In CVPR, pages 2783–2792, 2021.
- Hida: Towards holistic indoor understanding for the visually impaired via semantic instance segmentation with a wearable solid-state lidar sensor. In ICCV, pages 1780–1790, 2021.
- Faster r-cnn: Towards real-time object detection with region proposal networks. In NeurIPS, pages 91–99, 2015.
- Path aggregation network for instance segmentation. In CVPR, pages 8759–8768, 2018.
- Sgdr: Stochastic gradient descent with warm restarts. In ICLR, page n.pag, 2016.
- Voxnet: A 3d convolutional neural network for real-time object recognition. In PR, pages 922–928, 2015.
- Mix3d: Out-of-context data augmentation for 3d scenes. In 3DV, pages 116–125, 2021.
- Deep hough voting for 3d object detection in point clouds. In ICCV, pages 9277–9286, 2019.
- Pointnet: Deep learning on point sets for 3d classification and segmentation. In CVPR, pages 652–660, 2017.
- Pointnet++: Deep hierarchical feature learning on point sets in a metric space. In NeurIPS, pages 5099–5108, 2017.
- Fully-convolutional point networks for large-scale point clouds. In ECCV, pages 596–611, 2018.
- Octnet: Learning deep 3d representations at high resolutions. In CVPR, pages 3577–3586, 2017.
- U-net: Convolutional networks for biomedical image segmentation. In MICCAI, pages 234–241, 2015.
- Multi-view convolutional neural networks for 3d shape recognition. In ICCV, pages 945–953, 2015.
- Big data time series forecasting based on nearest neighbours distributed computing with spark. Knowledge-Based Systems, 161:12–25, 2018.
- Softgroup for 3d instance segmentation on 3d point clouds. In CVPR, 2022.
- Sgpn: Similarity group proposal network for 3d point cloud instance segmentation. In CVPR, pages 2569–2578, 2018.
- Dynamic graph cnn for learning on point clouds. ACM TOG, 38(5):1–12, 2019.
- Pointconv: Deep convolutional networks on 3d point clouds. In CVPR, pages 9621–9630, 2019.
- 3d instances as 1d kernels. In ECCV, pages 235–252, 2022.
- Mlcvnet: Multi-level context votenet for 3d object detection. In CVPR, pages 10447–10456, 2020.
- Learning object bounding boxes for 3d instance segmentation on point clouds. In NeurIPS, pages 6737–6746, 2019.
- Towards deeper and better multi-view feature fusion for 3d semantic segmentation. arXiv preprint arXiv:2212.06682, 2022.
- Gspn: Generative shape proposal network for 3d instance segmentation in point cloud. In CVPR, pages 3947–3956, 2019.
- Point cloud instance segmentation using probabilistic embeddings. In CVPR, pages 8883–8892, 2021.
- Ml-knn: A lazy learning approach to multi-label learning. PR, 40(7):2038–2048, 2007.
- K-net: Towards unified image segmentation. In NeurIPS, pages 10326–10338, 2021.
- Maskgroup: Hierarchical point grouping and masking for 3d instance segmentation. In ICME, pages 1–6, 2022.
- Weiguang Zhao (10 papers)
- Yuyao Yan (12 papers)
- Chaolong Yang (5 papers)
- Jianan Ye (7 papers)
- Xi Yang (160 papers)
- Kaizhu Huang (95 papers)