CurveCloudNet: Processing Point Clouds with 1D Structure (2303.12050v2)
Abstract: Modern depth sensors such as LiDAR operate by sweeping laser beams across the scene, resulting in a point cloud with notable 1D curve-like structures. In this work, we introduce a new point cloud processing scheme and backbone, called CurveCloudNet, which takes advantage of the curve-like structure inherent to these sensors. While existing backbones discard the rich 1D traversal patterns and rely on generic 3D operations, CurveCloudNet parameterizes the point cloud as a collection of polylines (dubbed a "curve cloud"), establishing a local surface-aware ordering on the points. By reasoning along curves, CurveCloudNet captures lightweight curve-aware priors that let it operate efficiently and accurately in several diverse 3D environments. We evaluate CurveCloudNet on multiple synthetic and real datasets that differ in 3D size and structure. We demonstrate that CurveCloudNet outperforms both point-based and sparse-voxel backbones in various segmentation settings, notably scaling to large scenes better than point-based alternatives while exhibiting improved single-object performance over sparse-voxel alternatives. In all, CurveCloudNet is an efficient and accurate backbone that can handle a larger variety of 3D environments than past works.
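To make the "curve cloud" idea concrete, here is a minimal sketch of one way an ordered laser sweep could be grouped into polylines. It assumes the points arrive in the sensor's traversal order and starts a new polyline whenever two consecutive points are farther apart than a threshold. The function name `split_into_polylines` and the `max_gap` parameter are hypothetical illustrations; the abstract does not specify how CurveCloudNet actually constructs its curves.

```python
from math import dist


def split_into_polylines(points, max_gap):
    """Group scan-ordered 3D points into polylines.

    points:  sequence of (x, y, z) tuples in sensor traversal order.
    max_gap: hypothetical distance threshold (not taken from the paper);
             a gap larger than this between consecutive points starts
             a new polyline.
    """
    polylines = []
    current = []
    for p in points:
        # Break the curve when the jump from the previous point is too large.
        if current and dist(current[-1], p) > max_gap:
            polylines.append(current)
            current = []
        current.append(p)
    if current:
        polylines.append(current)
    return polylines


# Toy sweep: three closely spaced points, a large jump, then two more.
scan = [(0.0, 0, 0), (0.1, 0, 0), (0.2, 0, 0), (5.0, 0, 0), (5.1, 0, 0)]
curves = split_into_polylines(scan, max_gap=0.5)
print([len(c) for c in curves])  # [3, 2]
```

Each resulting polyline keeps the 1D ordering of its points, which is the property the abstract describes curve-aware operations as relying on.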