Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
126 tokens/sec
GPT-4o
28 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Point2Point : A Framework for Efficient Deep Learning on Hilbert sorted Point Clouds with applications in Spatio-Temporal Occupancy Prediction (2306.16306v1)

Published 28 Jun 2023 in cs.CV

Abstract: The irregularity and permutation invariance of point cloud data pose challenges for effective learning. Conventional methods for addressing this issue involve converting raw point clouds to intermediate representations such as 3D voxel grids or range images. While such intermediate representations solve the problem of permutation invariance, they can result in significant loss of information. Approaches that do learn on raw point clouds either have trouble in resolving neighborhood relationships between points or are too complicated in their formulation. In this paper, we propose a novel approach to representing point clouds as a locality preserving 1D ordering induced by the Hilbert space-filling curve. We also introduce Point2Point, a neural architecture that can effectively learn on Hilbert-sorted point clouds. We show that Point2Point shows competitive performance on point cloud segmentation and generation tasks. Finally, we show the performance of Point2Point on Spatio-temporal Occupancy prediction from Point clouds.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (39)
  1. TensorFlow: Large-scale machine learning on heterogeneous systems, 2015. Software available from tensorflow.org.
  2. A comparative analysis of some two-dimensional orderings. International Journal of Geographical Information Systems, 4(1):21–31, 1990.
  3. Representation learning and adversarial generation of 3d point clouds. CoRR, abs/1707.02392, 2017.
  4. Salsanet: Fast road and vehicle segmentation in lidar point clouds for autonomous driving. CoRR, abs/1909.08291, 2019.
  5. 3d semantic parsing of large-scale indoor spaces. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1534–1543, 2016.
  6. Arthur R. Butz. Convergence with hilbert’s space filling curve. Journal of Computer and System Sciences, 3(2):128–146, 1969.
  7. Shapenet: An information-rich 3d model repository. CoRR, abs/1512.03012, 2015.
  8. Pointnet: Deep learning on point sets for 3d classification and segmentation. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 77–85, 2017.
  9. RangeSeg: Range-aware real time segmentation of 3d LiDAR point clouds. IEEE Transactions on Intelligent Vehicles, 7(1):93–101, mar 2022.
  10. Marco Cuturi. Sinkhorn distances: Lightspeed computation of optimal transport. In C.J. Burges, L. Bottou, M. Welling, Z. Ghahramani, and K.Q. Weinberger, editors, Advances in Neural Information Processing Systems, volume 26. Curran Associates, Inc., 2013.
  11. OTA: optimal transport assignment for object detection. CoRR, abs/2103.14259, 2021.
  12. Vision meets robotics: The kitti dataset. International Journal of Robotics Research (IJRR), 2013.
  13. AtlasNet: A Papier-Mâché Approach to Learning 3D Surface Generation. In Proceedings IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2018.
  14. Semantic3d.net: A new large-scale point cloud classification benchmark. CoRR, abs/1704.03847, 2017.
  15. Squeeze-and-excitation networks. CoRR, abs/1709.01507, 2017.
  16. Randla-net: Efficient semantic segmentation of large-scale point clouds. CoRR, abs/1911.11236, 2019.
  17. Fast hilbert sort algorithm without using hilbert indices. pages 259–267, 10 2016.
  18. Fast hilbert sort algorithm without using hilbert indices. In SISAP, 2016.
  19. H. V. Jagadish. Linear clustering of objects with multiple attributes. SIGMOD Rec., 19(2):332–342, may 1990.
  20. Adam: A method for stochastic optimization, 2014.
  21. Pointcnn. CoRR, abs/1801.07791, 2018.
  22. 3d convolutional neural networks for landing zone detection from lidar. In 2015 IEEE International Conference on Robotics and Automation (ICRA), pages 3471–3478, 2015.
  23. Voxnet: A 3d convolutional neural network for real-time object recognition. In 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 922–928, 2015.
  24. Self-supervised point cloud prediction using 3d spatio-temporal convolutional networks. CoRR, abs/2110.04076, 2021.
  25. Multi-step prediction of occupancy grid maps with recurrent neural networks. CoRR, abs/1812.09395, 2018.
  26. Analysis of the clustering properties of the hilbert space-filling curve. IEEE Transactions on Knowledge and Data Engineering, 13(1):124–141, 2001.
  27. Tearingnet: Point cloud autoencoder to learn topology-friendly representations. CoRR, abs/2006.10187, 2020.
  28. Volumetric and multi-view cnns for object classification on 3d data. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 5648–5656, 2016.
  29. Pointnet++: Deep hierarchical feature learning on point sets in a metric space. CoRR, abs/1706.02413, 2017.
  30. Hans Sagan. A three-dimensional hilbert curve. International Journal of Mathematical Education in Science and Technology, 24(4):541–545, 1993.
  31. Hans Sagan. Space-Filling Curves. Springer New York, 1994.
  32. Kpconv: Flexible and deformable convolution for point clouds. CoRR, abs/1904.08889, 2019.
  33. R-PCC: A baseline for range image-based point cloud compression. CoRR, abs/2109.07717, 2021.
  34. Dynamic graph CNN for learning on point clouds. CoRR, abs/1801.07829, 2018.
  35. Spidercnn: Deep learning on point sets with parameterized convolutional filters. CoRR, abs/1803.11527, 2018.
  36. Foldingnet: Point cloud auto-encoder via deep grid deformation. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 206–215, 2018.
  37. Bisenet V2: bilateral network with guided aggregation for real-time semantic segmentation. CoRR, abs/2004.02147, 2020.
  38. Bisenet: Bilateral segmentation network for real-time semantic segmentation. CoRR, abs/1808.00897, 2018.
  39. Shellnet: Efficient point cloud convolutional neural networks using concentric shells statistics. CoRR, abs/1908.06295, 2019.

Summary

We haven't generated a summary for this paper yet.