
Low-Resource White-Box Semantic Segmentation of Supporting Towers on 3D Point Clouds via Signature Shape Identification (2306.07809v1)

Published 13 Jun 2023 in cs.CV, cs.LG, and math.GT

Abstract: Research in 3D semantic segmentation has improved performance metrics such as IoU by scaling model complexity and computational resources, leaving behind researchers and practitioners who (1) cannot access the necessary resources and (2) need transparency into the model's decision mechanisms. In this paper, we propose SCENE-Net, a low-resource white-box model for 3D point cloud semantic segmentation. SCENE-Net identifies signature shapes in the point cloud via group equivariant non-expansive operators (GENEOs), providing intrinsic geometric interpretability. Our training time on a laptop is 85 min, and our inference time is 20 ms. SCENE-Net has 11 trainable geometrical parameters and requires less data than black-box models. SCENE-Net offers robustness to noisy labeling and data imbalance and has IoU comparable to state-of-the-art methods. With this paper, we release a 40 000 km labeled dataset of rural terrain point clouds and our code implementation.
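
For readers unfamiliar with the IoU metric named in the abstract, the following is a minimal sketch of intersection over union for a binary per-point labeling (for example, tower versus non-tower). It is not taken from the paper's code; the function name `binary_iou`, the toy labels, and the convention of returning 1.0 when both masks are empty are assumptions made for illustration.

```python
def binary_iou(pred, target):
    """Intersection over union for two equal-length binary label sequences.

    Hypothetical helper for illustration only, not the paper's evaluation code.
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")
    # Points labeled positive in both the prediction and the ground truth.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    # Points labeled positive in at least one of the two.
    union = sum(1 for p, t in zip(pred, target) if p or t)
    # Assumed convention: two empty masks count as a perfect match.
    return intersection / union if union else 1.0


# Toy per-point labels (made up): 1 = tower, 0 = background.
pred = [1, 1, 0, 0, 1]
target = [1, 0, 0, 1, 1]
iou = binary_iou(pred, target)
print(f"IoU = {iou:.2f}")
```

Here two points are positive in both sequences and four are positive in at least one, so the ratio is 2/4.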
