Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
144 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Teachers in concordance for pseudo-labeling of 3D sequential data (2207.06079v2)

Published 13 Jul 2022 in cs.CV and cs.RO

Abstract: Automatic pseudo-labeling is a powerful tool to tap into large amounts of sequential unlabeled data. It is specially appealing in safety-critical applications of autonomous driving, where performance requirements are extreme, datasets are large, and manual labeling is very challenging. We propose to leverage sequences of point clouds to boost the pseudolabeling technique in a teacher-student setup via training multiple teachers, each with access to different temporal information. This set of teachers, dubbed Concordance, provides higher quality pseudo-labels for student training than standard methods. The output of multiple teachers is combined via a novel pseudo label confidence-guided criterion. Our experimental evaluation focuses on the 3D point cloud domain and urban driving scenarios. We show the performance of our method applied to 3D semantic segmentation and 3D object detection on three benchmark datasets. Our approach, which uses only 20% manual labels, outperforms some fully supervised methods. A notable performance boost is achieved for classes rarely appearing in training data.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (52)
  1. SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences. In Proc. of the IEEE/CVF International Conf. on Computer Vision (ICCV), 2019.
  2. The lovász-softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks. In CVPR, 2018.
  3. Benjamin Caine et al. Pseudo-labeling for scalable 3d object detection. arXiv, abs/2103.02093, 2021.
  4. Argoverse: 3D tracking and forecasting with rich maps. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
  5. On the efficacy of knowledge distillation. In IEEE/CVF International Conference on Computer Vision (ICCV), October 2019.
  6. 4d spatio-temporal convnets: Minkowski convolutional neural networks. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019, Long Beach, CA, USA, June 16-20, 2019, pages 3075–3084. Computer Vision Foundation / IEEE, 2019.
  7. Salsanext: Fast, uncertainty-aware semantic segmentation of lidar point clouds for autonomous driving. In 15th International Symposium ISVC 2020.
  8. Tornado-net: multiview total variation semantic segmentation with diamond inception module. In 2021 IEEE International Conference on Robotics and Automation (ICRA), pages 9543–9549. IEEE, 2021.
  9. Distilling the knowledge in a neural network. In NIPS Deep Learning and Representation Learning Workshop, 2015.
  10. What you see is what you get: Exploiting visibility for 3d object detection. In CVPR 2020, pages 10998–11006, 06 2020.
  11. Randla-net: Efficient semantic segmentation of large-scale point clouds. In Proc. of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11108–11117, 2020.
  12. Guided point contrastive learning for semi-supervised point cloud semantic segmentation. In Proc. of the IEEE/CVF International Conference on Computer Vision, pages 6423–6432, 2021.
  13. Deep learning under privileged information using heteroscedastic dropout. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 8886–8895, 2018.
  14. D. Lee. Pseudo-label : The simple and efficient semi-supervised learning method for deep neural networks. In International Conference on Machine Learning (ICML). Workshop Challenges in Representation Learning, 2013.
  15. OPTIMOL: automatic online picture collection via incremental model learning. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Computer Society, 2007.
  16. Self-loop uncertainty: A novel pseudo-label for semi-supervised medical image segmentation. The Medical Image Computing and Computer Assisted Intervetion, (MICCAI), pages 614–623, 09 2020.
  17. I Liu et al. Knowledge flow: Improve upon your teachers. In 7th International Conference on Learning Representations, (ICLR), 2019.
  18. Xingyu Liu et al. Meteornet: Deep learning on dynamic 3d point cloud sequences. In ICCV, 2019.
  19. Unbiased teacher for semi-supervised object detection. In International Conference on Learning Representations, (ICLR), 2021.
  20. Rangenet ++: Fast and accurate lidar semantic segmentation. 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 4213–4220, 2019.
  21. Improved knowledge distillation via teacher assistant: Bridging the gap between student and teacher. In 34th AAAI Conference on Artificial Intelligence, 2019.
  22. Offboard 3d object detection from point cloud sequences. In CVPR, 2021.
  23. PointNet: Deep learning on point sets for 3D classification and segmentation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
  24. Pointnet++ and three layers of features fusion for occlusion three-dimensional ear recognition based on one sample per person. Symmetry, 12(1):78, 2020.
  25. Spsequencenet: Semantic segmentation network on 4d point clouds. In Proc. of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2020.
  26. PointRCNN: 3D object proposal generation and detection from point cloud. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 770–779, 2019.
  27. Transductive semi-supervised deep learning using min-max features. In Proc. of the European Conference on Computer Vision (ECCV), September 2018.
  28. Fixmatch: Simplifying semi-supervised learning with consistency and confidence. In Advances in Neural Information Processing Systems, volume 33, pages 596–608. Curran Associates, Inc., 2020.
  29. Seg-Contrast: 3D point cloud feature representation learning through self-supervised segment discrimination. IEEE RAL, 2022
  30. xMUDA: Cross-modal unsupervised domain adaptation for 3d semantic segmentation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020
  31. P. Jiang and S. Saripalli. Lidarnet: A boundary-aware domain adaptation model for point cloud semantic segmentation. In International conference robotics automation, 2021.
  32. Tangent convolutions for dense prediction in 3d. In Proc. of the IEEE Conference on Computer Vision and Pattern Recognition, pages 3887–3896, 2018.
  33. Automatic labeling to generate training data for online lidar-based moving object segmentation. IEEE RA-L, page 6107–6114, 2022
  34. Tracking-based semi-supervised learning. IJRR, 31(7):804–818, 2012
  35. Pointcontrast: Unsupervised pre-training for 3D point cloud understanding. Europian conference in computer vision (ECCV), 2020.
  36. Similarity-preserving knowledge distillation. In ICCV, 2019
  37. L. Rokach. Ensemble Learning: Pattern Classification Using Ensemble Methods. Series in machine perception and artificial intelligence. 2019.
  38. Kpconv: Flexible and deformable convolution for point clouds. In Proc. of the IEEE/CVF international conference on computer vision, pages 6411–6420, 2019.
  39. Vladimir Vapnik et al. A new learning paradigm: Learning using privileged information. Neural Networks, 22(5):544–557, 2009. Advances in Neural Networks Research: IJCNN2009.
  40. Learning using privileged information: Similarity control and knowledge transfer. Journal of Machine Learning Research, 16(61):2023–2049, 2015.
  41. 3DIoUMatch: Leveraging IoU prediction for semi-supervised 3d object detection. arXiv preprint arXiv:2012.04355, 2020.
  42. Squeezesegv3: Spatially-adaptive convolution for efficient point-cloud segmentation. In European Conference on Computer Vision, pages 1–19. Springer, 2020.
  43. Deep fusionnet for point cloud semantic segmentation. In ECCV, 2020.
  44. Polarnet: An improved grid representation for online lidar point clouds semantic segmentation. In Proc. of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9601–9610, 2020.
  45. Sess: Self-ensembling semi-supervised 3d object detection. In Proc. of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2020.
  46. Cylinder3d: An effective 3d framework for driving-scene lidar semantic segmentation. arXiv preprint arXiv:2008.01550, 2020.
  47. Instant-teaching: An end-to-end semi-supervised object detection framework. In Proc. of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 4081–4090, June 2021.
  48. End-to-end multi-view fusion for 3d object detection in lidar point clouds. In CoRL 2019, 2019.
  49. Cylindrical and asymmetrical 3d convolution networks for lidar segmentation. In Proc. of the IEEE/CVF conference on computer vision and pattern recognition, pages 9939–9948, 2021.
  50. (AF)2-S3Net: Attentive Feature Fusion With Adaptive Feature Selection for Sparse Semantic Segmentation Network. In CVPR, 2021.
  51. Point-to-Voxel Knowledge Distillation for LiDAR Semantic Segmentation In CVPR, 2022.
  52. nuScenes: A Multimodal Dataset for Autonomous Driving. In CVPR, 2020.
Citations (5)

Summary

We haven't generated a summary for this paper yet.