Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Density-guided Translator Boosts Synthetic-to-Real Unsupervised Domain Adaptive Segmentation of 3D Point Clouds (2403.18469v1)

Published 27 Mar 2024 in cs.CV and cs.AI

Abstract: 3D synthetic-to-real unsupervised domain adaptive segmentation is crucial to annotating new domains. Self-training is a competitive approach for this task, but its performance is limited by different sensor sampling patterns (i.e., variations in point density) and incomplete training strategies. In this work, we propose a density-guided translator (DGT), which translates point density between domains, and integrates it into a two-stage self-training pipeline named DGT-ST. First, in contrast to existing works that simultaneously conduct data generation and feature/output alignment within unstable adversarial training, we employ the non-learnable DGT to bridge the domain gap at the input level. Second, to provide a well-initialized model for self-training, we propose a category-level adversarial network in stage one that utilizes the prototype to prevent negative transfer. Finally, by leveraging the designs above, a domain-mixed self-training method with source-aware consistency loss is proposed in stage two to narrow the domain gap further. Experiments on two synthetic-to-real segmentation tasks (SynLiDAR $\rightarrow$ semanticKITTI and SynLiDAR $\rightarrow$ semanticPOSS) demonstrate that DGT-ST outperforms state-of-the-art methods, achieving 9.4$\%$ and 4.3$\%$ mIoU improvements, respectively. Code is available at \url{https://github.com/yuan-zm/DGT-ST}.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (48)
  1. RangeViT: Towards vision transformers for 3d semantic segmentation in autonomous driving. In CVPR, pages 5240–5250, 2023.
  2. SemanticKITTI: A dataset for semantic scene understanding of LiDAR sequences. In ICCV, pages 9297–9307, 2019.
  3. 4D Spatio-Temporal ConvNeTs: Minkowski convolutional neural networks. In CVPR, pages 3075–3084, 2019.
  4. SalsaNext: Fast, uncertainty-aware semantic segmentation of lidar point clouds for autonomous driving. arXiv preprint arXiv:2003.03653, 2020.
  5. RandLA-Net: Efficient semantic segmentation of large-scale point clouds. In CVPR, pages 11108–11117, 2020.
  6. LiDARNet: A boundary-aware domain adaptation model for point cloud semantic segmentation. In Proc. IEEE Int. Conf. Robot. Autom. (ICRA), pages 2457–2464, 2021.
  7. Adam: A method for stochastic optimization. In ICLR, 2014.
  8. Rethinking range view representation for lidar segmentation. In ICCV, pages 228–240, 2023a.
  9. ConDA: Unsupervised domain adaptation for lidar segmentation via regularized domain concatenation. In Proc. IEEE Int. Conf. Robot. Autom. (ICRA), pages 9338–9345. IEEE, 2023b.
  10. LaserMix for semi-supervised lidar semantic segmentation. In CVPR, pages 21705–21715, 2023c.
  11. Stratified transformer for 3d point cloud segmentation. In CVPR, pages 8500–8509, 2022.
  12. Spherical transformer for lidar-based 3d recognition. In CVPR, pages 17545–17555, 2023.
  13. Adversarially Masking Synthetic to Mimic Real: Adaptive noise injection for point cloud segmentation adaptation. In CVPR, pages 20464–20474, 2023.
  14. Taking a closer look at domain shift: Category-level adversaries for semantics consistent domain adaptation. In CVPR, pages 2507–2516, 2019.
  15. Least squares generative adversarial networks. In ICCV, pages 2794–2802, 2017.
  16. SemanticPOSS: A point cloud dataset with large quantity of dynamic instances. In IEEE Intelligent Vehicles Symposium (IV), pages 687–693. IEEE, 2020.
  17. Self-positioning point-based transformer for point cloud understanding. In CVPR, pages 21814–21823, 2023.
  18. PyTorch: An imperative style, high-performance deep learning library. NeurIPS, 32, 2019.
  19. PointNet: Deep learning on point sets for 3d classification and segmentation. In CVPR, pages 652–660, 2017a.
  20. PointNet++: Deep hierarchical feature learning on point sets in a metric space. NeurIPS, 30, 2017b.
  21. PointNeXt: Revisiting PointNet++ with improved training and scaling strategies. NeurIPS, 35:23192–23204, 2022.
  22. CoSMix: Compositional semantic mix for domain adaptation in 3d lidar segmentation. In ECCV, pages 586–602. Springer, 2022.
  23. LiDAR-UDA: Self-ensembling through time for unsupervised lidar domain adaptation. In ICCV, pages 19784–19794, 2023.
  24. Searching efficient 3d architectures with sparse point-voxel convolution. In ECCV, pages 685–702. Springer, 2020.
  25. Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results. NeurIPS, 30, 2017.
  26. Learning to adapt structured output space for semantic segmentation. In CVPR, pages 7472–7481, 2018.
  27. Laurens Van der Maaten and Geoffrey Hinton. Visualizing data using t-SNE. J. Mach. Learn. Res., 9(11), 2008.
  28. ADVENT: Adversarial entropy minimization for domain adaptation in semantic segmentation. In CVPR, pages 2517–2526, 2019.
  29. Classes Matter: A fine-grained adversarial approach to cross-domain semantic segmentation. In ECCV, pages 642–659, 2020.
  30. LiDAR Distillation: Bridging the beam-induced domain gap for 3d object detection. In ECCV, pages 179–195. Springer, 2022.
  31. SqueezeSeg: Convolutional neural nets with recurrent crf for real-time road-object segmentation from 3d lidar point cloud. In Proc. IEEE Int. Conf. Robot. Autom. (ICRA), pages 1887–1893. IEEE, 2018.
  32. SqueezeSegV2: Improved model structure and unsupervised domain adaptation for road-object segmentation from a lidar point cloud. In Proc. IEEE Int. Conf. Robot. Autom. (ICRA), pages 4376–4382. IEEE, 2019.
  33. Virtual sparse convolution for multimodal 3d object detection. In CVPR, pages 21653–21662, 2023.
  34. PolarMix: A general data augmentation technique for lidar point clouds. In NeurIPS, 2022a.
  35. Transfer learning from synthetic to real lidar point cloud for semantic segmentation. In AAAI, pages 2795–2803, 2022b.
  36. RPVNet: A deep and efficient range-point-voxel fusion network for lidar point cloud segmentation. In CVPR, pages 16024–16033, 2021.
  37. PointASNL: Robust point clouds processing using nonlocal neural networks with adaptive sampling. In CVPR, pages 5589–5598, 2020.
  38. 2DPASS: 2d priors assisted semantic segmentation on lidar point clouds. In ECCV, pages 677–695. Springer, 2022.
  39. Complete & Label: A domain adaptation approach to semantic segmentation of LiDAR point clouds. In CVPR, pages 15363–15373, 2021.
  40. Category-level adversaries for outdoor lidar point clouds cross-domain semantic segmentation. IEEE Trans. Intell. Transp. Syst., 24(2):1982–1993, 2022.
  41. Prototype-guided multitask adversarial network for cross-domain lidar point clouds semantic segmentation. IEEE Trans. Geosci. Remote Sens., 61:1–13, 2023.
  42. Prototypical pseudo label denoising and target structure learning for domain adaptive semantic segmentation. In CVPR, pages 12414–12424, 2021.
  43. Category anchor-guided unsupervised domain adaptation for semantic segmentation. In NeurIPS, 2019.
  44. Point transformer. In ICCV, pages 16259–16268, 2021a.
  45. ePointDA: An end-to-end simulation-to-real domain adaptation framework for lidar point cloud segmentation. In AAAI, pages 3500–3509, 2021b.
  46. Unsupervised scene adaptation with memory regularization in vivo. In IJCAI, pages 1076–1082, 2020.
  47. Unpaired image-to-image translation using cycle-consistent adversarial networks. In ICCV, pages 2223–2232, 2017.
  48. Cylindrical and asymmetrical 3d convolution networks for lidar segmentation. In CVPR, pages 9939–9948, 2021.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Zhimin Yuan (6 papers)
  2. Wankang Zeng (1 paper)
  3. Yanfei Su (2 papers)
  4. Weiquan Liu (10 papers)
  5. Ming Cheng (69 papers)
  6. Yulan Guo (89 papers)
  7. Cheng Wang (386 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.

Github Logo Streamline Icon: https://streamlinehq.com

GitHub