Density-guided Translator Boosts Synthetic-to-Real Unsupervised Domain Adaptive Segmentation of 3D Point Clouds (2403.18469v1)
Abstract: 3D synthetic-to-real unsupervised domain adaptive segmentation is crucial to annotating new domains. Self-training is a competitive approach for this task, but its performance is limited by different sensor sampling patterns (i.e., variations in point density) and incomplete training strategies. In this work, we propose a density-guided translator (DGT), which translates point density between domains, and integrates it into a two-stage self-training pipeline named DGT-ST. First, in contrast to existing works that simultaneously conduct data generation and feature/output alignment within unstable adversarial training, we employ the non-learnable DGT to bridge the domain gap at the input level. Second, to provide a well-initialized model for self-training, we propose a category-level adversarial network in stage one that utilizes the prototype to prevent negative transfer. Finally, by leveraging the designs above, a domain-mixed self-training method with source-aware consistency loss is proposed in stage two to narrow the domain gap further. Experiments on two synthetic-to-real segmentation tasks (SynLiDAR $\rightarrow$ semanticKITTI and SynLiDAR $\rightarrow$ semanticPOSS) demonstrate that DGT-ST outperforms state-of-the-art methods, achieving 9.4$\%$ and 4.3$\%$ mIoU improvements, respectively. Code is available at \url{https://github.com/yuan-zm/DGT-ST}.
- RangeViT: Towards vision transformers for 3d semantic segmentation in autonomous driving. In CVPR, pages 5240–5250, 2023.
- SemanticKITTI: A dataset for semantic scene understanding of LiDAR sequences. In ICCV, pages 9297–9307, 2019.
- 4D Spatio-Temporal ConvNeTs: Minkowski convolutional neural networks. In CVPR, pages 3075–3084, 2019.
- SalsaNext: Fast, uncertainty-aware semantic segmentation of lidar point clouds for autonomous driving. arXiv preprint arXiv:2003.03653, 2020.
- RandLA-Net: Efficient semantic segmentation of large-scale point clouds. In CVPR, pages 11108–11117, 2020.
- LiDARNet: A boundary-aware domain adaptation model for point cloud semantic segmentation. In Proc. IEEE Int. Conf. Robot. Autom. (ICRA), pages 2457–2464, 2021.
- Adam: A method for stochastic optimization. In ICLR, 2014.
- Rethinking range view representation for lidar segmentation. In ICCV, pages 228–240, 2023a.
- ConDA: Unsupervised domain adaptation for lidar segmentation via regularized domain concatenation. In Proc. IEEE Int. Conf. Robot. Autom. (ICRA), pages 9338–9345. IEEE, 2023b.
- LaserMix for semi-supervised lidar semantic segmentation. In CVPR, pages 21705–21715, 2023c.
- Stratified transformer for 3d point cloud segmentation. In CVPR, pages 8500–8509, 2022.
- Spherical transformer for lidar-based 3d recognition. In CVPR, pages 17545–17555, 2023.
- Adversarially Masking Synthetic to Mimic Real: Adaptive noise injection for point cloud segmentation adaptation. In CVPR, pages 20464–20474, 2023.
- Taking a closer look at domain shift: Category-level adversaries for semantics consistent domain adaptation. In CVPR, pages 2507–2516, 2019.
- Least squares generative adversarial networks. In ICCV, pages 2794–2802, 2017.
- SemanticPOSS: A point cloud dataset with large quantity of dynamic instances. In IEEE Intelligent Vehicles Symposium (IV), pages 687–693. IEEE, 2020.
- Self-positioning point-based transformer for point cloud understanding. In CVPR, pages 21814–21823, 2023.
- PyTorch: An imperative style, high-performance deep learning library. NeurIPS, 32, 2019.
- PointNet: Deep learning on point sets for 3d classification and segmentation. In CVPR, pages 652–660, 2017a.
- PointNet++: Deep hierarchical feature learning on point sets in a metric space. NeurIPS, 30, 2017b.
- PointNeXt: Revisiting PointNet++ with improved training and scaling strategies. NeurIPS, 35:23192–23204, 2022.
- CoSMix: Compositional semantic mix for domain adaptation in 3d lidar segmentation. In ECCV, pages 586–602. Springer, 2022.
- LiDAR-UDA: Self-ensembling through time for unsupervised lidar domain adaptation. In ICCV, pages 19784–19794, 2023.
- Searching efficient 3d architectures with sparse point-voxel convolution. In ECCV, pages 685–702. Springer, 2020.
- Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results. NeurIPS, 30, 2017.
- Learning to adapt structured output space for semantic segmentation. In CVPR, pages 7472–7481, 2018.
- Laurens Van der Maaten and Geoffrey Hinton. Visualizing data using t-SNE. J. Mach. Learn. Res., 9(11), 2008.
- ADVENT: Adversarial entropy minimization for domain adaptation in semantic segmentation. In CVPR, pages 2517–2526, 2019.
- Classes Matter: A fine-grained adversarial approach to cross-domain semantic segmentation. In ECCV, pages 642–659, 2020.
- LiDAR Distillation: Bridging the beam-induced domain gap for 3d object detection. In ECCV, pages 179–195. Springer, 2022.
- SqueezeSeg: Convolutional neural nets with recurrent crf for real-time road-object segmentation from 3d lidar point cloud. In Proc. IEEE Int. Conf. Robot. Autom. (ICRA), pages 1887–1893. IEEE, 2018.
- SqueezeSegV2: Improved model structure and unsupervised domain adaptation for road-object segmentation from a lidar point cloud. In Proc. IEEE Int. Conf. Robot. Autom. (ICRA), pages 4376–4382. IEEE, 2019.
- Virtual sparse convolution for multimodal 3d object detection. In CVPR, pages 21653–21662, 2023.
- PolarMix: A general data augmentation technique for lidar point clouds. In NeurIPS, 2022a.
- Transfer learning from synthetic to real lidar point cloud for semantic segmentation. In AAAI, pages 2795–2803, 2022b.
- RPVNet: A deep and efficient range-point-voxel fusion network for lidar point cloud segmentation. In CVPR, pages 16024–16033, 2021.
- PointASNL: Robust point clouds processing using nonlocal neural networks with adaptive sampling. In CVPR, pages 5589–5598, 2020.
- 2DPASS: 2d priors assisted semantic segmentation on lidar point clouds. In ECCV, pages 677–695. Springer, 2022.
- Complete & Label: A domain adaptation approach to semantic segmentation of LiDAR point clouds. In CVPR, pages 15363–15373, 2021.
- Category-level adversaries for outdoor lidar point clouds cross-domain semantic segmentation. IEEE Trans. Intell. Transp. Syst., 24(2):1982–1993, 2022.
- Prototype-guided multitask adversarial network for cross-domain lidar point clouds semantic segmentation. IEEE Trans. Geosci. Remote Sens., 61:1–13, 2023.
- Prototypical pseudo label denoising and target structure learning for domain adaptive semantic segmentation. In CVPR, pages 12414–12424, 2021.
- Category anchor-guided unsupervised domain adaptation for semantic segmentation. In NeurIPS, 2019.
- Point transformer. In ICCV, pages 16259–16268, 2021a.
- ePointDA: An end-to-end simulation-to-real domain adaptation framework for lidar point cloud segmentation. In AAAI, pages 3500–3509, 2021b.
- Unsupervised scene adaptation with memory regularization in vivo. In IJCAI, pages 1076–1082, 2020.
- Unpaired image-to-image translation using cycle-consistent adversarial networks. In ICCV, pages 2223–2232, 2017.
- Cylindrical and asymmetrical 3d convolution networks for lidar segmentation. In CVPR, pages 9939–9948, 2021.
- Zhimin Yuan (6 papers)
- Wankang Zeng (1 paper)
- Yanfei Su (2 papers)
- Weiquan Liu (10 papers)
- Ming Cheng (69 papers)
- Yulan Guo (89 papers)
- Cheng Wang (386 papers)