Class Balanced Dynamic Acquisition for Domain Adaptive Semantic Segmentation using Active Learning (2311.14146v1)
Abstract: Domain adaptive active learning is leading the charge in label-efficient training of neural networks. For semantic segmentation, state-of-the-art models jointly use two criteria, uncertainty and diversity, to select training labels, combined with a pixel-wise acquisition strategy. However, we show that such methods currently suffer from a class imbalance issue which degrades their performance for larger active learning budgets. We then introduce Class Balanced Dynamic Acquisition (CBDA), a novel active learning method that mitigates this issue, especially in high-budget regimes. The more balanced labels increase minority class performance, which in turn allows the model to outperform the previous baseline by 0.6, 1.7, and 2.4 mIoU for budgets of 5%, 10%, and 20%, respectively. Additionally, the focus on minority classes improves the minimum class performance by 0.5, 2.9, and 4.6 IoU, respectively. The top-performing model even exceeds the fully supervised baseline, showing that a more balanced set of labels can be more beneficial than the entire ground truth.
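The abstract reports two metrics: mIoU (the mean of the per-class IoU values) and the minimum class IoU (the score of the worst-performing class). As a minimal sketch of how these standard quantities are typically computed from a confusion matrix, and not the authors' evaluation code, the following NumPy snippet may help. The function names `confusion_matrix` and `per_class_iou` and the toy labels are illustrative assumptions.

```python
import numpy as np


def confusion_matrix(y_true, y_pred, num_classes):
    """Count pixels: rows are ground-truth classes, columns are predictions."""
    y_true = np.asarray(y_true).ravel()
    y_pred = np.asarray(y_pred).ravel()
    # Encode each (true, pred) pair as a single index and count occurrences.
    idx = y_true * num_classes + y_pred
    counts = np.bincount(idx, minlength=num_classes ** 2)
    return counts.reshape(num_classes, num_classes)


def per_class_iou(cm):
    """IoU per class: TP / (TP + FP + FN). Absent classes yield NaN."""
    tp = np.diag(cm).astype(float)
    union = cm.sum(axis=0) + cm.sum(axis=1) - tp
    iou = np.full(cm.shape[0], np.nan)
    np.divide(tp, union, out=iou, where=union > 0)
    return iou


# Toy example (hypothetical labels): three classes, eight pixels.
y_true = [0, 0, 0, 0, 1, 1, 2, 2]
y_pred = [0, 0, 0, 1, 1, 1, 2, 0]

iou = per_class_iou(confusion_matrix(y_true, y_pred, num_classes=3))
miou = float(np.nanmean(iou))    # mean over classes that occur
min_iou = float(np.nanmin(iou))  # worst-performing class
```

In this toy case the per-class IoU values are 3/5, 2/3, and 1/2, so the minimum class IoU is 0.5. Reporting the minimum alongside the mean makes minority-class failures visible that an average over many well-represented classes can hide.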