CAT: Exploiting Inter-Class Dynamics for Domain Adaptive Object Detection (2403.19278v1)
Abstract: Domain adaptive object detection aims to adapt detection models to domains where annotated data is unavailable. Existing methods have been proposed to address the domain gap using the semi-supervised student-teacher framework. However, a fundamental issue arises from the class imbalance in the labelled training set, which can result in inaccurate pseudo-labels. The relationship between classes, especially where one class is a majority and the other minority, has a large impact on class bias. We propose Class-Aware Teacher (CAT) to address the class bias issue in the domain adaptation setting. In our work, we approximate the class relationships with our Inter-Class Relation module (ICRm) and exploit it to reduce the bias within the model. In this way, we are able to apply augmentations to highly related classes, both inter- and intra-domain, to boost the performance of minority classes while having minimal impact on majority classes. We further reduce the bias by implementing a class-relation weight to our classification loss. Experiments conducted on various datasets and ablation studies show that our method is able to address the class bias in the domain adaptation setting. On the Cityscapes to Foggy Cityscapes dataset, we attained a 52.5 mAP, a substantial improvement over the 51.2 mAP achieved by the state-of-the-art method.
- Contrastive mean teacher for domain adaptive object detectors. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 23839--23848, 2023.
- Harmonizing transferability and discriminability for adapting object detectors. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.
- I3net: Implicit instance-invariant network for adapting one-stage object detectors. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 12576--12585, 2021.
- Learning domain adaptive object detection with probabilistic teacher. In International Conference on Machine Learning, pages 3040--3055, 2022.
- Domain adaptive faster r-cnn for object detection in the wild. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3339--3348, 2018.
- Remix: rebalanced mixup. Computer Vision – ECCV 2020 Workshops, pages 95--110, 2020.
- The cityscapes dataset for semantic urban scene understanding. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3213--3223, 2016.
- Unbiased mean teacher for cross-domain object detection. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4089--4099, 2021.
- Harmonious teacher for cross-domain object detection. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 23829--23838, 2023.
- The pascal visual object classes (voc) challenge. International Journal of Computer Vision, pages 303--308, 2009.
- Balanced-mixup for highly imbalanced medical image classification. Medical Image Computing and Computer Assisted Intervention – MICCAI 2021, pages 323--333, 2021.
- Vision meets robotics: The kitti dataset. International Journal of Robotics Research, 2013.
- Ross Girshick. Fast r-cnn. In IEEE/CVF International Conference on Computer Vision, pages 1440--1448, 2015.
- Learning from imbalanced data. IEEE Transactions on knowledge and data engineering, 21(9):1263--1284, 2009.
- Deep residual learning for image recognition. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 770--778, 2016.
- Cross domain object detection by target-perceived dual branch distillation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9560--9570, 2022.
- Unsupervised Domain Adaptation with Imbalanced Cross-Domain Data. In IEEE/CVF International Conference on Computer Vision, pages 4121--4129, 2015.
- Learning deep representation for imbalanced classification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
- Aqt: Adversarial query transformers for domain adaptive object detection. In International Joint Conference on Artificial Intelligence (IJCAI), 2022.
- Cross-domain weakly-supervised object detection through progressive domain adaptation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018.
- Implicit Class-Conditioned Domain Alignment for Unsupervised Domain Adaptation. In International Conference on Machine Learning, pages 4816--4827, 2020.
- Driving in the matrix: Can virtual worlds replace human-generated annotations for real world tasks? In International Conference on Robotics and Automation, pages 746--753. IEEE, 2017.
- Revisiting class imbalance for end-to-end semi-supervised object detection. In IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pages 4570--4579, 2023.
- 2pcnet: Two-phase consistency training for day-to-night unsupervised domain adaptive object detection. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11484--11493, 2023.
- Diversify and match: A domain adaptive representation learning paradigm for object detection. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.
- Mila: Memory-based instance-level adaptation for cross-domain object detection. British Machine Vision Conference, (BMVC), 2023.
- Rethinking pseudo labels for semi-supervised object detection. In AAAI, 2021.
- Sigma: Semantic-complete graph matching for domain adaptive object detection. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022a.
- Cross-domain adaptive teacher for object detection. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 7571--7580, 2022b.
- Unbiased teacher for semi-supervised object detection. In International Conference on Learning Representations, 2021.
- Imbalance Problems in Object Detection: A Review. IEEE Transactions on Pattern Analysis and Machine Intelligence, pages 1--1, 2020.
- Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in Neural Information Processing Systems, page 91–99, 2015.
- Strong-weak distribution alignment for adaptive object detection. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.
- Semantic foggy scene understanding with synthetic data. International Journal of Computer Vision, 126(9):973--992, 2018.
- Very deep convolutional networks for large-scale image recognition. pages 1--14. Computational and Biological Learning Society, 2015.
- A Prototype-Oriented Framework for Unsupervised Domain Adaptation. In Advances in Neural Information Processing Systems, pages 17194--17208, 2021.
- Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results. In Advances in Neural Information Processing Systems, page 1195–1204, 2017.
- Fcos: Fully convolutional one-stage object detection. In IEEE/CVF International Conference on Computer Vision, 2019.
- Mega-cda: Memory guided attention for category-aware unsupervised domain adaptive object detection. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4516--4526, 2021.
- Learning to model the tail. In Advances in Neural Information Processing Systems. Curran Associates, Inc., 2017.
- Detectron2. https://github.com/facebookresearch/detectron2, 2019.
- Exploring categorical regularization for domain adaptive object detection. IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11721--11730, 2020a.
- Cross-domain detection via graph-induced prototype alignment. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020b.
- FDA: Fourier domain adaptation for semantic segmentation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4084--4094, 2020.
- Unsupervised domain adaptation for one-stage object detector using offsets to bounding box. In European Conference on Computer Vision, pages 691--708. Springer, 2022.
- Semi-supervised object detection with adaptive class-rebalancing self-training. AAAI, 36(3):3252--3261, 2022.
- mixup: Beyond empirical risk minimization. International Conference on Learning Representations, 2018.
- Task-specific inconsistency alignment for domain adaptive object detection. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022.
- Masked retraining teacher-student framework for domain adaptive object detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 19039--19049, 2023.
- Forkgan: Seeing into the rainy night. In European Conference on Computer Vision, 2020.
- Unpaired image-to-image translation using cycle-consistent adversarial networks. In IEEE/CVF International Conference on Computer Vision, pages 2242--2251, 2017.
- Deformable {detr}: Deformable transformers for end-to-end object detection. In International Conference on Learning Representations, 2021.