Open Compound Domain Adaptation with Object Style Compensation for Semantic Segmentation (2309.16127v1)
Abstract: Many methods of semantic image segmentation have borrowed the success of open compound domain adaptation. They minimize the style gap between the images of source and target domains, more easily predicting the accurate pseudo annotations for target domain's images that train segmentation network. The existing methods globally adapt the scene style of the images, whereas the object styles of different categories or instances are adapted improperly. This paper proposes the Object Style Compensation, where we construct the Object-Level Discrepancy Memory with multiple sets of discrepancy features. The discrepancy features in a set capture the style changes of the same category's object instances adapted from target to source domains. We learn the discrepancy features from the images of source and target domains, storing the discrepancy features in memory. With this memory, we select appropriate discrepancy features for compensating the style information of the object instances of various categories, adapting the object styles to a unified style of source domain. Our method enables a more accurate computation of the pseudo annotations for target domain's images, thus yielding state-of-the-art results on different datasets.
- Open compound domain adaptation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 12406–12415, 2020.
- Discover, hallucinate, and adapt: Open compound domain adaptation for semantic segmentation. Advances in Neural Information Processing Systems, 33:10869–10880, 2020.
- Cluster, split, fuse, and update: Meta-learning for open compound domain adaptive semantic segmentation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 8344–8354, 2021.
- Amplitude spectrum transformation for open compound domain adaptive semantic segmentation. In AAAI Conference on Artificial Intelligence, volume 36, pages 1220–1227, 2022.
- Ml-bpm: Multi-teacher learning with bidirectional photometric mixing for open compound domain adaptation in semantic segmentation. In European Conference on Computer Vision, pages 236–251, 2022.
- Acdc: The adverse conditions dataset with correspondences for semantic driving scene understanding. In IEEE/CVF International Conference on Computer Vision, pages 10765–10775, 2021.
- The cityscapes dataset for semantic urban scene understanding. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3213–3223, 2016.
- Augmented reality meets computer vision: Efficient data generation for urban driving scenes. International Journal of Computer Vision, 126:961–972, 2018.
- Wilddash-creating hazard-aware benchmarks. In European Conference on Computer Vision, pages 402–416, 2018.
- Model adaptation: Unsupervised domain adaptation without source data. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9641–9650, 2020.
- Fixbi: Bridging domain spaces for unsupervised domain adaptation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1094–1103, 2021.
- Ni Xiao and Lei Zhang. Dynamic weighted learning for unsupervised domain adaptation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 15242–15251, 2021.
- Universal domain adaptation through self supervision. Advances in Neural Information Processing Systems, 33:16282–16292, 2020.
- Enhanced transport distance for unsupervised domain adaptation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 13936–13944, 2020.
- Gradually vanishing bridge for adversarial domain adaptation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 12455–12464, 2020.
- Unsupervised intra-domain adaptation for semantic segmentation through self-supervision. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3764–3773, 2020.
- Adversarial domain adaptation with domain mixup. In AAAI Conference on Artificial Intelligence, volume 34, pages 6502–6509, 2020.
- Universal source-free domain adaptation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4544–4553, 2020.
- Progressive domain adaptation for object detection. In IEEE/CVF Winter Conference on Applications of Computer Vision, pages 749–757, 2020.
- Fda: Fourier domain adaptation for semantic segmentation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4085–4095, 2020.
- Bidirectional learning for domain adaptation of semantic segmentation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 6936–6945, 2019.
- Unsupervised multi-target domain adaptation: An information theoretic approach. IEEE Transactions on Image Processing, 29:3993–4002, 2020.
- Multi-target adversarial frameworks for domain adaptation in semantic segmentation. In IEEE/CVF International Conference on Computer Vision, pages 9072–9081, 2021.
- Curriculum graph co-teaching for multi-target domain adaptation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5351–5360, 2021.
- Multi-target domain adaptation with collaborative consistency learning. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 8187–8196, 2021.
- Learning to diversify for single domain generalization. In IEEE/CVF International Conference on Computer Vision, pages 834–843, 2021.
- Learning to generate novel domains for domain generalization. In European Conference on Computer Vision, pages 561–578, 2020.
- Model-based domain generalization. Advances in Neural Information Processing Systems, 34:20210–20229, 2021.
- Domain generalization via entropy regularization. Advances in Neural Information Processing Systems, 33:16096–16107, 2020.
- Domain generalization using causal matching. In International Conference on Machine Learning, pages 7313–7324. PMLR, 2021.
- Domain generalization using a mixture of multiple latent domains. In AAAI Conference on Artificial Intelligence, volume 34, pages 11749–11756, 2020.
- Swad: Domain generalization by seeking flat minima. Advances in Neural Information Processing Systems, 34:22405–22418, 2021.
- Test-time classifier adjustment module for model-agnostic domain generalization. Advances in Neural Information Processing Systems, 34:2427–2440, 2021.
- Fsdr: Frequency space domain randomization for domain generalization. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 6891–6902, 2021.
- Selfreg: Self-supervised contrastive regularization for domain generalization. In IEEE/CVF International Conference on Computer Vision, pages 9619–9628, 2021.
- A fourier-based framework for domain generalization. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 14383–14392, 2021.
- Compound domain generalization via meta-knowledge encoding. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 7119–7129, 2022.
- Acmm: Aligned cross-modal memory for few-shot image and sentence matching. In IEEE/CVF International Conference on Computer Vision, pages 5774–5783, 2019.
- Meshed-memory transformer for image captioning. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 10578–10587, 2020.
- Dm-gan: Dynamic memory generative adversarial networks for text-to-image synthesis. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5802–5810, 2019.
- Video object segmentation using space-time memory networks. In IEEE/CVF International Conference on Computer Vision, pages 9226–9235, 2019.
- Memory oriented transfer learning for semi-supervised image deraining. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 7732–7741, 2021.
- A hybrid video anomaly detection framework via memory-augmented flow reconstruction and flow-guided frame prediction. In IEEE/CVF International Conference on Computer Vision, pages 13588–13597, 2021.
- Imram: Iterative matching with recurrent attention memory for cross-modal image-text retrieval. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 12655–12663, 2020.
- Memory-guided unsupervised image-to-image translation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 6558–6567, 2021.
- Generative memory-guided semantic reasoning model for image inpainting. IEEE Transactions on Circuits and Systems for Video Technology, 32(11):7432–7447, 2022.
- Momentum contrast for unsupervised visual representation learning. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9729–9738, 2020.
- Texture memory-augmented deep patch-based image inpainting. IEEE Transactions on Image Processing, 30:9112–9124, 2021.
- Cross-image context for single image inpainting. Advances in Neural Information Processing Systems, 35:1474–1487, 2022.
- Exploring cross-image pixel contrast for semantic segmentation. In IEEE/CVF International Conference on Computer Vision, pages 7303–7313, 2021.
- Domain adaptation with auxiliary target domain-oriented classifier. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 16632–16642, 2021.
- Mega-cda: Memory guided attention for category-aware unsupervised domain adaptive object detection. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4516–4526, 2021.
- St3d++: denoised self-training for unsupervised domain adaptation on 3d object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022.
- Pin the memory: Learning to generalize semantic segmentation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4350–4360, 2022.
- Playing for data: Ground truth from computer games. In European Conference on Computer Vision, pages 102–118, 2016.
- The synthia dataset: A large collection of synthetic images for semantic segmentation of urban scenes. In IEEE Conference on Computer Vision and Pattern Recognition, pages 3234–3243, 2016.
- Dacs: Domain adaptation via cross-domain mixed sampling. In IEEE/CVF Winter Conference on Applications of Computer Vision, pages 1379–1389, 2021.
- Robustnet: Improving domain generalization in urban-scene segmentation via instance selective whitening. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11580–11590, 2021.