Online Continual Domain Adaptation for Semantic Image Segmentation Using Internal Representations (2401.01035v1)
Abstract: Semantic segmentation models trained on annotated data fail to generalize well when the input data distribution changes over an extended time period, so they require retraining to maintain performance. Classic unsupervised domain adaptation (UDA) addresses a similar problem, in which a target domain has no annotated data points, by transferring knowledge from a source domain with annotated data. We develop an online UDA algorithm for semantic segmentation of images that improves model generalization on unannotated domains in scenarios where source data access is restricted during adaptation. We perform model adaptation by minimizing the distributional distance between the source latent features and the target features in a shared embedding space. Our solution promotes a shared domain-agnostic latent feature space between the two domains, which allows the classifier to generalize to the target dataset. To alleviate the need for access to source samples during adaptation, we approximate the source latent feature distribution via an appropriate surrogate distribution, in this case a Gaussian mixture model (GMM). We evaluate our approach on well-established semantic segmentation datasets and demonstrate that it compares favorably against state-of-the-art (SOTA) UDA semantic segmentation methods.
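The two ideas in the abstract, a GMM surrogate for the source latent features and a distributional distance between that surrogate and the target features, can be illustrated with a small NumPy sketch. This is not the authors' implementation. It assumes one Gaussian component per class, and it uses a sliced Wasserstein distance as the distributional distance, which the abstract does not specify. The function names `fit_class_gmm`, `sample_gmm`, and `sliced_wasserstein` are hypothetical, and the features are synthetic vectors rather than encoder outputs.

```python
import numpy as np


def fit_class_gmm(features, labels, num_classes):
    """Fit one Gaussian per class to labeled source features.

    Assumption: one mixture component per class, each class having
    at least two samples. Returns mixture weights, means, covariances.
    """
    dim = features.shape[1]
    weights, means, covs = [], [], []
    for k in range(num_classes):
        z = features[labels == k]
        weights.append(len(z) / len(features))
        means.append(z.mean(axis=0))
        # A small diagonal term keeps each covariance positive definite.
        covs.append(np.cov(z, rowvar=False) + 1e-6 * np.eye(dim))
    return np.array(weights), np.array(means), np.array(covs)


def sample_gmm(weights, means, covs, n, rng):
    """Draw n surrogate features and their component (class) labels."""
    comps = rng.choice(len(weights), size=n, p=weights)
    samples = np.stack(
        [rng.multivariate_normal(means[k], covs[k]) for k in comps]
    )
    return samples, comps


def sliced_wasserstein(x, y, num_projections, rng):
    """Squared sliced 2-Wasserstein distance between equal-size sample sets.

    Both sets are projected onto random unit directions; in one dimension
    the optimal transport plan simply matches sorted samples.
    """
    dirs = rng.normal(size=(num_projections, x.shape[1]))
    dirs /= np.linalg.norm(dirs, axis=1, keepdims=True)
    px = np.sort(x @ dirs.T, axis=0)
    py = np.sort(y @ dirs.T, axis=0)
    return float(np.mean((px - py) ** 2))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic stand-ins for source latent features of two classes.
    source = np.concatenate(
        [rng.normal(0.0, 1.0, (300, 4)), rng.normal(3.0, 1.0, (300, 4))]
    )
    labels = np.repeat([0, 1], 300)
    w, m, c = fit_class_gmm(source, labels, num_classes=2)

    # Once the GMM is fitted, the source data is no longer needed.
    surrogate, _ = sample_gmm(w, m, c, n=600, rng=rng)
    target = source + 2.0  # hypothetical shifted target features
    print("distance to shifted target:",
          sliced_wasserstein(surrogate, target, 50, rng))
```

In an adaptation loop, a distance of this kind would serve as the loss that pulls the encoder's target features toward the surrogate samples. The sketch only evaluates the distance and leaves out the encoder and its optimization.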
Authors: Serban Stan, Mohammad Rostami