Source-Free Domain Adaptation for Semantic Segmentation (2103.16372v1)

Published 30 Mar 2021 in cs.CV

Abstract: Unsupervised Domain Adaptation (UDA) can tackle the challenge that convolutional neural network(CNN)-based approaches for semantic segmentation heavily rely on the pixel-level annotated data, which is labor-intensive. However, existing UDA approaches in this regard inevitably require the full access to source datasets to reduce the gap between the source and target domains during model adaptation, which are impractical in the real scenarios where the source datasets are private, and thus cannot be released along with the well-trained source models. To cope with this issue, we propose a source-free domain adaptation framework for semantic segmentation, namely SFDA, in which only a well-trained source model and an unlabeled target domain dataset are available for adaptation. SFDA not only enables to recover and preserve the source domain knowledge from the source model via knowledge transfer during model adaptation, but also distills valuable information from the target domain for self-supervised learning. The pixel- and patch-level optimization objectives tailored for semantic segmentation are seamlessly integrated in the framework. The extensive experimental results on numerous benchmark datasets highlight the effectiveness of our framework against the existing UDA approaches relying on source data.

View on arXiv

Authors (3)

Yuang Liu (15 papers)
Wei Zhang (1489 papers)
Jun Wang (991 papers)

Citations (232)

View on Semantic Scholar

Summary

Source-Free Domain Adaptation for Semantic Segmentation

The paper "Source-Free Domain Adaptation for Semantic Segmentation," addresses a significant limitation in traditional unsupervised domain adaptation (UDA) frameworks, which typically require access to source datasets during the adaptation process. This requirement is often impractical due to privacy concerns or proprietary constraints. The authors propose a novel framework, SFDA, which tackles this issue by enabling domain adaptation without direct access to source data.

Problem Definition

Semantic segmentation is a critical task in computer vision, involving the partitioning of images into regions that correspond to distinct semantic categories. Current models heavily rely on large, annotated datasets, which are costly to produce. UDA techniques attempt to transfer knowledge from a labeled source domain to an unlabeled target domain, mitigating the need for extensive target annotations. However, existing solutions assume availability of the source data which is unrealistic in many real-world applications. The SFDA framework is proposed to overcome this challenge by utilizing only a trained source model and an unlabeled target dataset.

Methodology

The SFDA framework operates in two main stages, knowledge transfer and model adaptation, without requiring source data:

Knowledge Transfer: The SFDA introduces a source-free knowledge distillation process. A generator synthesizes pseudo-source samples that adhere to the feature space statistics of the source model, using batch normalization statistics (BNS) as a guide. The dual attention distillation (DAD) mechanism is employed to capture spatial and contextual features crucial for segmentation. Through the generated pseudo samples, the source knowledge is transferred to an adaptable target model.
Model Adaptation: The second stage leverages an entropy-based intra-domain patch-level self-supervision module (IPSM). This module differentiates between easy and hard-to-adapt patches in the target domain using entropy measures and employs adversarial learning to align these discrepancies, thereby improving the model's generalization to target-specific contexts.

Experimental Results

The SFDA framework has been evaluated on standard synthetic-to-real and cross-city adaptation tasks, predominantly using the DeepLabV3 and SegNet architectures:

Synthetic-to-Real Adaptation: In experiments involving adaptation from GTA5 and SYNTHIA datasets to the Cityscapes dataset, SFDA achieves competitive results with leading source-driven methods, despite the absence of source data. The framework's ability to distill robust domain knowledge from the source model significantly enhances segmentation performance on the target datasets.
Cross-City Adaptation: Trials using Cityscapes as a source domain and various cities in the NTHU dataset as targets demonstrate the framework's efficiency in handling smaller domain gaps, again achieving results on par or superior to methods requiring source data access.

Implications and Future Work

The SFDA approach expands the potential applications of UDA by eliminating the dependency on source datasets, enhancing privacy and adaptability, especially in sensitive fields such as autonomous driving and medical imaging. By incorporating knowledge distillation and adversarial learning, the proposed method effectively bridges domain gaps without access to source data.

Looking forward, improving the resolution capabilities of the generative module could extend the applicability to tasks requiring high-resolution processing, enhancing the precision of segmentations in more fine-grained applications. Additionally, integrating this approach with other AI systems can foster further advancements in domain adaptation scenarios.

In summary, the SFDA framework represents a practical advancement in the implementation of UDA for semantic segmentation, particularly in scenarios where source data availability is restricted due to privacy concerns or proprietary reasons.

PDF Markdown

Related Papers

Find Related Papers