Unsupervised Cross-Modality Domain Adaptation of ConvNets for Biomedical Image Segmentations with Adversarial Loss
The paper under discussion addresses the domain adaptation challenge that arises when convolutional networks (ConvNets) trained on one imaging modality are applied to another in biomedical image segmentation. The authors propose an unsupervised domain adaptation framework based on adversarial learning, bridging the distributional gap between imaging modalities such as MRI and CT.
In biomedical image analysis, the domain shift between modalities such as MRI and CT is pronounced because the underlying imaging physics differ fundamentally. Traditional approaches relying on labeled target datasets or supervised transfer learning are often impractical given the cost and time of annotating medical data. This paper addresses these challenges with a solution that requires no labeled data from the target domain.
Framework and Methodology
The proposed framework is anchored in a dilated fully convolutional network for pixel-wise prediction. Central to the approach is a plug-and-play domain adaptation module (DAM) paired with a domain critic module (DCM). The DAM maps inputs from the target domain into the feature space of the source domain, while the DCM discriminates between the feature distributions of the two domains, functioning as the discriminator in a GAN-style setup, as sketched below.
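To make the plug-and-play design concrete, here is a minimal PyTorch sketch of how a DAM and DCM could be wired around a frozen source segmenter. The module names, channel sizes, and dilated-convolution configuration are illustrative assumptions, not the paper's exact architecture.

```python
import torch.nn as nn

# Illustrative sketch (not the authors' code): the DAM replaces the early
# layers of a frozen MRI-trained segmenter when CT inputs are fed in; the
# DCM scores feature maps to tell the two domains apart.

class DAM(nn.Module):
    """Maps target-domain (e.g., CT) inputs toward the source feature space."""
    def __init__(self, in_ch=1, feat_ch=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, feat_ch, 3, padding=1),
            nn.ReLU(inplace=True),
            # Dilated convolution enlarges the receptive field without pooling,
            # in the spirit of the dilated FCN backbone.
            nn.Conv2d(feat_ch, feat_ch, 3, padding=2, dilation=2),
            nn.ReLU(inplace=True),
        )

    def forward(self, x):
        return self.net(x)

class DCM(nn.Module):
    """Critic producing an unbounded scalar score per feature map (Wasserstein-style)."""
    def __init__(self, feat_ch=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(feat_ch, 64, 3, stride=2, padding=1),
            nn.LeakyReLU(0.2),
            nn.Conv2d(64, 128, 3, stride=2, padding=1),
            nn.LeakyReLU(0.2),
            nn.AdaptiveAvgPool2d(1),
            nn.Flatten(),
            nn.Linear(128, 1),  # no sigmoid: a critic, not a binary classifier
        )

    def forward(self, f):
        return self.net(f)
```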
Using an adversarial training scheme, the network optimizes the DAM and DCM without any labeled data from the target domain, making the approach fully unsupervised. Jointly training these modules drives down the discrepancy between the source and target feature distributions, as measured by the Wasserstein distance.
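A hedged sketch of one adversarial update follows, using the standard WGAN recipe (several critic steps with weight clipping); the paper's exact Wasserstein estimator and schedule may differ. The names `source_feats` (the frozen MRI feature extractor), `mri_batch`, and `ct_batch` are hypothetical.

```python
import torch

def adversarial_step(dam, dcm, source_feats, mri_batch, ct_batch,
                     opt_dam, opt_dcm, n_critic=5, clip=0.01):
    # 1) Critic (DCM) updates: widen the score gap between source features
    #    and adapted target features -- an estimate of the Wasserstein-1 distance.
    for _ in range(n_critic):
        with torch.no_grad():
            f_src = source_feats(mri_batch)   # frozen MRI path
        f_tgt = dam(ct_batch).detach()        # adapted CT path (no DAM gradient here)
        loss_dcm = dcm(f_tgt).mean() - dcm(f_src).mean()
        opt_dcm.zero_grad()
        loss_dcm.backward()
        opt_dcm.step()
        for p in dcm.parameters():            # weight clipping as in the original
            p.data.clamp_(-clip, clip)        # WGAN paper; an assumption here

    # 2) DAM update: push adapted CT features toward the scores the critic
    #    assigns to source features, shrinking the distribution gap.
    loss_dam = -dcm(dam(ct_batch)).mean()
    opt_dam.zero_grad()
    loss_dam.backward()
    opt_dam.step()
    return loss_dcm.item(), loss_dam.item()
```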
Results
The authors validate the framework by adapting a ConvNet trained on MRI images to unpaired CT data for cardiac structure segmentation. Quantitative metrics, the Dice coefficient and average surface distance (ASD), show substantial gains over directly applying the MRI-trained model to CT without adaptation, indicating that accurate segmentation is attainable without additional labeling cost.
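For reference, Dice measures the volumetric overlap between a predicted mask P and the ground truth G as 2|P ∩ G| / (|P| + |G|). The sketch below is a standard NumPy implementation of this metric, not code from the paper; ASD additionally requires extracting segmentation surfaces and averaging point-to-surface distances, which is omitted here.

```python
import numpy as np

def dice_coefficient(pred, gt, eps=1e-7):
    """Dice = 2|P ∩ G| / (|P| + |G|) for binary masks of any shape."""
    pred = pred.astype(bool)
    gt = gt.astype(bool)
    inter = np.logical_and(pred, gt).sum()
    # eps guards against division by zero when both masks are empty
    return (2.0 * inter + eps) / (pred.sum() + gt.sum() + eps)
```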
Implications
This work has significant implications for deploying deep learning models in medical image analysis, suggesting a pathway for extending pretrained models across imaging techniques without the burden of new, modality-specific annotations. The methodology not only reduces the need for extensive dataset annotation in clinical applications, but also advances adversarial learning techniques for unsupervised domain adaptation.
Future Directions
The framework sets a foundation for future work on unsupervised domain adaptation and cross-modality applications beyond biomedical imagery. Natural extensions include applying the methodology to other complex domain shifts among medical imaging modalities and further optimizing adversarial learning strategies within deep ConvNet architectures. Evaluating the approach in clinical practice, assessing its real-world implications, and testing model generalizability across diverse datasets remain crucial next steps.
The paper, therefore, contributes significantly to the discourse on domain adaptation in computational medicine, providing a robust unsupervised learning framework capable of overcoming the inherent challenges of cross-modality ConvNet applications in medical imaging.