Language Guided Domain Generalized Medical Image Segmentation (2404.01272v2)
Abstract: Single source domain generalization (SDG) holds promise for more reliable and consistent image segmentation across real-world clinical settings particularly in the medical domain, where data privacy and acquisition cost constraints often limit the availability of diverse datasets. Depending solely on visual features hampers the model's capacity to adapt effectively to various domains, primarily because of the presence of spurious correlations and domain-specific characteristics embedded within the image features. Incorporating text features alongside visual features is a potential solution to enhance the model's understanding of the data, as it goes beyond pixel-level information to provide valuable context. Textual cues describing the anatomical structures, their appearances, and variations across various imaging modalities can guide the model in domain adaptation, ultimately contributing to more robust and consistent segmentation. In this paper, we propose an approach that explicitly leverages textual information by incorporating a contrastive learning mechanism guided by the text encoder features to learn a more robust feature representation. We assess the effectiveness of our text-guided contrastive feature alignment technique in various scenarios, including cross-modality, cross-sequence, and cross-site settings for different segmentation tasks. Our approach achieves favorable performance against existing methods in literature. Our code and model weights are available at https://github.com/ShahinaKK/LG_SDG.git.
- “Domain adaptation for medical image analysis: a survey,” IEEE Transactions on Biomedical Engineering, vol. 69, no. 3, pp. 1173–1185, 2021.
- “Unsupervised domain adaptation by backpropagation,” in International conference on machine learning. PMLR, 2015, pp. 1180–1189.
- “Domain generalization via invariant feature representation,” in International conference on machine learning. PMLR, 2013, pp. 10–18.
- “Causality-inspired single-source domain generalization for medical image segmentation,” IEEE Transactions on Medical Imaging, vol. 42, no. 4, pp. 1095–1106, 2022.
- “Rethinking data augmentation for single-source domain generalization in medical image segmentation,” in Proceedings of the AAAI Conference on Artificial Intelligence, 2023, vol. 37, pp. 2366–2374.
- “Generalizable cross-modality medical image segmentation via style augmentation and dual normalization,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 20856–20865.
- “Devil is in channels: Contrastive single domain generalization for medical image segmentation,” arXiv preprint arXiv:2306.05254, 2023.
- “Generalizing deep learning for medical image segmentation to unseen domains via deep stacked transformation,” IEEE transactions on medical imaging, vol. 39, no. 7, pp. 2531–2540, 2020.
- “Adversarial consistency for single domain generalization in medical image segmentation,” in International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 2022, pp. 671–681.
- “Miccai multi-atlas labeling beyond the cranial vault–workshop and challenge,” in Proc. MICCAI Multi-Atlas Labeling Beyond Cranial Vault—Workshop Challenge, 2015, vol. 5, p. 12.
- “Chaos challenge-combined (ct-mr) healthy abdominal organ segmentation,” Medical Image Analysis, vol. 69, pp. 101950, 2021.
- “Cardiac segmentation on late gadolinium enhancement mri: a benchmark study from multi-sequence cardiac mr segmentation challenge,” Medical Image Analysis, vol. 81, pp. 102528, 2022.
- “Domain specific convolution and high frequency reconstruction based unsupervised domain adaptation for medical image segmentation,” in International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 2022.
- “Learning transferable visual models from natural language supervision,” in International conference on machine learning. PMLR, 2021, pp. 8748–8763.
- “V-net: Fully convolutional neural networks for volumetric medical image segmentation,” in 2016 fourth international conference on 3D vision (3DV). Ieee, 2016, pp. 565–571.