Multi-Scale Cross Contrastive Learning for Semi-Supervised Medical Image Segmentation (2306.14293v1)
Abstract: Semi-supervised learning has demonstrated great potential in medical image segmentation by utilizing knowledge from unlabeled data. However, most existing approaches do not explicitly capture high-level semantic relations between distant regions, which limits their performance. In this paper, we focus on representation learning for semi-supervised learning, by developing a novel Multi-Scale Cross Supervised Contrastive Learning (MCSC) framework, to segment structures in medical images. We jointly train CNN and Transformer models, regularising their features to be semantically consistent across different scales. Our approach contrasts multi-scale features based on ground-truth and cross-predicted labels, in order to extract robust feature representations that reflect intra- and inter-slice relationships across the whole dataset. To tackle class imbalance, we take into account the prevalence of each class to guide contrastive learning and ensure that features adequately capture infrequent classes. Extensive experiments on two multi-structure medical segmentation datasets demonstrate the effectiveness of MCSC. It not only outperforms state-of-the-art semi-supervised methods by more than 3.0% in Dice, but also greatly reduces the performance gap with fully supervised methods.
- “Deep learning for cardiac image segmentation: a review” In Frontiers in Cardiovascular Medicine 7 Frontiers Media SA, 2020, pp. 25
- Dinggang Shen, Guorong Wu and Heung-Il Suk “Deep learning in medical image analysis” In Annual review of biomedical engineering 19 Annual Reviews, 2017, pp. 221–248
- “Embracing imperfect datasets: A review of deep learning solutions for medical image segmentation” In Medical Image Analysis 63 Elsevier, 2020, pp. 101693
- “Semi-supervised medical image segmentation via cross teaching between cnn and transformer” In International Conference on Medical Imaging with Deep Learning, 2022, pp. 820–833 PMLR
- “Pixel contrastive-consistent semi-supervised semantic segmentation” In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 7273–7282
- Yassine Ouali, Celine Hudelot and Myriam Tami “Semi-supervised semantic segmentation with cross-consistency training” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 12674–12684
- “Reference-Guided Pseudo-Label Generation for Medical Semantic Segmentation” In The Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022, pp. 2171–2179
- “Semi-supervised semantic segmentation with cross pseudo supervision” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 2613–2622
- “3D Semi-Supervised Learning with Uncertainty-Aware Multi-View Co-Training” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 3646–3655
- “Semi-supervised medical image segmentation via a tripled-uncertainty guided mean teacher model with contrastive learning” In Medical Image Analysis 79 Elsevier, 2022, pp. 102447
- “Self-supervised sequence recovery for semi-supervised retinal layer segmentation” In IEEE Journal of Biomedical and Health Informatics, 2022, pp. 3872–3883 IEEE
- “Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results” In Advances in neural information processing systems 30, 2017
- “Deep co-training for semi-supervised image recognition” In Proceedings of the european conference on computer vision (eccv), 2018, pp. 135–152
- “Uncertainty-aware self-ensembling model for semi-supervised 3D left atrium segmentation” In Medical Image Computing and Computer Assisted Intervention–MICCAI 2019: 22nd International Conference, Shenzhen, China, October 13–17, 2019, Proceedings, Part II 22, 2019, pp. 605–613 Springer
- “Exploring simple siamese representation learning” In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2021, pp. 15750–15758
- “Supervised Contrastive Learning” In Advances in Neural Information Processing Systems 33 Curran Associates, Inc., 2020, pp. 18661–18673 URL: https://proceedings.neurips.cc/paper_files/paper/2020/file/d89a66c7c80a29b1bdbab0f2a1a94af8-Paper.pdf
- “Cross-patch dense contrastive learning for semi-supervised segmentation of cellular nuclei in histopathologic images” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 11666–11675
- “Contrastive learning of global and local features for medical image segmentation with limited annotations” In Advances in Neural Information Processing Systems 33, 2020, pp. 12546–12558
- “Exploring cross-image pixel contrast for semantic segmentation” In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 7303–7313
- Zeju Li, Konstantinos Kamnitsas and Ben Glocker “Analyzing overfitting under class imbalance in neural networks for image segmentation” In IEEE transactions on medical imaging 40.3 IEEE, 2020, pp. 1065–1077
- “Semi-supervised contrastive learning for label-efficient medical image segmentation” In Medical Image Computing and Computer Assisted Intervention–MICCAI 2021: 24th International Conference, Strasbourg, France, September 27–October 1, 2021, Proceedings, Part II 24, 2021, pp. 481–490 Springer
- “Local contrastive loss with pseudo-label based self-training for semi-supervised medical image segmentation” In Medical Image Analysis 87 Elsevier, 2023, pp. 102792
- “RCPS: Rectified Contrastive Pseudo Supervision for Semi-Supervised Medical Image Segmentation” In arXiv preprint arXiv:2301.05500, 2023
- “Deep learning techniques for automatic MRI cardiac multi-structures segmentation and diagnosis: is the problem solved?” In IEEE transactions on medical imaging 37.11 ieee, 2018, pp. 2514–2525
- “Miccai multi-atlas labeling beyond the cranial vault–workshop and challenge” In Proc. MICCAI Multi-Atlas Labeling Beyond Cranial Vault—Workshop Challenge 5, 2015, pp. 12
- “Deep co-training for semi-supervised image segmentation” In Pattern Recognition 107 Elsevier, 2020, pp. 107269
- “Semi-supervised medical image segmentation via learning consistency under transformations” In Medical Image Computing and Computer Assisted Intervention–MICCAI 2019: 22nd International Conference, Shenzhen, China, October 13–17, 2019, Proceedings, Part VI 22, 2019, pp. 810–818 Springer
- “A simple framework for contrastive learning of visual representations” In International conference on machine learning, 2020, pp. 1597–1607 PMLR
- “Bootstrap your own latent-a new approach to self-supervised learning” In Advances in neural information processing systems 33, 2020, pp. 21271–21284
- Yuandong Tian, Xinlei Chen and Surya Ganguli “Understanding self-supervised learning dynamics without contrastive pairs” In International Conference on Machine Learning, 2021, pp. 10268–10278 PMLR
- “Momentum contrast for unsupervised visual representation learning” In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 9729–9738
- “Propagate yourself: Exploring pixel-level consistency for unsupervised visual representation learning” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 16684–16693
- “Dense contrastive learning for self-supervised visual pre-training” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 3024–3033
- “Class-aware contrastive semi-supervised learning” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 14421–14430
- “Balanced contrastive learning for long-tailed visual recognition” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 6908–6917
- “Transunet: Transformers make strong encoders for medical image segmentation” In arXiv preprint arXiv:2102.04306, 2021
- “Decoupled weight decay regularization” In arXiv preprint arXiv:1711.05101, 2017
- “Interpolation consistency training for semi-supervised learning” In Neural Networks 145 Elsevier, 2022, pp. 90–106
- Yassine Ouali, Céline Hudelot and Myriam Tami “Semi-supervised semantic segmentation with cross-consistency training” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 12674–12684
- “BATFormer: Towards Boundary-Aware Lightweight Transformer for Efficient Medical Image Segmentation” In IEEE Journal of Biomedical and Health Informatics IEEE, 2023
- “nnformer: Interleaved transformer for volumetric segmentation” In arXiv preprint arXiv:2109.03201, 2021