SupReMix: Supervised Contrastive Learning for Medical Imaging Regression with Mixup (2309.16633v4)
Abstract: In medical image analysis, regression plays a critical role in computer-aided diagnosis. It enables quantitative measurements such as age prediction from structural imaging, cardiac function quantification, and molecular measurement from PET scans. While deep learning has shown promise for these tasks, most approaches focus solely on optimizing regression loss or model architecture, neglecting the quality of learned feature representations which are crucial for robust clinical predictions. Directly applying representation learning techniques designed for classification to regression often results in fragmented representations in the latent space, yielding sub-optimal performance. In this paper, we argue that the potential of contrastive learning for medical image regression has been overshadowed due to the neglect of two crucial aspects: ordinality-awareness and hardness. To address these challenges, we propose Supervised Contrastive Learning for Medical Imaging Regression with Mixup (SupReMix). It takes anchor-inclusive mixtures (mixup of the anchor and a distinct negative sample) as hard negative pairs and anchor-exclusive mixtures (mixup of two distinct negative samples) as hard positive pairs at the embedding level. This strategy formulates harder contrastive pairs by integrating richer ordinal information. Through theoretical analysis and extensive experiments on six datasets spanning MRI, X-ray, ultrasound, and PET modalities, we demonstrate that SupReMix fosters continuous ordered representations, significantly improving regression performance.
- Image processing and quality control for the first 10,000 brain imaging datasets from uk biobank. NeuroImage, 166:400–424, 2018. ISSN 1053-8119. doi: https://doi.org/10.1016/j.neuroimage.2017.10.034.
- Uk biobank: Current status and what it means for epidemiology. Health Policy and Technology, 1(3):123–126, 2012. ISSN 2211-8837. doi: https://doi.org/10.1016/j.hlpt.2012.07.003. URL https://www.sciencedirect.com/science/article/pii/S2211883712000597.
- The lifespan human connectome project in aging: an overview. Neuroimage, 185:335–348, 2019.
- Semeval-2017 task 1: Semantic textual similarity-multilingual and cross-lingual focused evaluation. arXiv preprint arXiv:1708.00055, 2017.
- A simple framework for contrastive learning of visual representations. In International conference on machine learning, pp. 1597–1607. PMLR, 2020a.
- Improved baselines with momentum contrastive learning. arXiv preprint arXiv:2003.04297, 2020b.
- Weakly supervised regression with interval targets. arXiv preprint arXiv:2306.10458, 2023.
- Uci machine learning repository, 2019. URL http://archive.ics.uci.edu/ml. University of California, Irvine, School of Information and Computer Sciences.
- Conditional alignment and uniformity for contrastive learning with continuous proxy labels. In Med-NeurIPS-Workshop NeurIPS, 2021.
- SimCSE: Simple contrastive learning of sentence embeddings. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pp. 6894–6910, Online and Punta Cana, Dominican Republic, November 2021. Association for Computational Linguistics. doi: 10.18653/v1/2021.emnlp-main.552. URL https://aclanthology.org/2021.emnlp-main.552.
- Can spatiotemporal 3d cnns retrace the history of 2d cnns and imagenet? In Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, pp. 6546–6555, 2018.
- Deep residual learning for image recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778, 2016. doi: 10.1109/CVPR.2016.90.
- Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 9729–9738, 2020.
- Contrastive learning with adversarial examples. Advances in Neural Information Processing Systems, 33:17081–17093, 2020.
- Holmes: Health online model ensemble serving for deep learning models in intensive care units. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 1614–1624, 2020.
- Hard negative mixing for contrastive learning. Advances in Neural Information Processing Systems, 33:21798–21809, 2020.
- Supervised contrastive learning. Advances in neural information processing systems, 33:18661–18673, 2020.
- Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
- Spatial topography of individual-specific cortical networks predicts human cognition, personality, and emotion. Cerebral cortex, 29(6):2533–2551, 2019.
- A comprehensive analysis of deep regression. IEEE transactions on pattern analysis and machine intelligence, 42(9):2065–2081, 2019.
- Nltk: The natural language toolkit. arXiv preprint cs/0205028, 2002.
- Multimodal population brain imaging in the uk biobank prospective epidemiological study. Nature neuroscience, 19(11):1523–1536, 2016.
- Agedb: the first manually collected, in-the-wild age database. In proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp. 51–59, 2017.
- Ordinal regression with multiple output cnn for age estimation. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4920–4928, 2016.
- Accurate brain age prediction with lightweight deep neural networks. Medical image analysis, 68:101871, 2021.
- Glove: Global vectors for word representation. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp. 1532–1543, 2014.
- Contrastive learning with hard negative samples. arXiv preprint arXiv:2010.04592, 2020.
- Deep expectation of real and apparent age from a single image without facial landmarks. International Journal of Computer Vision, 126(2-4):144–157, 2018.
- Local-global parcellation of the human cerebral cortex from intrinsic functional connectivity mri. Cerebral cortex, 28(9):3095–3114, 2018.
- Learnable latent embeddings for joint behavioural and neural analysis. Nature, pp. 1–9, 2023.
- Kihyuk Sohn. Improved deep metric learning with multi-class n-pair loss objective. Advances in neural information processing systems, 29, 2016.
- 3d self-supervised methods for medical imaging. Advances in neural information processing systems, 33:18158–18172, 2020.
- Laurens JP van der Maaten and Geoffrey E Hinton. Visualizing high-dimensional data using t-sne. Journal of Machine Learning Research, 9(Nov):2579–2605, 2008.
- GLUE: A multi-task benchmark and analysis platform for natural language understanding. In Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, pp. 353–355, Brussels, Belgium, November 2018. Association for Computational Linguistics. doi: 10.18653/v1/W18-5446. URL https://aclanthology.org/W18-5446.
- Contrastive regression for domain adaptation on gaze estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 19376–19385, 2022.
- Distance metric learning for large margin nearest neighbor classification. Journal of machine learning research, 10(2), 2009.
- Synthetic data can also teach: Synthesizing effective data for unsupervised visual representation learning. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 37, pp. 2866–2874, 2023.
- Delving into deep imbalanced regression. In Marina Meila and Tong Zhang (eds.), Proceedings of the 38th International Conference on Machine Learning, volume 139 of Proceedings of Machine Learning Research, pp. 11842–11851. PMLR, 18–24 Jul 2021. URL https://proceedings.mlr.press/v139/yang21m.html.
- Group-aware contrastive regression for action quality assessment. In Proceedings of the IEEE/CVF international conference on computer vision, pp. 7919–7928, 2021.
- Supervised contrastive regression. arXiv preprint arXiv:2210.01189, 2022.
- mixup: Beyond empirical risk minimization. In International Conference on Learning Representations, 2018.
- Improving deep regression with ordinal entropy. In The Eleventh International Conference on Learning Representations, 2022.