Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Improving Fairness of Automated Chest X-ray Diagnosis by Contrastive Learning (2401.15111v1)

Published 25 Jan 2024 in eess.IV, cs.CV, and cs.LG

Abstract: Purpose: Limited studies exploring concrete methods or approaches to tackle and enhance model fairness in the radiology domain. Our proposed AI model utilizes supervised contrastive learning to minimize bias in CXR diagnosis. Materials and Methods: In this retrospective study, we evaluated our proposed method on two datasets: the Medical Imaging and Data Resource Center (MIDRC) dataset with 77,887 CXR images from 27,796 patients collected as of April 20, 2023 for COVID-19 diagnosis, and the NIH Chest X-ray (NIH-CXR) dataset with 112,120 CXR images from 30,805 patients collected between 1992 and 2015. In the NIH-CXR dataset, thoracic abnormalities include atelectasis, cardiomegaly, effusion, infiltration, mass, nodule, pneumonia, pneumothorax, consolidation, edema, emphysema, fibrosis, pleural thickening, or hernia. Our proposed method utilizes supervised contrastive learning with carefully selected positive and negative samples to generate fair image embeddings, which are fine-tuned for subsequent tasks to reduce bias in chest X-ray (CXR) diagnosis. We evaluated the methods using the marginal AUC difference ($\delta$ mAUC). Results: The proposed model showed a significant decrease in bias across all subgroups when compared to the baseline models, as evidenced by a paired T-test (p<0.0001). The $\delta$ mAUC obtained by our method were 0.0116 (95\% CI, 0.0110-0.0123), 0.2102 (95% CI, 0.2087-0.2118), and 0.1000 (95\% CI, 0.0988-0.1011) for sex, race, and age on MIDRC, and 0.0090 (95\% CI, 0.0082-0.0097) for sex and 0.0512 (95% CI, 0.0512-0.0532) for age on NIH-CXR, respectively. Conclusion: Employing supervised contrastive learning can mitigate bias in CXR diagnosis, addressing concerns of fairness and reliability in deep learning-based diagnostic methods.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (30)
  1. Artificial intelligence in healthcare. Nat Biomed Eng, 2(10):719–731, October 2018. ISSN 2157-846X. 10.1038/s41551-018-0305-z.
  2. Artificial intelligence in tumor subregion analysis based on medical imaging: A review. J. Appl. Clin. Med. Phys., 22(7):10–26, July 2021. ISSN 1526-9914. 10.1002/acm2.13321.
  3. ChestX-Ray8: Hospital-Scale chest X-Ray database and benchmarks on Weakly-Supervised classification and localization of common thorax diseases. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 3462–3471, July 2017. 10.1109/CVPR.2017.369.
  4. CheXNet: radiologist-level pneumonia detection on chest x-rays with deep learning. arXiv:1711.05225 [cs, stat], December 2017.
  5. Primary Open-Angle glaucoma diagnosis from optic disc photographs using a siamese network. Ophthalmol Sci, 2(4):100209, December 2022a. ISSN 2666-9145. 10.1016/j.xops.2022.100209.
  6. Automated diagnosing primary open-angle glaucoma from fundus image by simulating human’s grading with deep learning. Sci. Rep., 12(1):14080, August 2022b. ISSN 2045-2322. 10.1038/s41598-022-17753-4.
  7. Implementing machine learning in health care — addressing ethical challenges. N. Engl. J. Med., 378(11):981–983, 2018. ISSN 0028-4793. 10.1056/nejmp1714229.
  8. A survey on bias and fairness in machine learning. ACM Comput. Surv., 54(6):1–35, July 2021. ISSN 0360-0300. 10.1145/3457607.
  9. Improving model fairness in image-based computer-aided diagnosis. Nat. Commun., 14(1):6261, October 2023a. ISSN 2041-1723. 10.1038/s41467-023-41974-4.
  10. Underdiagnosis bias of artificial intelligence algorithms applied to chest radiographs in under-served patient populations. Nat. Med., 27(12):2176–2182, December 2021. ISSN 1078-8956, 1546-170X. 10.1038/s41591-021-01595-0.
  11. Evaluate underdiagnosis and overdiagnosis bias of deep learning model on primary open-angle glaucoma diagnosis in under-served patient populations. In AMIA Informatics Summit, pages 1–9, January 2023b.
  12. Classification of COVID-19 in chest x-ray images using DeTraC deep convolutional neural network. Appl Intell (Dordr), 51(2):854–864, 2021. ISSN 1573-7497, 0924-669X. 10.1007/s10489-020-01829-7.
  13. Deep-COVID: Predicting COVID-19 from chest x-ray images using deep transfer learning. Med. Image Anal., 65:101794, October 2020. ISSN 1361-8415, 1361-8423. 10.1016/j.media.2020.101794.
  14. An end-to-end framework for diagnosing COVID-19 pneumonia via parallel recursive MLP module and Bi-LTSM correlation. April 2023a.
  15. A new classification method for diagnosing covid-19 pneumonia based on joint cnn features of chest x-ray images and parallel pyramid mlp-mixer module. Neural Computing and Applications, pages 1–13, 2023b.
  16. AI fairness via domain adaptation. March 2021.
  17. Training confounder-free deep learning models for medical applications. Nat. Commun., 11(1):6010, November 2020. ISSN 2041-1723. 10.1038/s41467-020-19784-9.
  18. FairDisCo: Fairer AI in dermatology via disentanglement contrastive learning. In Computer Vision – ECCV 2022 Workshops, pages 185–202. Springer Nature Switzerland, 2023. 10.1007/978-3-031-25069-9_13.
  19. FairPrune: Achieving fairness through pruning for dermatological disease diagnosis. In Medical Image Computing and Computer Assisted Intervention – MICCAI 2022, pages 743–753. Springer Nature Switzerland, 2022. 10.1007/978-3-031-16431-6_70.
  20. Improving the fairness of chest x-ray classifiers. In Proceedings of the Conference on Health, Inference, and Learning, pages 204–233. PMLR, April 2022.
  21. An empirical characterization of fair machine learning for clinical risk prediction. J. Biomed. Inform., 113:103621, January 2021. ISSN 1532-0464, 1532-0480. 10.1016/j.jbi.2020.103621.
  22. Supervised contrastive learning. Adv. Neural Inf. Process. Syst., 33:18661–18673, 2020. ISSN 1049-5258.
  23. The 2021 SIIM-FISABIO-RSNA machine learning COVID-19 challenge: Annotation and standard exam classification of COVID-19 chest radiographs. J. Digit. Imaging, 36(1):365–372, February 2023. ISSN 0897-1889, 1618-727X. 10.1007/s10278-022-00706-8.
  24. Pairwise fairness for ranking and regression. In AAAI, volume 34, pages 5248–5255, April 2020. 10.1609/aaai.v34i04.5970.
  25. Densely connected convolutional networks. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 4700–4708. IEEE, July 2017. ISBN 9781538604571. 10.1109/cvpr.2017.243.
  26. MIMIC-CXR: A large publicly available database of labeled chest radiographs. arXiv preprint arXiv:1901.07042, 1(2), 2019.
  27. CheXpert: a large chest radiograph dataset with uncertainty labels and expert comparison. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, pages 590–597, 2019.
  28. Adam: a method for stochastic optimization. In International Conference on Learning Representations (ICLR), pages 1–15, 2015.
  29. Vladimir Vapnik. Principles of risk minimization for learning theory. Adv. Neural Inf. Process. Syst., 4, 1991. ISSN 1049-5258.
  30. Achieving fairness through adversarial learning: an application to recidivism prediction. June 2018.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Mingquan Lin (19 papers)
  2. Tianhao Li (35 papers)
  3. Zhaoyi Sun (6 papers)
  4. Gregory Holste (10 papers)
  5. Ying Ding (126 papers)
  6. Fei Wang (573 papers)
  7. George Shih (16 papers)
  8. Yifan Peng (147 papers)
Citations (2)
X Twitter Logo Streamline Icon: https://streamlinehq.com