Evaluating the Fairness of the MIMIC-IV Dataset and a Baseline Algorithm: Application to the ICU Length of Stay Prediction (2401.00902v1)
Abstract: This paper uses the MIMIC-IV dataset to examine the fairness and bias in an XGBoost binary classification model predicting the Intensive Care Unit (ICU) length of stay (LOS). Highlighting the critical role of the ICU in managing critically ill patients, the study addresses the growing strain on ICU capacity. It emphasizes the significance of LOS prediction for resource allocation. The research reveals class imbalances in the dataset across demographic attributes and employs data preprocessing and feature extraction. While the XGBoost model performs well overall, disparities across race and insurance attributes reflect the need for tailored assessments and continuous monitoring. The paper concludes with recommendations for fairness-aware machine learning techniques for mitigating biases and the need for collaborative efforts among healthcare professionals and data scientists.
- Predicting intensive care unit length of stay and mortality using patient vital signs: Machine learning model development and validation. JMIR Medical Informatics, 9:e21347, 2021.
- Quality measurement at intensive care units: Which indicators should we use? Journal of Critical Care, 22:267–274, 2007.
- Association between insurance status and hospital length of stay following trauma. The American Surgeon, 82(3):281–288, 2016.
- Equality of opportunity in supervised learning. arXiv, 2016.
- Prediction of intensive care unit length of stay in the MIMIC-IV dataset. Applied Sciences, 13(12):6930, 2023.
- MIMIC-IV, a freely accessible electronic health record dataset. Scientific Data, 10(1):1, 2023.
- Improving fairness in the prediction of heart failure length of stay and mortality by integrating social determinants of health. Circulation: Heart Failure, 15(11):e009473, 2022.
- K. Mathews and E. Long. A conceptual framework for improving critical care patient flow and bed use. Annals of the American Thoracic Society, 12(6):886–894, June 2015.
- J. Paulus and D. Kent. Predictably unequal: Understanding and addressing concerns that algorithmic clinical prediction may increase health disparities. NPJ Digital Medicine, 3:99, 2020.
- Length of stay data as a guide to hospital economic performance for ICU patients. Medical Care, 41:386–397, 2003.
- A systematic review of the prediction of hospital length of stay: Towards a unified framework. PLOS Digital Health, 1(4):e0000017, April 2022.
- Modeling techniques for machine learning fairness: A survey. arXiv, 2022.
- M. Weil and W. Tang. From intensive care to critical care medicine: A historical perspective. Am. J. Respir. Crit. Care Med., 183:1451–1453, 2011.
- Alexandra Kakadiaris (1 paper)