Evaluating the Fairness of the MIMIC-IV Dataset and a Baseline Algorithm: Application to the ICU Length of Stay Prediction
Abstract: This paper uses the MIMIC-IV dataset to examine fairness and bias in an XGBoost binary classification model that predicts Intensive Care Unit (ICU) length of stay (LOS). The ICU plays a critical role in managing critically ill patients, and the study is motivated by the growing strain on ICU capacity and by the importance of LOS prediction for resource allocation. The research reveals class imbalances in the dataset across demographic attributes and applies data preprocessing and feature extraction. Although the XGBoost model performs well overall, disparities across race and insurance attributes point to the need for tailored assessments and continuous monitoring. The paper concludes by recommending fairness-aware machine learning techniques to mitigate bias and by calling for collaboration between healthcare professionals and data scientists.
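The kind of group-wise assessment the abstract describes can be illustrated with a minimal sketch. It assumes binary labels (for example, 1 for a long ICU stay), binary predictions from any classifier, and one sensitive attribute per patient. The function names `group_rates` and `tpr_gap`, the group labels, and the toy data are hypothetical and do not come from the paper or from MIMIC-IV. The gap in true-positive rates is one common fairness measure (an equal-opportunity gap); the paper may use different metrics.

```python
from collections import defaultdict


def group_rates(y_true, y_pred, groups):
    """Return per-group true-positive rate, false-positive rate, and size."""
    counts = defaultdict(lambda: {"tp": 0, "fn": 0, "fp": 0, "tn": 0})
    for t, p, g in zip(y_true, y_pred, groups):
        # Pick the confusion-matrix cell for this (label, prediction) pair.
        key = ("tp" if p else "fn") if t else ("fp" if p else "tn")
        counts[g][key] += 1

    rates = {}
    for g, c in counts.items():
        pos = c["tp"] + c["fn"]  # actual positives in this group
        neg = c["fp"] + c["tn"]  # actual negatives in this group
        rates[g] = {
            "tpr": c["tp"] / pos if pos else float("nan"),
            "fpr": c["fp"] / neg if neg else float("nan"),
            "n": pos + neg,
        }
    return rates


def tpr_gap(rates):
    """Largest difference in true-positive rate between any two groups."""
    # NaN != NaN, so this filter skips groups with no positive cases.
    tprs = [r["tpr"] for r in rates.values() if r["tpr"] == r["tpr"]]
    return max(tprs) - min(tprs)


# Toy data only (hypothetical, not drawn from MIMIC-IV).
y_true = [1, 1, 0, 0, 1, 1, 0, 0]
y_pred = [1, 1, 0, 1, 1, 0, 0, 0]
groups = ["A", "A", "A", "A", "B", "B", "B", "B"]

rates = group_rates(y_true, y_pred, groups)
gap = tpr_gap(rates)
for name, r in sorted(rates.items()):
    print(name, r)
print("TPR gap:", gap)
```

A gap near zero means the classifier identifies long-stay patients at a similar rate in every group; a large gap flags a disparity worth investigating. Reporting the group size `n` alongside each rate matters here, because the class imbalances noted in the abstract can make rates for small groups unstable.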