Contrasting Deep Learning Models for Direct Respiratory Insufficiency Detection Versus Blood Oxygen Saturation Estimation (2407.20989v1)
Abstract: We contrast high effectiveness of state of the art deep learning architectures designed for general audio classification tasks, refined for respiratory insufficiency (RI) detection and blood oxygen saturation (SpO$_2$) estimation and classification through automated audio analysis. Recently, multiple deep learning architectures have been proposed to detect RI in COVID patients through audio analysis, achieving accuracy above 95% and F1-score above 0.93. RI is a condition associated with low SpO$_2$ levels, commonly defined as the threshold SpO$_2$ <92%. While SpO$_2$ serves as a crucial determinant of RI, a medical doctor's diagnosis typically relies on multiple factors. These include respiratory frequency, heart rate, SpO$_2$ levels, among others. Here we study pretrained audio neural networks (CNN6, CNN10 and CNN14) and the Masked Autoencoder (Audio-MAE) for RI detection, where these models achieve near perfect accuracy, surpassing previous results. Yet, for the regression task of estimating SpO$_2$ levels, the models achieve root mean square error values exceeding the accepted clinical range of 3.5% for finger oximeters. Additionally, Pearson correlation coefficients fail to surpass 0.3. As deep learning models perform better in classification than regression, we transform SpO$_2$-regression into a SpO$_2$-threshold binary classification problem, with a threshold of 92%. However, this task still yields an F1-score below 0.65. Thus, audio analysis offers valuable insights into a patient's RI status, but does not provide accurate information about actual SpO$_2$ levels, indicating a separation of domains in which voice and speech biomarkers may and may not be useful in medical diagnostics under current technologies.
- doi:10.1007/978-3-031-34344-5_32. URL https://doi.org/10.1007/978-3-031-34344-5_32
- doi:10.11606/9786587596198. URL http://www.livrosabertos.sibi.usp.br/portaldelivrosUSP/catalog/book/647
- doi:10.1109/OJEMB.2020.3026468.
- doi:10.1109/ojemb.2020.3026928.
- doi:https://doi.org/10.1016/j.compbiomed.2021.104944. URL https://www.sciencedirect.com/science/article/pii/S0010482521007381
- doi:10.21437/INTERSPEECH.2022-10961. URL https://doi.org/10.21437/Interspeech.2022-10961
- doi:10.18653/v1/N19-1423. URL https://aclanthology.org/N19-1423