- The paper introduces an AI-based smartphone application that achieves over 90% accuracy in preliminary COVID-19 diagnosis by integrating deep transfer learning and multiple classifiers.
- It employs advanced audio processing techniques, converting cough sounds into Mel spectrograms and MFCC features to facilitate robust machine learning analysis.
- The system’s multi-tiered architecture supports remote screening and public health monitoring while significantly reducing false positive and negative rates.
AI4COVID-19: AI-Enabled Preliminary Diagnosis for COVID-19 from Cough Samples via an App
The paper presents an approach leveraging AI for the preliminary diagnosis of COVID-19 through cough analysis. Deployable as a smartphone application, this system seeks to fill the void of scalable, accessible testing methods that are particularly pertinent during pandemics with high transmission rates like COVID-19.
Methodology and Implementation
The AI4COVID-19 application records a user’s cough and processes the audio data using a cloud-based AI engine to deliver a diagnostic result within two minutes. The methodology harnesses the distinctive pathomorphological changes induced in the respiratory system by COVID-19, hypothesizing that these changes result in unique cough characteristics that are discernible to AI models.
Data Challenges and Pre-processing
A significant challenge was the dearth of labeled COVID-19 cough data. To mitigate this, the paper employs transfer learning, utilizing pre-trained models on similar tasks to improve the AI's diagnostic accuracy. Data pre-processing includes conversion to Mel spectrograms and extraction of features such as Mel Frequency Cepstral Coefficients (MFCC), transforming audio data into formats amenable to machine learning.
AI Architecture
The AI architecture is notably risk-averse and composed of three parallel classifiers:
- Deep Transfer Learning-based Multi-Class (DTL-MC) Classifier - Uses a convolutional neural network (CNN) to distinguish COVID-19 from non-COVID-19 related coughs.
- Classical Machine Learning-based Multi-Class (CML-MC) Classifier - Utilizes support vector machines drawing on manually extracted features.
- Deep Transfer Learning-based Binary-Class (DTL-BC) Classifier - Focuses solely on binary classification between COVID-19 and non-COVID-19 coughs.
A mediator consolidates results from these classifiers, declaring a result as inconclusive if consensus is absent, thereby prioritizing the reduction of false diagnoses.
Results and Performance
The paper reports promising accuracy rates for the AI4COVID-19 system, with an emphasis on minimizing misdiagnosis. The approach achieves an overall classification accuracy exceeding 90% in some configurations, with the multi-pronged mediator approach noted for reducing both false positive and negative rates significantly.
Implications and Future Directions
This system offers several potential utilities:
- Remote Screening: Enables anytime, anywhere, individual-level preliminary screening, supporting overwhelmed healthcare systems and limiting pandemic spread.
- Complement to Clinical Tests: It is not intended to replace clinical-grade diagnostics but can serve as an adjunct tool, especially in resource-limited settings.
- Public Health Monitoring: By aggregating anonymized data, it can assist in broader epidemiological monitoring and analysis, aiding public health responses.
For future work, the paper suggests integrating additional data types such as breathing sounds and bio-markers to enhance diagnostic accuracy. Additionally, a greater diversity of cough samples from various other respiratory conditions is advised to strengthen model robustness.
Conclusion
The AI4COVID-19 provides an innovative, accessible alternative to traditional COVID-19 testing methods, capitalizing on AI's capability to discern subtle, condition-specific audio features. While the current model benefits from solid preliminary results, its practical efficacy will rely substantially on further development, extensive data collection, and real-world trials. The outlined framework exemplifies a pivotal step towards leveraging AI in global health crises, with implications extending beyond the immediate pandemic context.