Papers
Topics
Authors
Recent
2000 character limit reached

AFEN: Respiratory Disease Classification using Ensemble Learning (2405.05467v1)

Published 8 May 2024 in cs.SD, cs.AI, cs.LG, and eess.AS

Abstract: We present AFEN (Audio Feature Ensemble Learning), a model that leverages Convolutional Neural Networks (CNN) and XGBoost in an ensemble learning fashion to perform state-of-the-art audio classification for a range of respiratory diseases. We use a meticulously selected mix of audio features which provide the salient attributes of the data and allow for accurate classification. The extracted features are then used as an input to two separate model classifiers 1) a multi-feature CNN classifier and 2) an XGBoost Classifier. The outputs of the two models are then fused with the use of soft voting. Thus, by exploiting ensemble learning, we achieve increased robustness and accuracy. We evaluate the performance of the model on a database of 920 respiratory sounds, which undergoes data augmentation techniques to increase the diversity of the data and generalizability of the model. We empirically verify that AFEN sets a new state-of-the-art using Precision and Recall as metrics, while decreasing training time by 60%.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (23)
  1. An ensemble approach for classification of tympanic membrane conditions using soft voting classifier. Multimedia Tools and Applications, February 2024. ISSN 1573-7721. 10.1007/s11042-024-18631-z. URL https://doi.org/10.1007/s11042-024-18631-z.
  2. A deep CNN-based acoustic model for the identification of lung diseases utilizing extracted MFCC features from respiratory sounds. Multimedia Tools and Applications, March 2024. ISSN 1573-7721. 10.1007/s11042-024-18703-0. URL https://doi.org/10.1007/s11042-024-18703-0.
  3. Classification of lung sounds using convolutional neural networks. EURASIP Journal on Image and Video Processing, 2017(1):65, September 2017. ISSN 1687-5281. 10.1186/s13640-017-0213-2. URL https://doi.org/10.1186/s13640-017-0213-2.
  4. Respiratory disorder classification based on lung auscultation using MFCC, Mel Spectrogram and Chroma STFT. In 2023 International Conference on Artificial Intelligence and Applications (ICAIA) Alliance Technology Conference (ATCON-1), pages 1–7, April 2023. 10.1109/ICAIA57370.2023.10169299. URL https://ieeexplore.ieee.org/document/10169299.
  5. Automatic Detection of Patient with Respiratory Diseases Using Lung Sound Analysis. In 2018 International Conference on Content-Based Multimedia Indexing (CBMI), pages 1–6, September 2018. 10.1109/CBMI.2018.8516489. URL https://ieeexplore.ieee.org/abstract/document/8516489?casa_token=CRyT56UzoDcAAAAA:4MzVSLM8B7duXUATy-LQ3Dhj1GaXuLsckSmu7Duu16t7ikU8qkCQBTIoLc_cHr7rhdlGQ5D6vio.
  6. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’16, pages 785–794, New York, NY, USA, August 2016a. Association for Computing Machinery. ISBN 978-1-4503-4232-2. 10.1145/2939672.2939785. URL https://dl.acm.org/doi/10.1145/2939672.2939785.
  7. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, pages 785–794, 2016b.
  8. Global, regional, and national burden of chronic respiratory diseases and associated risk factors, 1990–2019: Results from the Global Burden of Disease Study 2019. Frontiers in Medicine, 10:1066804, March 2023. ISSN 2296-858X. 10.3389/fmed.2023.1066804. URL https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10088372/.
  9. On the Use of Zero-Crossing Rate for an Application of Classification of Percussive Sounds. August 2002.
  10. Hakan Gunduz. Deep Learning-Based Parkinson’s Disease Classification Using Vocal Feature Sets. IEEE Access, 7:115540–115551, 2019. ISSN 2169-3536. 10.1109/ACCESS.2019.2936564. URL https://ieeexplore.ieee.org/abstract/document/8807125. Conference Name: IEEE Access.
  11. DeepBreath—automated detection of respiratory pathology from lung auscultation in 572 pediatric outpatients across 5 countries. npj Digital Medicine, 6(1):1–12, June 2023. ISSN 2398-6352. 10.1038/s41746-023-00838-3. URL https://www.nature.com/articles/s41746-023-00838-3. Publisher: Nature Publishing Group.
  12. Early Diagnosis and Real-Time Monitoring of Regional Lung Function Changes to Prevent Chronic Obstructive Pulmonary Disease Progression to Severe Emphysema. Journal of Clinical Medicine, 10(24):5811, December 2021. ISSN 2077-0383. 10.3390/jcm10245811. URL https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8708661/.
  13. HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis. In Advances in Neural Information Processing Systems, volume 33, pages 17022–17033. Curran Associates, Inc., 2020. URL https://proceedings.neurips.cc/paper_files/paper/2020/hash/c5d736809766d46260d816d8dbc9eb44-Abstract.html.
  14. Acoustic classification and segmentation using modified spectral roll-off and variance-based features. Digital Signal Processing, 23(2):659–674, March 2013. ISSN 1051-2004. 10.1016/j.dsp.2012.10.008. URL https://www.sciencedirect.com/science/article/pii/S1051200412002473.
  15. AudioGen: Textually Guided Audio Generation, March 2023. URL http://arxiv.org/abs/2209.15352. arXiv:2209.15352 [cs, eess].
  16. Content-based audio classification and segmentation by using support vector machines. Multimedia Systems, 8(6):482–492, April 2003. ISSN 1432-1882. 10.1007/s00530-002-0065-0. URL https://doi.org/10.1007/s00530-002-0065-0.
  17. SampleRNN: An Unconditional End-to-End Neural Audio Generation Model, December 2016. URL https://arxiv.org/abs/1612.07837v2.
  18. Respiratory Disease Classification by CNN using MFCC. In 2021 IEEE 6th International Conference on Computing, Communication and Automation (ICCCA), pages 517–523, December 2021. 10.1109/ICCCA52192.2021.9666346. URL https://ieeexplore.ieee.org/document/9666346/citations?tabFilter=papers#citations. ISSN: 2642-7354.
  19. Automatic adventitious respiratory sound analysis: A systematic review. PLOS ONE, 12(5):e0177926, May 2017. ISSN 1932-6203. 10.1371/journal.pone.0177926. URL https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0177926. Publisher: Public Library of Science.
  20. A respiratory sound database for the development of automated classification. In Nicos Maglaveras, Ioanna Chouvarda, and Paulo de Carvalho, editors, Precision Medicine Powered by pHealth and Connected Health, pages 33–37. Springer, 2018. ISBN 978-981-10-7419-6. 10.1007/978-981-10-7419-6_6.
  21. An open access database for the evaluation of respiratory sound classification algorithms. Physiological measurement, 40(3):035001, 2019.
  22. Deep Recurrent Neural Networks for Audio Classification in Construction Sites. In 2020 28th European Signal Processing Conference (EUSIPCO), pages 810–814, January 2021. 10.23919/Eusipco47968.2020.9287802. URL https://ieeexplore.ieee.org/abstract/document/9287802. ISSN: 2076-1465.
  23. AHI estimation of OSAHS patients based on snoring classification and fusion model. American Journal of Otolaryngology, 44(5):103964, September 2023. ISSN 0196-0709. 10.1016/j.amjoto.2023.103964. URL https://www.sciencedirect.com/science/article/pii/S0196070923001783.

Summary

We haven't generated a summary for this paper yet.

Slide Deck Streamline Icon: https://streamlinehq.com

Whiteboard

Dice Question Streamline Icon: https://streamlinehq.com

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Lightbulb Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets

Sign up for free to view the 2 tweets with 1 like about this paper.