Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

EchoVest: Real-Time Sound Classification and Depth Perception Expressed through Transcutaneous Electrical Nerve Stimulation (2307.04604v1)

Published 10 Jul 2023 in cs.SD, cs.LG, eess.AS, and eess.SP

Abstract: Over 1.5 billion people worldwide live with hearing impairment. Despite various technologies that have been created for individuals with such disabilities, most of these technologies are either extremely expensive or inaccessible for everyday use in low-medium income countries. In order to combat this issue, we have developed a new assistive device, EchoVest, for blind/deaf people to intuitively become more aware of their environment. EchoVest transmits vibrations to the user's body by utilizing transcutaneous electric nerve stimulation (TENS) based on the source of the sounds. EchoVest also provides various features, including sound localization, sound classification, noise reduction, and depth perception. We aimed to outperform CNN-based machine-learning models, the most commonly used machine learning model for classification tasks, in accuracy and computational costs. To do so, we developed and employed a novel audio pipeline that adapts the Audio Spectrogram Transformer (AST) model, an attention-based model, for our sound classification purposes, and Fast Fourier Transforms for noise reduction. The application of Otsu's Method helped us find the optimal thresholds for background noise sound filtering and gave us much greater accuracy. In order to calculate direction and depth accurately, we applied Complex Time Difference of Arrival algorithms and SOTA localization. Our last improvement was to use blind source separation to make our algorithms applicable to multiple microphone inputs. The final algorithm achieved state-of-the-art results on numerous checkpoints, including a 95.7\% accuracy on the ESC-50 dataset for environmental sound classification.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (17)
  1. A model for estimating hearing aid coverage world-wide using historical data on hearing aid sales. International Journal of Audiology, 61(10), 841–849. https://doi.org/10.1080/14992027.2021.1962551
  2. Otsu’s Threshold Selection Method Applied in De-noising Heart Sound of the Digital Stethoscope Record. Lecture Notes in Electrical Engineering, 239–244. https://doi.org/10.1007/978-3-642-26001-8_31
  3. AST: Audio Spectrogram Transformer. ArXiv:2104.01778 https://arxiv.org/abs/2104.01778
  4. Lightweight and optimized sound source localization and tracking methods for open and closed microphone array configurations. Robotics and Autonomous Systems, 113, 63–80. https://doi.org/10.1016/j.robot.2019.01.002
  5. Hearing Aids. FDA. https://www.fda.gov/medical-devices/consumer-products/hearing-aids
  6. Independent Component Analysis: Algorithms and Applications. Neural Networks, 13(45), 411–430. https://www.cs.helsinki.fi/u/ahyvarin/papers/NN00new.pdf
  7. Principal component analysis: a review and recent developments. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 374(2065), 20150202. https://doi.org/10.1098/rsta.2015.0202
  8. Analysis of the GCC-PHAT technique for multiple sources. IEEE Xplore. https://doi.org/10.1109/ICCAS.2010.5670137
  9. Determinants of Hearing Aid Use Among Older Americans With Hearing Loss. The Gerontologist. https://doi.org/10.1093/geront/gny051
  10. National Institute on Deafness and other Communication Disorders. (2018, June 15). Cochlear Implants. NIDCD. https://www.nidcd.nih.gov/health/cochlear-implants
  11. Cochlear Implant: Cost, Pros, Cons, Risks, How It Works. Healthline. https://www.healthline.com/health/cochlear-implant
  12. Papers with Code - ESC-50 Benchmark (Audio Classification). (n.d.). Paperswithcode.com. Retrieved February 2, 2023, from https://paperswithcode.com/sota/audio-classification-on-esc-50
  13. Exploring Your Speech-to-text Options: Advantages and Disadvantages of Speech Recognition Software. Rev. https://www.rev.com/blog/speech-to-text-technology/advantages-and-disadvantages-of-speech-recognition-software
  14. Transcutaneous electrical nerve stimulator (TENS). (2018, April 4). University of Iowa Hospitals & Clinics. https://uihc.org/health-topics/transcutaneous-electrical-nerve-stimulator-tens
  15. Nonnegative Matrix Factorization: A Comprehensive Review. IEEE Transactions on Knowledge and Data Engineering, 25(6), 1336–1353. https://doi.org/10.1109/tkde.2012.51
  16. How Much Do Hearing Aids Cost? GoodRx. https://www.goodrx.com/health-topic/ear/hearing-aid-cost
  17. World Health Organization: WHO. (2019, September 18). Hearing loss. Who.int; World Health Organization: WHO. https://www.who.int/health-topics/hearing-loss#tab=tab_1‌

Summary

We haven't generated a summary for this paper yet.