Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Conformer Based Elderly Speech Recognition System for Alzheimer's Disease Detection (2206.13232v1)

Published 23 Jun 2022 in eess.AS, cs.LG, and cs.SD

Abstract: Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating preventive care to delay further progression. This paper presents the development of a state-of-the-art Conformer based speech recognition system built on the DementiaBank Pitt corpus for automatic AD detection. The baseline Conformer system trained with speed perturbation and SpecAugment based data augmentation is significantly improved by incorporating a set of purposefully designed modeling features, including neural architecture search based auto-configuration of domain-specific Conformer hyper-parameters in addition to parameter fine-tuning; fine-grained elderly speaker adaptation using learning hidden unit contributions (LHUC); and two-pass cross-system rescoring based combination with hybrid TDNN systems. An overall word error rate (WER) reduction of 13.6% absolute (34.8% relative) was obtained on the evaluation data of 48 elderly speakers. Using the final systems' recognition outputs to extract textual features, the best-published speech recognition based AD detection accuracy of 91.7% was obtained.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (10)
  1. Tianzi Wang (37 papers)
  2. Jiajun Deng (75 papers)
  3. Mengzhe Geng (42 papers)
  4. Zi Ye (20 papers)
  5. Shoukang Hu (38 papers)
  6. Yi Wang (1038 papers)
  7. Mingyu Cui (31 papers)
  8. Zengrui Jin (30 papers)
  9. Xunying Liu (92 papers)
  10. Helen Meng (204 papers)
Citations (19)

Summary

We haven't generated a summary for this paper yet.