NeuroVoz: a Castillian Spanish corpus of parkinsonian speech (2403.02371v2)
Abstract: The advancement of Parkinson's Disease (PD) diagnosis through speech analysis is hindered by a notable lack of publicly available datasets in diverse languages, which limits the reproducibility and further exploration of existing research. In response to this gap, we introduce a comprehensive corpus from 108 native Castilian Spanish speakers, comprising 55 healthy controls and 53 individuals diagnosed with PD, all of whom were under pharmacological treatment and recorded in their medication-optimized state. The dataset features a wide array of speech tasks, including sustained phonation of the five Spanish vowels, diadochokinetic tests, 16 listen-and-repeat utterances, and free monologues. It emphasizes accuracy and reliability through specialist manual transcriptions of the listen-and-repeat tasks and uses Whisper for automated transcription of the monologues, making it the most complete public corpus of Parkinsonian speech, and the first in Castilian Spanish. NeuroVoz comprises 2,903 audio recordings, averaging $26.88 \pm 3.35$ recordings per participant, offering a substantial resource for the scientific exploration of PD's impact on speech. The dataset has already underpinned several studies, which achieved a benchmark accuracy of 89% in PD speech pattern identification, indicating marked speech alterations attributable to PD. Despite these advances, the broader challenge of conducting a language-agnostic, cross-corpora analysis of Parkinsonian speech patterns remains an open area for future research. This contribution not only fills a critical void in PD speech analysis resources but also sets a new standard for the global research community in leveraging speech as a diagnostic tool for neurodegenerative diseases.
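The corpus figures quoted in the abstract can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the NeuroVoz release: the variable names and the helper `recordings_summary` are hypothetical, the toy counts in the usage lines are invented for illustration, and the use of the sample standard deviation is an assumption, since the abstract does not say which estimator produced the $\pm 3.35$.

```python
from statistics import mean, stdev

# Figures quoted in the abstract.
n_controls = 55
n_pd = 53
n_recordings = 2903

# Total number of speakers and average recordings per speaker.
n_speakers = n_controls + n_pd
mean_per_speaker = n_recordings / n_speakers

print(n_speakers)                  # 108
print(round(mean_per_speaker, 2))  # 26.88


def recordings_summary(counts):
    """Return (mean, sample standard deviation) of per-participant counts.

    Hypothetical helper: `counts` would hold one recording count per
    participant. Sample standard deviation is an assumption here.
    """
    return mean(counts), stdev(counts)


# Toy counts, invented for illustration only (not taken from the corpus).
toy_mean, toy_std = recordings_summary([24, 27, 30])
print(toy_mean, toy_std)           # 27 3.0
```

The first two printed values agree with the 108 speakers and the mean of 26.88 recordings per participant reported above. Reproducing the reported spread would require the actual per-participant counts from the corpus metadata.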
- Mendes-Laureano, J. et al. NeuroVoz: a Castillian Spanish corpus of parkinsonian speech (1.0.0), 10.5281/zenodo.10777656 (2024).