Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
153 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

EMALG: An Enhanced Mandarin Lombard Grid Corpus with Meaningful Sentences (2309.06858v2)

Published 13 Sep 2023 in cs.SD and eess.AS

Abstract: This study investigates the Lombard effect, where individuals adapt their speech in noisy environments. We introduce an enhanced Mandarin Lombard grid (EMALG) corpus with meaningful sentences , enhancing the Mandarin Lombard grid (MALG) corpus. EMALG features 34 speakers and improves recording setups, addressing challenges faced by MALG with nonsense sentences. Our findings reveal that in Mandarin, meaningful sentences are more effective in enhancing the Lombard effect. Additionally, we uncover that female exhibit a more pronounced Lombard effect than male when uttering meaningful sentences. Moreover, our results reaffirm the consistency in the Lombard effect comparison between English and Mandarin found in previous research.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (26)
  1. Etienne Lombard, “Le signe de televation de la voix,” Annu. maladies oreille larynx nez pharynx, vol. 27, pp. 101–119, 1911.
  2. “Understanding lombard speech: a review of compensation techniques towards improving speech based recognition systems,” Artificial Intelligence Review, vol. 54, pp. 2495–2523, 2021.
  3. “Applied principles of clear and lombard speech for automated intelligibility enhancement in noisy environments,” Speech Communication, vol. 48, no. 5, pp. 549–558, 2006.
  4. “Unsupervised equalization of lombard effect for speech recognition in noisy adverse environments,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 6, pp. 1379–1393, 2009.
  5. “Augmented cyclegans for continuous scale normal-to-lombard speaking style conversion.,” in Interspeech, 2019, pp. 2838–2842.
  6. “Cycle-consistent adversarial networks for non-parallel vocal effort based speaking style conversion,” in ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2019, pp. 6835–6839.
  7. “Acoustic and perceptual studies of lombard speech: Application to isolated-words automatic speech recognition,” in International conference on acoustics, speech, and signal processing. IEEE, 1990, pp. 841–844.
  8. “The lombard effect: A reflex to better communicate with others in noise,” in 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No. 99CH36258). IEEE, 1999, vol. 4, pp. 2083–2086.
  9. “An audio-visual corpus for speech perception and automatic speech recognition,” The Journal of the Acoustical Society of America, vol. 120, no. 5, pp. 2421–2424, 2006.
  10. “Effects of increasing sound pressure level on lip and jaw movement parameters and consistency in young adults,” 2006.
  11. “Analysis and compensation of lombard speech across noise type and levels with application to in-set/out-of-set speaker recognition,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 17, no. 2, pp. 366–378, 2009.
  12. “Spectral and temporal changes to speech produced in the presence of energetic and informational maskers,” The Journal of the Acoustical Society of America, vol. 128, no. 4, pp. 2059–2069, 2010.
  13. “Glottal-based analysis of the lombard effect,” in Eleventh Annual Conference of the International Speech Communication Association, 2010.
  14. “Effect of noise type and level on focus related fundamental frequency changes,” in Thirteenth Annual Conference of the International Speech Communication Association, 2012.
  15. “Lombard effect in polish speech and its comparison in english speech,” Archives of Acoustics, vol. 42, 2017.
  16. “A study on the impact of lombard effect on recognition of hindi syllabic units using cnn based multimodal asr systems,” Archives of Acoustics, vol. 45, no. 3, pp. 419–431, 2020.
  17. “The effect of lexical frequency and lombard reflex on tone hyperarticulation,” Journal of Phonetics, vol. 37, no. 2, pp. 231–247, 2009.
  18. Sunhee Kim, “Durational characteristics of korean lombard speech,” in Ninth European Conference on Speech Communication and Technology, 2005.
  19. Katerina Nicolaidis, “Consonant production in greek lombard speech: An electropalatographic study,” Italian Journal of Linguistics, vol. 24, no. 1, pp. 65–101, 2012.
  20. “A corpus of audio-visual lombard speech with frontal and profile views,” The Journal of the Acoustical Society of America, vol. 143, no. 6, pp. EL523–EL529, 2018.
  21. “Mandarin lombard grid: A noise induced lombard-grid-like corpus of standard chinese,” 2022.
  22. “Variations in articulatory movement with changes in speech task,” 2004.
  23. “Understanding the lombard effect for mandarin: Relation between speech-recognition thresholds and acoustic parameters,” Available at SSRN 4330234.
  24. Paul Boersma, “Praat: doing phonetics by computer [computer program],” http://www. praat. org/, 2011.
  25. “Opensmile: the munich versatile and fast open-source audio feature extractor,” in Proceedings of the 18th ACM international conference on Multimedia, 2010, pp. 1459–1462.
  26. “The geneva minimalistic acoustic parameter set (gemaps) for voice research and affective computing,” IEEE transactions on affective computing, vol. 7, no. 2, pp. 190–202, 2015.
Citations (2)

Summary

We haven't generated a summary for this paper yet.