Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity (2111.01326v1)

Published 2 Nov 2021 in eess.AS, cs.CL, and cs.SD

Abstract: Speech processing systems currently do not support the vast majority of languages, in part due to the lack of data in low-resource languages. Cross-lingual transfer offers a compelling way to help bridge this digital divide by incorporating high-resource data into low-resource systems. Current cross-lingual algorithms have shown success in text-based tasks and speech-related tasks over some low-resource languages. However, scaling up speech systems to support hundreds of low-resource languages remains unsolved. To help bridge this gap, we propose a language similarity approach that can efficiently identify acoustic cross-lingual transfer pairs across hundreds of languages. We demonstrate the effectiveness of our approach in language family classification, speech recognition, and speech synthesis tasks.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Peter Wu (32 papers)
  2. Jiatong Shi (82 papers)
  3. Yifan Zhong (13 papers)
  4. Shinji Watanabe (416 papers)
  5. Alan W Black (83 papers)
Citations (8)

Summary

We haven't generated a summary for this paper yet.