
Code Switched and Code Mixed Speech Recognition for Indic languages (2203.16578v2)

Published 30 Mar 2022 in cs.CL and eess.AS

Abstract: Training multilingual automatic speech recognition (ASR) systems is challenging because acoustic and lexical information is typically language-specific. Training multilingual systems for Indic languages is even harder because of the lack of open-source datasets and of published results on different approaches. We compare the performance of an end-to-end multilingual speech recognition system with that of monolingual models conditioned on language identification (LID). The decoding information from a multilingual model is used for language identification and then combined with monolingual models to obtain a 50% improvement in WER across languages. We also propose a similar technique for the code-switched problem and achieve WERs of 21.77 and 28.27 on Hindi-English and Bengali-English, respectively. Our work shows how transformer-based ASR, especially wav2vec 2.0, can be applied to develop multilingual and code-switched ASR for Indic languages.
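The abstract describes using the decoded output of a multilingual model for language identification and then handing the utterance to a monolingual model. A minimal sketch of that routing idea follows. It assumes, as an illustration only, that the language can be guessed from the dominant Unicode script of the multilingual draft transcript; the abstract does not state how the decoding information is used. The names `SCRIPT_TO_LANG`, `identify_language`, and `transcribe_with_lid` are hypothetical, and the models are stand-in callables rather than real wav2vec 2.0 checkpoints.

```python
import unicodedata
from collections import Counter
from typing import Any, Callable, Dict, Tuple

# Hypothetical mapping from Unicode script name to a language code.
# Assumption: one language per script. This breaks for scripts shared by
# several languages (e.g. Devanagari is used by both Hindi and Marathi).
SCRIPT_TO_LANG: Dict[str, str] = {
    "DEVANAGARI": "hi",
    "BENGALI": "bn",
    "TAMIL": "ta",
    "TELUGU": "te",
    "GUJARATI": "gu",
    "LATIN": "en",
}


def identify_language(draft: str, default: str = "hi") -> str:
    """Guess the language from the dominant script of a draft transcript."""
    counts: Counter = Counter()
    for ch in draft:
        if not ch.isalpha():
            continue  # skip spaces, digits, punctuation, combining marks
        # Character names start with the script, e.g. "BENGALI LETTER KA".
        name = unicodedata.name(ch, "")
        script = name.split(" ")[0] if name else ""
        if script in SCRIPT_TO_LANG:
            counts[SCRIPT_TO_LANG[script]] += 1
    if not counts:
        return default
    return counts.most_common(1)[0][0]


def transcribe_with_lid(
    audio: Any,
    multilingual_model: Callable[[Any], str],
    monolingual_models: Dict[str, Callable[[Any], str]],
) -> Tuple[str, str]:
    """Decode once with the multilingual model, then re-decode with the
    monolingual model selected by the identified language."""
    draft = multilingual_model(audio)
    lang = identify_language(draft)
    model = monolingual_models.get(lang)
    # Fall back to the multilingual draft if no monolingual model exists.
    return lang, (model(audio) if model is not None else draft)
```

In this sketch the multilingual pass serves only to pick a language, and the final transcript comes from the monolingual model. Code-switched utterances, where two scripts appear in one draft, would need a finer-grained decision than the single majority vote used here.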

Authors (6)
  1. Harveen Singh Chadha (10 papers)
  2. Priyanshi Shah (10 papers)
  3. Ankur Dhuriya (8 papers)
  4. Neeraj Chhimwal (8 papers)
  5. Anirudh Gupta (9 papers)
  6. Vivek Raghavan (14 papers)
Citations (4)
