Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Towards End-to-End Code-Switching Speech Recognition (1810.13091v2)

Published 31 Oct 2018 in cs.CL and eess.AS

Abstract: Code-switching speech recognition has attracted an increasing interest recently, but the need for expert linguistic knowledge has always been a big issue. End-to-end automatic speech recognition (ASR) simplifies the building of ASR systems considerably by predicting graphemes or characters directly from acoustic input. In the mean time, the need of expert linguistic knowledge is also eliminated, which makes it an attractive choice for code-switching ASR. This paper presents a hybrid CTC-Attention based end-to-end Mandarin-English code-switching (CS) speech recognition system and studies the effect of hybrid CTC-Attention based models, different modeling units, the inclusion of language identification and different decoding strategies on the task of code-switching ASR. On the SEAME corpus, our system achieves a mixed error rate (MER) of 34.24%.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Ne Luo (5 papers)
  2. Dongwei Jiang (16 papers)
  3. Shuaijiang Zhao (10 papers)
  4. Caixia Gong (5 papers)
  5. Wei Zou (62 papers)
  6. Xiangang Li (46 papers)
Citations (47)

Summary

We haven't generated a summary for this paper yet.