Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition (2011.01991v1)

Published 3 Nov 2020 in eess.AS, cs.CL, cs.LG, and cs.SD

Abstract: The external LLMs (LM) integration remains a challenging task for end-to-end (E2E) automatic speech recognition (ASR) which has no clear division between acoustic and LLMs. In this work, we propose an internal LM estimation (ILME) method to facilitate a more effective integration of the external LM with all pre-existing E2E models with no additional model training, including the most popular recurrent neural network transducer (RNN-T) and attention-based encoder-decoder (AED) models. Trained with audio-transcript pairs, an E2E model implicitly learns an internal LM that characterizes the training data in the source domain. With ILME, the internal LM scores of an E2E model are estimated and subtracted from the log-linear interpolation between the scores of the E2E model and the external LM. The internal LM scores are approximated as the output of an E2E model when eliminating its acoustic components. ILME can alleviate the domain mismatch between training and testing, or improve the multi-domain E2E ASR. Experimented with 30K-hour trained RNN-T and AED models, ILME achieves up to 15.5% and 6.8% relative word error rate reductions from Shallow Fusion on out-of-domain LibriSpeech and in-domain Microsoft production test sets, respectively.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (10)
  1. Zhong Meng (53 papers)
  2. Sarangarajan Parthasarathy (9 papers)
  3. Eric Sun (14 papers)
  4. Yashesh Gaur (43 papers)
  5. Naoyuki Kanda (61 papers)
  6. Liang Lu (42 papers)
  7. Xie Chen (165 papers)
  8. Rui Zhao (241 papers)
  9. Jinyu Li (164 papers)
  10. Yifan Gong (82 papers)
Citations (105)