Prompt Tuning GPT-2 language model for parameter-efficient domain adaptation of ASR systems (2112.08718v3)

Published 16 Dec 2021 in cs.CL and cs.LG

Abstract: Automatic Speech Recognition (ASR) systems are used in numerous industrial applications across very diverse domains, creating a need to adapt to new domains with small memory and deployment overhead. In this work, we introduce domain-prompts, a methodology that trains a small number of domain embedding parameters to prime a Transformer-based language model (LM) for a particular domain. Using this domain-adapted LM to rescore ASR hypotheses achieves a 7-13% WER reduction on a new domain with just 1000 unlabeled domain-specific text sentences. This improvement is comparable to, or even better than, that of fully fine-tuned models, even though only 0.02% of the parameters of the base LM are updated. Additionally, the method is deployment-friendly, as the learnt domain embeddings are prefixed to the model input rather than changing the base model architecture. This makes it an ideal choice for on-the-fly adaptation of the LMs used in ASR systems, progressively scaling them to new domains.
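The abstract describes prefixing a small set of learnable domain embeddings to the input of a frozen base LM and training only those embeddings on in-domain text. Below is a minimal sketch of this style of prompt tuning for GPT-2, assuming PyTorch and Hugging Face transformers; the class name, prompt length, and example sentence are illustrative assumptions, not details taken from the paper.

```python
# Sketch: prompt tuning GPT-2 with learnable "domain-prompt" embeddings.
# Only the prompt parameters are trained; the base LM stays frozen.
import torch
import torch.nn as nn
from transformers import GPT2LMHeadModel, GPT2TokenizerFast


class DomainPromptGPT2(nn.Module):  # hypothetical class name
    def __init__(self, base_model_name="gpt2", num_prompt_tokens=20):
        super().__init__()
        self.lm = GPT2LMHeadModel.from_pretrained(base_model_name)
        # Freeze every parameter of the base LM.
        for p in self.lm.parameters():
            p.requires_grad = False
        embed_dim = self.lm.config.n_embd
        # Learnable domain embeddings prefixed to the input
        # (a tiny fraction of the base LM's parameters).
        self.prompt = nn.Parameter(torch.randn(num_prompt_tokens, embed_dim) * 0.02)

    def forward(self, input_ids, labels=None):
        batch_size = input_ids.size(0)
        tok_embeds = self.lm.transformer.wte(input_ids)
        prompt = self.prompt.unsqueeze(0).expand(batch_size, -1, -1)
        inputs_embeds = torch.cat([prompt, tok_embeds], dim=1)
        if labels is not None:
            # Mask the prompt positions out of the LM loss.
            prompt_pad = torch.full(
                (batch_size, self.prompt.size(0)), -100,
                dtype=labels.dtype, device=labels.device)
            labels = torch.cat([prompt_pad, labels], dim=1)
        return self.lm(inputs_embeds=inputs_embeds, labels=labels)


# Usage sketch: fit the prompt on a small set of in-domain sentences, then
# score ASR hypotheses with the adapted LM (lower loss = more in-domain).
tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = DomainPromptGPT2()
batch = tokenizer(["patient shows elevated blood pressure"], return_tensors="pt")
out = model(batch["input_ids"], labels=batch["input_ids"])
out.loss.backward()  # gradients flow only into model.prompt
```

Because the adapted model differs from the base LM only in the prefixed embeddings, many domains can share one deployed base model, which is the deployment advantage the abstract highlights.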

Authors (6)
  1. Saket Dingliwal (22 papers)
  2. Ashish Shenoy (13 papers)
  3. Sravan Bodapati (31 papers)
  4. Ankur Gandhe (30 papers)
  5. Ravi Teja Gadde (6 papers)
  6. Katrin Kirchhoff (36 papers)
Citations (3)