
Phrase-Level Class based Language Model for Mandarin Smart Speaker Query Recognition (1909.00556v1)

Published 2 Sep 2019 in cs.CL

Abstract: The success of speech assistants requires precise recognition of a large number of entities in particular contexts. A common solution is to train a class-based n-gram language model and then expand the classes into specific words or phrases. However, when a class has a huge list, e.g., more than 20 million songs, a full expansion causes a memory explosion. Worse still, the list items in the class need to be updated frequently, which requires a dynamic model-updating technique. In this work, we propose to train pruned language models for the word classes to replace the slots in the root n-gram. We further propose a novel technique, named the Difference Language Model (DLM), to correct the bias introduced by the pruned language models. Once the decoding graph is built, we only need to recalculate the DLM when the entities in the word classes are updated. Results show that the proposed method consistently and significantly outperforms the conventional approaches on all datasets, especially for large lists, which the conventional approaches cannot handle.
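
The core correction idea can be illustrated with a minimal sketch, assuming the DLM term is the difference between the full and pruned class LM log-probabilities, so that adding it back at decode time recovers the full LM's score. All names and toy probabilities below are illustrative; the paper's actual DLM estimation over a decoding graph is more involved.

```python
# Sketch of the difference-correction idea (illustrative, not the paper's code):
# the decoding graph carries a small pruned class LM, and a Difference
# Language Model (DLM) restores the probability mass pruning removed.
import math

# Toy log-probabilities for phrases in a "song" class.
full_lm = {"shape of you": math.log(0.4), "let it be": math.log(0.3)}
pruned_lm = {"shape of you": math.log(0.5), "let it be": math.log(0.5)}

def dlm_correction(phrase: str) -> float:
    """DLM term: log P_full(phrase) - log P_pruned(phrase)."""
    return full_lm[phrase] - pruned_lm[phrase]

def corrected_score(phrase: str) -> float:
    """Decode-time score: pruned LM plus the DLM correction,
    which recovers the full LM's log-probability."""
    return pruned_lm[phrase] + dlm_correction(phrase)

for p in full_lm:
    # Corrected score matches the full LM exactly in this toy setup.
    assert abs(corrected_score(p) - full_lm[p]) < 1e-9
    print(p, corrected_score(p))
```

Under this view, when the class list changes, only the DLM term needs recomputing; the compiled decoding graph with the pruned class LM stays fixed, which is what makes frequent entity updates cheap.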

Citations (1)
