A Discriminative Entity-Aware Language Model for Virtual Assistants (2106.11292v1)

Published 21 Jun 2021 in cs.CL and cs.LG

Abstract: High-quality automatic speech recognition (ASR) is essential for virtual assistants (VAs) to work well. However, ASR often performs poorly on VA requests containing named entities. In this work, we start from the observation that many ASR errors on named entities are inconsistent with real-world knowledge. We extend previous discriminative n-gram language modeling approaches to incorporate real-world knowledge from a Knowledge Graph (KG), using features that capture entity type-entity and entity-entity relationships. We apply our model through an efficient lattice rescoring process, achieving relative sentence error rate reductions of more than 25% on some synthesized test sets covering less popular entities, with minimal degradation on a uniformly sampled VA test set.
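
To make the rescoring idea in the abstract concrete, here is a minimal Python sketch. It is not the authors' implementation: it rescores a short n-best list rather than a lattice, uses a single hand-set weight instead of a discriminatively trained one, and relies on a toy knowledge graph. All names (`TOY_KG`, `EXPECTED_TYPE`, `Hypothesis`, `type_entity_feature`, `rescore`) and all scores are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass

# Toy knowledge graph (hypothetical data): entity name -> set of entity types.
TOY_KG = {
    "adele": {"musician"},
}

# Hypothetical mapping from a carrier phrase to the entity type it expects.
EXPECTED_TYPE = {"play songs by": "musician"}


@dataclass
class Hypothesis:
    text: str          # full transcript hypothesis
    carrier: str       # carrier phrase preceding the entity span
    entity: str        # candidate entity span
    base_score: float  # first-pass ASR log score (higher is better)


def type_entity_feature(hyp: Hypothesis) -> float:
    """Return 1.0 if the KG lists the entity under the type the carrier expects."""
    expected = EXPECTED_TYPE.get(hyp.carrier)
    return 1.0 if expected in TOY_KG.get(hyp.entity, set()) else 0.0


def rescore(hyps: list[Hypothesis], weight: float = 2.0) -> Hypothesis:
    """Pick the hypothesis with the best combined score.

    The weight is set by hand here; a discriminative model would learn it.
    """
    return max(hyps, key=lambda h: h.base_score + weight * type_entity_feature(h))


hyps = [
    Hypothesis("play songs by a dell", "play songs by", "a dell", -4.0),
    Hypothesis("play songs by adele", "play songs by", "adele", -5.0),
]
best = rescore(hyps)
```

In this toy setup the first-pass scores prefer "a dell", but the type-entity feature fires only for "adele", so the combined score selects the hypothesis that is consistent with the knowledge graph.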

Authors (4)
  1. Mandana Saebi (8 papers)
  2. Ernest Pusateri (10 papers)
  3. Aaksha Meghawat (2 papers)
  4. Christophe Van Gysel (24 papers)
Citations (7)
