Complementary Language Model and Parallel Bi-LRNN for False Trigger Mitigation (2008.08113v1)

Published 18 Aug 2020 in eess.AS, cs.CL, cs.LG, and cs.SD

Abstract: False triggers in voice assistants are unintended invocations of the assistant, which not only degrade the user experience but may also compromise privacy. False trigger mitigation (FTM) is the process of detecting false trigger events and responding appropriately to the user. In this paper, we propose a novel solution to the FTM problem by introducing a parallel ASR decoding process with a special language model (LM) trained from "out-of-domain" data sources. Such a language model is complementary to the existing LM optimized for the assistant task. A bidirectional lattice RNN (Bi-LRNN) classifier trained on the lattices generated by the complementary LM shows a $38.34\%$ relative reduction of the false trigger (FT) rate at a fixed $0.4\%$ rate of false suppression (FS) of correct invocations, compared to the current Bi-LRNN model. In addition, we propose to train a parallel Bi-LRNN model based on the decoding lattices from both LMs, and examine various ways of implementing it. The resulting model leads to a further $10.8\%$ reduction in the false trigger rate.
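To make the idea concrete, below is a minimal PyTorch sketch of a Bi-LRNN-style lattice classifier and a parallel variant that fuses the decoding lattices obtained with the assistant-domain LM and the complementary out-of-domain LM. This is not the authors' implementation: the class names (`LatticeEncoder`, `ParallelBiLRNN`), the GRU-cell recurrence, the mean-pooling over incoming arcs, and the choice of final/start node states as the lattice summary are all assumptions made for illustration.

```python
# Minimal illustrative sketch of a Bi-LRNN classifier over ASR decoding
# lattices and a parallel variant fusing two lattices (assistant-domain LM
# and complementary out-of-domain LM). Not the authors' implementation;
# the architectural details here are assumptions.
import torch
import torch.nn as nn


class LatticeEncoder(nn.Module):
    """Encodes a decoding lattice (a DAG whose arcs carry feature vectors,
    e.g. word embeddings plus acoustic/LM scores) into a fixed-size vector
    via a forward and a backward pass over the arcs."""

    def __init__(self, arc_dim: int, hidden_dim: int):
        super().__init__()
        self.fwd_cell = nn.GRUCell(arc_dim, hidden_dim)
        self.bwd_cell = nn.GRUCell(arc_dim, hidden_dim)
        self.hidden_dim = hidden_dim

    def _propagate(self, cell, arcs, arc_feats, num_nodes, reverse=False):
        # Assumes node indices 0..num_nodes-1 are already in topological
        # order (0 = start node, num_nodes-1 = final node).
        incoming = {n: [] for n in range(num_nodes)}
        for (src, dst), feat in zip(arcs, arc_feats):
            if reverse:
                src, dst = dst, src  # traverse arcs against their direction
            incoming[dst].append((src, feat))
        order = range(num_nodes - 1, -1, -1) if reverse else range(num_nodes)
        h = [torch.zeros(self.hidden_dim) for _ in range(num_nodes)]
        for node in order:
            msgs = [cell(feat.unsqueeze(0), h[src].unsqueeze(0)).squeeze(0)
                    for src, feat in incoming[node]]
            if msgs:
                # Mean-pool the states arriving over all incoming arcs (a
                # simplifying assumption; other pooling schemes are possible).
                h[node] = torch.stack(msgs).mean(dim=0)
        return h

    def forward(self, arcs, arc_feats, num_nodes):
        h_fwd = self._propagate(self.fwd_cell, arcs, arc_feats, num_nodes)
        h_bwd = self._propagate(self.bwd_cell, arcs, arc_feats, num_nodes,
                                reverse=True)
        # Summarize the lattice with the forward state of the final node
        # and the backward state of the start node.
        return torch.cat([h_fwd[-1], h_bwd[0]], dim=-1)


class ParallelBiLRNN(nn.Module):
    """Runs one encoder per lattice (assistant LM vs. complementary LM)
    and classifies the concatenated embeddings as true/false trigger."""

    def __init__(self, arc_dim: int, hidden_dim: int):
        super().__init__()
        self.main_encoder = LatticeEncoder(arc_dim, hidden_dim)
        self.comp_encoder = LatticeEncoder(arc_dim, hidden_dim)
        self.classifier = nn.Sequential(
            nn.Linear(4 * hidden_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, 1),
        )

    def forward(self, main_lattice, comp_lattice):
        z_main = self.main_encoder(*main_lattice)
        z_comp = self.comp_encoder(*comp_lattice)
        logit = self.classifier(torch.cat([z_main, z_comp], dim=-1))
        return torch.sigmoid(logit)  # estimated P(false trigger)


if __name__ == "__main__":
    # Toy lattice: 4 topologically ordered nodes, 16-dim arc features.
    arcs = [(0, 1), (0, 2), (1, 3), (2, 3)]
    feats = [torch.randn(16) for _ in arcs]
    lattice = (arcs, feats, 4)
    model = ParallelBiLRNN(arc_dim=16, hidden_dim=32)
    print(model(lattice, lattice))  # e.g. tensor([0.51], grad_fn=...)
```

In this sketch, the single-lattice Bi-LRNN baseline corresponds to using only one `LatticeEncoder`; the parallel model simply concatenates the two lattice embeddings before the classifier, which is one possible fusion strategy among the "various ways of implementing it" the abstract mentions, not necessarily the variant the paper finds best.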

Authors (6)
  1. Rishika Agarwal (2 papers)
  2. Xiaochuan Niu (5 papers)
  3. Pranay Dighe (14 papers)
  4. Srikanth Vishnubhotla (5 papers)
  5. Sameer Badaskar (2 papers)
  6. Devang Naik (26 papers)
Citations (3)