Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Voice trigger detection from LVCSR hypothesis lattices using bidirectional lattice recurrent neural networks (2003.00304v1)

Published 29 Feb 2020 in cs.CL, cs.SD, eess.AS, and stat.ML

Abstract: We propose a method to reduce false voice triggers of a speech-enabled personal assistant by post-processing the hypothesis lattice of a server-side large-vocabulary continuous speech recognizer (LVCSR) via a neural network. We first discuss how an estimate of the posterior probability of the trigger phrase can be obtained from the hypothesis lattice using known techniques to perform detection, then investigate a statistical model that processes the lattice in a more explicitly data-driven, discriminative manner. We propose using a Bidirectional Lattice Recurrent Neural Network (LatticeRNN) for the task, and show that it can significantly improve detection accuracy over using the 1-best result or the posterior.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Woojay Jeon (6 papers)
  2. Leo Liu (11 papers)
  3. Henry Mason (7 papers)
Citations (12)

Summary

We haven't generated a summary for this paper yet.