Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Advancing Connectionist Temporal Classification With Attention Modeling (1803.05563v1)

Published 15 Mar 2018 in cs.CL

Abstract: In this study, we propose advancing all-neural speech recognition by directly incorporating attention modeling within the Connectionist Temporal Classification (CTC) framework. In particular, we derive new context vectors using time convolution features to model attention as part of the CTC network. To further improve attention modeling, we utilize content information extracted from a network representing an implicit LLM. Finally, we introduce vector based attention weights that are applied on context vectors across both time and their individual components. We evaluate our system on a 3400 hours Microsoft Cortana voice assistant task and demonstrate that our proposed model consistently outperforms the baseline model achieving about 20% relative reduction in word error rates.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Amit Das (28 papers)
  2. Jinyu Li (164 papers)
  3. Rui Zhao (241 papers)
  4. Yifan Gong (82 papers)
Citations (50)

Summary

We haven't generated a summary for this paper yet.