Improving Device Directedness Classification of Utterances with Semantic Lexical Features (2010.01949v1)

Published 29 Sep 2020 in eess.AS, cs.CL, cs.LG, and cs.SD

Abstract: User interactions with personal assistants like Alexa, Google Home and Siri are typically initiated by a wake term or wakeword. Several personal assistants feature "follow-up" modes that allow users to make additional interactions without the need of a wakeword. For the system to only respond when appropriate, and to ignore speech not intended for it, utterances must be classified as device-directed or non-device-directed. State-of-the-art systems have largely used acoustic features for this task, while others have used only lexical features or have added LM-based lexical features. We propose a directedness classifier that combines semantic lexical features with a lightweight acoustic feature and show it is effective in classifying directedness. The mixed-domain lexical and acoustic feature model is able to achieve 14% relative reduction of EER over a state-of-the-art acoustic-only baseline model. Finally, we successfully apply transfer learning and semi-supervised learning to the model to improve accuracy even further.

Authors (5)

Kellen Gillespie (3 papers)
Ioannis C. Konstantakopoulos (9 papers)
Xingzhi Guo (12 papers)
Vishal Thanvantri Vasudevan (1 paper)
Abhinav Sethy (14 papers)

Citations (15)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Improving Device Directedness Classification of Utterances with Semantic Lexical Features (2010.01949v1)

Summary

Related Papers