MA-DST: Multi-Attention Based Scalable Dialog State Tracking (2002.08898v1)

Published 7 Feb 2020 in cs.CL, cs.AI, cs.LG, and stat.ML

Abstract: Task-oriented dialog agents provide a natural language interface for users to complete their goals. Dialog State Tracking (DST), often a core component of these systems, tracks the system's understanding of the user's goal throughout the conversation. To enable accurate multi-domain DST, the model needs to encode dependencies between past utterances and slot semantics and to understand the dialog context, including long-range cross-domain references. We introduce a novel architecture for this task that encodes the conversation history and slot semantics more robustly by using attention mechanisms at multiple granularities. In particular, we use cross-attention to model relationships between the context and slots at different semantic levels, and self-attention to resolve cross-domain coreferences. In addition, the proposed architecture does not rely on knowing the domain ontologies beforehand and can also be used in a zero-shot setting for new domains or unseen slot values. Our model improves joint goal accuracy by 5% (absolute) in the full-data setting and by up to 2% (absolute) in the zero-shot setting over the previous state of the art on the MultiWOZ 2.1 dataset.
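To make the abstract's two attention mechanisms concrete, here is a minimal PyTorch sketch of the general idea: self-attention over the encoded conversation history (so distant turns can attend to each other, helping with cross-domain coreferences) followed by cross-attention that conditions on the slot-name encoding. All names, dimensions, and the pooling step are illustrative assumptions, not the paper's exact architecture, which attends at multiple granularities (word, sentence, and context levels).

```python
# Illustrative sketch only; module and parameter names are hypothetical,
# not taken from the MA-DST paper.
import torch
import torch.nn as nn

class SlotContextAttention(nn.Module):
    """Self-attend over the dialog history, then cross-attend a slot-name
    encoding over it, yielding one slot-conditioned summary vector."""
    def __init__(self, d_model: int = 256, n_heads: int = 4):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.cross_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    def forward(self, history: torch.Tensor, slot: torch.Tensor) -> torch.Tensor:
        # history: (batch, hist_len, d_model) token encodings of the conversation
        # slot:    (batch, slot_len, d_model) token encodings of the slot name
        # Self-attention lets tokens from earlier turns inform later ones
        # (e.g., resolving "book a taxi there" against a hotel mentioned before).
        hist_ctx, _ = self.self_attn(history, history, history)
        # Cross-attention: slot tokens query the contextualized history,
        # relating slot semantics to the relevant parts of the conversation.
        slot_ctx, _ = self.cross_attn(slot, hist_ctx, hist_ctx)
        # Pool over slot tokens to get one vector per (dialog, slot) pair.
        return slot_ctx.mean(dim=1)

# Usage: the resulting vector could score candidate values or feed a decoder.
enc = SlotContextAttention()
hist = torch.randn(2, 50, 256)   # 2 dialogs, 50 history tokens each
slot = torch.randn(2, 4, 256)    # slot-name tokens, e.g. "hotel - price range"
print(enc(hist, slot).shape)     # torch.Size([2, 256])
```

Because the slot name is encoded rather than looked up in a fixed ontology, a scheme like this can in principle be applied zero-shot to unseen slots, consistent with the ontology-free claim in the abstract.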

Authors (5)
  1. Adarsh Kumar (26 papers)
  2. Peter Ku (3 papers)
  3. Anuj Kumar Goyal (2 papers)
  4. Angeliki Metallinou (14 papers)
  5. Dilek Hakkani-Tur (94 papers)
Citations (58)