Modeling Long Context for Task-Oriented Dialogue State Generation (2004.14080v1)

Published 29 Apr 2020 in cs.CL

Abstract: Based on the recently proposed transferable dialogue state generator (TRADE) that predicts dialogue states from utterance-concatenated dialogue context, we propose a multi-task learning model with a simple yet effective utterance tagging technique and a bidirectional language model as an auxiliary task for task-oriented dialogue state generation. By enabling the model to learn a better representation of the long dialogue context, our approaches attempt to solve the problem that the performance of the baseline significantly drops when the input dialogue context sequence is long. In our experiments, our proposed model achieves a 7.03% relative improvement over the baseline, establishing a new state-of-the-art joint goal accuracy of 52.04% on the MultiWOZ 2.0 dataset.
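The abstract gives the improvement in relative terms only. As a rough sanity check, the sketch below back-computes the baseline accuracy implied by the two quoted figures. It assumes "relative improvement" means (new - baseline) / baseline; the variable names are illustrative and the baseline value is derived here, not stated in the abstract.

```python
# Figures quoted in the abstract.
new_joint_goal_acc = 52.04      # joint goal accuracy (%) on MultiWOZ 2.0
relative_improvement = 0.0703   # 7.03% relative improvement over the baseline

# Assumption: "relative" means (new - baseline) / baseline,
# so baseline = new / (1 + relative_improvement).
implied_baseline = new_joint_goal_acc / (1 + relative_improvement)

# Absolute gain in percentage points under the same assumption.
absolute_gain = new_joint_goal_acc - implied_baseline

print(f"implied baseline: {implied_baseline:.2f}%")
print(f"absolute gain: {absolute_gain:.2f} points")
```

Under this reading, the baseline sits at roughly 48.6% joint goal accuracy, so the 7.03% relative gain corresponds to about 3.4 absolute percentage points.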


Authors: Jun Quan, Deyi Xiong
