Do Neural Dialog Systems Use the Conversation History Effectively? An Empirical Study (1906.01603v2)

Published 4 Jun 2019 in cs.CL, cs.AI, and cs.LG

Abstract: Neural generative models have become increasingly popular for building conversational agents. They offer flexibility, can be easily adapted to new domains, and require minimal domain engineering. A common criticism of these systems is that they seldom understand or use the available dialog history effectively. In this paper, we take an empirical approach to understanding how these models use the available dialog history by studying their sensitivity to artificially introduced unnatural changes or perturbations to their context at test time. We experiment with 10 different types of perturbations on 4 multi-turn dialog datasets and find that commonly used neural dialog architectures, such as recurrent and transformer-based seq2seq models, are rarely sensitive to most perturbations, including missing or reordered utterances and shuffled words. By open-sourcing our code, we hope it will serve as a useful diagnostic tool for evaluating dialog systems in the future.
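
The abstract describes the perturbations only at a high level. As a rough illustration, here is a minimal sketch of what utterance-level and word-level perturbations of a dialog history could look like; the function names and example dialog are hypothetical and are not taken from the paper's released code:

```python
import random

# Hypothetical sketch of the perturbation types described in the abstract.
# Each function takes a dialog history (a list of utterance strings) and
# returns a perturbed copy, leaving the original history untouched.

def shuffle_utterances(history):
    """Utterance-level perturbation: randomly reorder the turns."""
    perturbed = history[:]
    random.shuffle(perturbed)
    return perturbed

def drop_utterance(history):
    """Utterance-level perturbation: remove one randomly chosen turn."""
    if len(history) <= 1:
        return history[:]
    idx = random.randrange(len(history))
    return history[:idx] + history[idx + 1:]

def shuffle_words(history):
    """Word-level perturbation: shuffle the words within each utterance."""
    perturbed = []
    for utterance in history:
        words = utterance.split()
        random.shuffle(words)
        perturbed.append(" ".join(words))
    return perturbed

if __name__ == "__main__":
    dialog = [
        "hi , how are you ?",
        "i am fine , thanks . do you have pets ?",
        "yes , two dogs .",
    ]
    print(shuffle_utterances(dialog))
    print(drop_utterance(dialog))
    print(shuffle_words(dialog))
```

In the setup the abstract describes, such perturbations are applied to the dialog history at test time, and a model's sensitivity is gauged by how much its outputs change relative to the unperturbed context.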

Authors (5)
  1. Chinnadhurai Sankar (23 papers)
  2. Sandeep Subramanian (24 papers)
  3. Christopher Pal (97 papers)
  4. Sarath Chandar (93 papers)
  5. Yoshua Bengio (601 papers)
Citations (120)