2000 character limit reached
Ask No More: Deciding when to guess in referential visual dialogue (1805.06960v2)
Published 17 May 2018 in cs.CL, cs.CV, and cs.MM
Abstract: Our goal is to explore how the abilities brought in by a dialogue manager can be included in end-to-end visually grounded conversational agents. We make initial steps towards this general goal by augmenting a task-oriented visual dialogue model with a decision-making component that decides whether to ask a follow-up question to identify a target referent in an image, or to stop the conversation to make a guess. Our analyses show that adding a decision making component produces dialogues that are less repetitive and that include fewer unnecessary questions, thus potentially leading to more efficient and less unnatural interactions.
- Ravi Shekhar (11 papers)
- Aashish Venkatesh (2 papers)
- Elia Bruni (32 papers)
- Raffaella Bernardi (24 papers)
- Tim Baumgartner (1 paper)
- Raquel Fernandez (2 papers)