Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Ask No More: Deciding when to guess in referential visual dialogue (1805.06960v2)

Published 17 May 2018 in cs.CL, cs.CV, and cs.MM

Abstract: Our goal is to explore how the abilities brought in by a dialogue manager can be included in end-to-end visually grounded conversational agents. We make initial steps towards this general goal by augmenting a task-oriented visual dialogue model with a decision-making component that decides whether to ask a follow-up question to identify a target referent in an image, or to stop the conversation to make a guess. Our analyses show that adding a decision making component produces dialogues that are less repetitive and that include fewer unnecessary questions, thus potentially leading to more efficient and less unnatural interactions.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Ravi Shekhar (11 papers)
  2. Aashish Venkatesh (2 papers)
  3. Elia Bruni (32 papers)
  4. Raffaella Bernardi (24 papers)
  5. Tim Baumgartner (1 paper)
  6. Raquel Fernandez (2 papers)
Citations (21)