Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Asking the Difficult Questions: Goal-Oriented Visual Question Generation via Intermediate Rewards (1711.07614v1)

Published 21 Nov 2017 in cs.CV, cs.AI, and cs.CL

Abstract: Despite significant progress in a variety of vision-and-language problems, developing a method capable of asking intelligent, goal-oriented questions about images is proven to be an inscrutable challenge. Towards this end, we propose a Deep Reinforcement Learning framework based on three new intermediate rewards, namely goal-achieved, progressive and informativeness that encourage the generation of succinct questions, which in turn uncover valuable information towards the overall goal. By directly optimizing for questions that work quickly towards fulfilling the overall goal, we avoid the tendency of existing methods to generate long series of insane queries that add little value. We evaluate our model on the GuessWhat?! dataset and show that the resulting questions can help a standard Guesser identify a specific object in an image at a much higher success rate.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Junjie Zhang (79 papers)
  2. Qi Wu (323 papers)
  3. Chunhua Shen (404 papers)
  4. Jian Zhang (542 papers)
  5. Jianfeng Lu (273 papers)
  6. Anton van den Hengel (188 papers)
Citations (30)