
RMM: A Recursive Mental Model for Dialog Navigation (2005.00728v2)

Published 2 May 2020 in cs.CL, cs.AI, cs.CV, cs.LG, and cs.RO

Abstract: Language-guided robots must be able to both ask humans questions and understand answers. Much existing work focuses only on the latter. In this paper, we go beyond instruction following and introduce a two-agent task where one agent navigates and asks questions that a second, guiding agent answers. Inspired by theory of mind, we propose the Recursive Mental Model (RMM). The navigating agent models the guiding agent to simulate answers given candidate generated questions. The guiding agent in turn models the navigating agent to simulate navigation steps it would take to generate answers. We use the progress agents make towards the goal as a reinforcement learning reward signal to directly inform not only navigation actions, but also both question and answer generation. We demonstrate that RMM enables better generalization to novel environments. Interlocutor modelling may be a way forward for human-agent dialogue where robots need to both ask and answer questions.
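As a rough illustration of the idea in the abstract, the sketch below shows a progress-based reward and a navigator that simulates a guide's answers to candidate questions before choosing one. It is a toy under stated assumptions, not the authors' implementation: the grid world, the Manhattan distance, the names `progress_reward`, `pick_question`, `toy_guide`, and `toy_navigator`, and the rule of scoring each question by simulated progress are all hypothetical choices made for this example.

```python
from typing import Callable, Sequence, Tuple

Position = Tuple[int, int]

# Hypothetical toy action space: one grid step per answer word.
MOVES = {"north": (0, 1), "south": (0, -1), "east": (1, 0), "west": (-1, 0)}


def distance(a: Position, b: Position) -> int:
    """Manhattan distance, standing in for a real path-length metric."""
    return abs(a[0] - b[0]) + abs(a[1] - b[1])


def progress_reward(before: Position, after: Position, goal: Position) -> int:
    """Reward is the reduction in distance to the goal (assumed form)."""
    return distance(before, goal) - distance(after, goal)


def pick_question(
    position: Position,
    goal: Position,
    questions: Sequence[str],
    simulate_answer: Callable[[str], str],
    simulate_steps: Callable[[Position, str], Position],
) -> str:
    """Choose the question whose simulated answer leads to the most progress.

    simulate_answer plays the role of the navigator's model of the guide;
    simulate_steps plays the role of a model of the navigator's own moves.
    The goal is passed in only so this toy can score progress directly.
    """
    def score(question: str) -> int:
        answer = simulate_answer(question)
        end = simulate_steps(position, answer)
        return progress_reward(position, end, goal)

    return max(questions, key=score)


def toy_guide(question: str) -> str:
    """Hypothetical guide model: echoes the direction named in the question."""
    return question.split()[-1]


def toy_navigator(position: Position, answer: str) -> Position:
    """Hypothetical navigator model: takes one step in the answered direction."""
    dx, dy = MOVES[answer]
    return (position[0] + dx, position[1] + dy)


if __name__ == "__main__":
    candidates = ["should I go north", "should I go east", "should I go west"]
    best = pick_question((0, 0), (3, 0), candidates, toy_guide, toy_navigator)
    print(best)
```

In a learned system the two simulators would be trained models and the reward would come from the environment during training; here they are hand-written stand-ins so the control flow is easy to follow.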

Authors (5)
  1. Homero Roman Roman (1 paper)
  2. Yonatan Bisk (91 papers)
  3. Jesse Thomason (65 papers)
  4. Asli Celikyilmaz (81 papers)
  5. Jianfeng Gao (344 papers)
Citations (45)
