Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Explore and Explain: Self-supervised Navigation and Recounting (2007.07268v1)

Published 14 Jul 2020 in cs.CV, cs.AI, cs.CL, and cs.RO

Abstract: Embodied AI has been recently gaining attention as it aims to foster the development of autonomous and intelligent agents. In this paper, we devise a novel embodied setting in which an agent needs to explore a previously unknown environment while recounting what it sees during the path. In this context, the agent needs to navigate the environment driven by an exploration goal, select proper moments for description, and output natural language descriptions of relevant objects and scenes. Our model integrates a novel self-supervised exploration module with penalty, and a fully-attentive captioning model for explanation. Also, we investigate different policies for selecting proper moments for explanation, driven by information coming from both the environment and the navigation. Experiments are conducted on photorealistic environments from the Matterport3D dataset and investigate the navigation and explanation capabilities of the agent as well as the role of their interactions.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Roberto Bigazzi (11 papers)
  2. Federico Landi (10 papers)
  3. Marcella Cornia (61 papers)
  4. Silvia Cascianelli (23 papers)
  5. Lorenzo Baraldi (69 papers)
  6. Rita Cucchiara (142 papers)
Citations (17)

Summary

We haven't generated a summary for this paper yet.