Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Mutual Information-based State-Control for Intrinsically Motivated Reinforcement Learning (2002.01963v2)

Published 5 Feb 2020 in cs.LG and stat.ML

Abstract: In reinforcement learning, an agent learns to reach a set of goals by means of an external reward signal. In the natural world, intelligent organisms learn from internal drives, bypassing the need for external signals, which is beneficial for a wide range of tasks. Motivated by this observation, we propose to formulate an intrinsic objective as the mutual information between the goal states and the controllable states. This objective encourages the agent to take control of its environment. Subsequently, we derive a surrogate objective of the proposed reward function, which can be optimized efficiently. Lastly, we evaluate the developed framework in different robotic manipulation and navigation tasks and demonstrate the efficacy of our approach. A video showing experimental results is available at https://youtu.be/CT4CKMWBYz0

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Rui Zhao (241 papers)
  2. Yang Gao (761 papers)
  3. Pieter Abbeel (372 papers)
  4. Volker Tresp (158 papers)
  5. Wei Xu (536 papers)
Citations (4)

Summary

We haven't generated a summary for this paper yet.