
ACTRCE: Augmenting Experience via Teacher's Advice For Multi-Goal Reinforcement Learning (1902.04546v1)

Published 12 Feb 2019 in cs.LG, cs.AI, cs.NE, and stat.ML

Abstract: Sparse reward is one of the most challenging problems in reinforcement learning (RL). Hindsight Experience Replay (HER) attempts to address this issue by converting a failed experience to a successful one by relabeling the goals. Despite its effectiveness, HER has limited applicability because it lacks a compact and universal goal representation. We present Augmenting experienCe via TeacheR's adviCE (ACTRCE), an efficient reinforcement learning technique that extends the HER framework using natural language as the goal representation. We first analyze the differences among goal representations and show that ACTRCE can efficiently solve difficult reinforcement learning problems in challenging 3D navigation tasks, whereas HER with a non-language goal representation fails to learn. We also show that with language goal representations, the agent can generalize to unseen instructions, and even generalize to instructions with unseen lexicons. We further demonstrate that it is crucial to use hindsight advice to solve challenging tasks, and that even a small amount of advice is sufficient for the agent to achieve good performance.
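
The relabeling idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `Transition` fields, the `relabel_with_advice` helper, the toy teacher, and the 0/1 reward values are all hypothetical and chosen only for illustration. The sketch assumes a teacher that, given the final state of a failed episode, may return a natural-language description of what the agent actually achieved; that description then replaces the original goal in a copy of the episode.

```python
from dataclasses import dataclass, replace
from typing import Callable, List, Optional


@dataclass(frozen=True)
class Transition:
    """One step of experience (hypothetical layout, not from the paper)."""
    state: str
    action: str
    next_state: str
    goal: str      # natural-language instruction the agent was following
    reward: float


def relabel_with_advice(
    episode: List[Transition],
    teacher: Callable[[str], Optional[str]],
) -> List[Transition]:
    """Return the original episode plus a hindsight copy when advice exists.

    The teacher maps the final state to a language goal that the episode
    did achieve, or None if it has nothing to say. Rewards in the copy are
    an assumption: 1.0 on the last step, 0.0 elsewhere.
    """
    augmented = list(episode)
    if not episode:
        return augmented
    advice = teacher(episode[-1].next_state)
    if advice is None:
        return augmented
    last = len(episode) - 1
    for i, step in enumerate(episode):
        reward = 1.0 if i == last else 0.0
        augmented.append(replace(step, goal=advice, reward=reward))
    return augmented


def toy_teacher(state: str) -> Optional[str]:
    """Hypothetical teacher: describes a few known states in language."""
    descriptions = {"at_red_torch": "go to the red torch"}
    return descriptions.get(state)


# A failed episode: the instruction was not satisfied, so every reward is 0.
episode = [
    Transition("start", "forward", "hallway", "go to the blue pillar", 0.0),
    Transition("hallway", "left", "at_red_torch", "go to the blue pillar", 0.0),
]
replay_buffer = relabel_with_advice(episode, toy_teacher)
```

Under these assumptions, `replay_buffer` holds the two original transitions followed by two relabeled ones whose goal is the teacher's description, so the failed episode also serves as a successful example for a different instruction.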

Authors (5)
  1. Harris Chan (13 papers)
  2. Yuhuai Wu (49 papers)
  3. Jamie Kiros (9 papers)
  4. Sanja Fidler (184 papers)
  5. Jimmy Ba (55 papers)
Citations (33)
