Implicit Offline Reinforcement Learning via Supervised Learning (2210.12272v1)

Published 21 Oct 2022 in stat.ML, cs.LG, and cs.RO

Abstract: Offline Reinforcement Learning (RL) via Supervised Learning is a simple and effective way to learn robotic skills from a dataset collected by policies of different expertise levels. It is as simple as supervised learning and Behavior Cloning (BC), but takes advantage of return information. On datasets collected by policies of similar expertise, implicit BC has been shown to match or outperform explicit BC. Despite the benefits of using implicit models to learn robotic skills via BC, algorithms for offline RL via Supervised Learning have been limited to explicit models. We show how implicit models can leverage return information and match or outperform explicit algorithms to acquire robotic skills from fixed datasets. Furthermore, we show the close relationship between our implicit methods and other popular RL via Supervised Learning algorithms to provide a unified framework. Finally, we demonstrate the effectiveness of our method on high-dimensional manipulation and locomotion tasks.
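
To make the abstract's central mechanism concrete: an "implicit" policy replaces a network that outputs actions with an energy model E(s, g, a) over state, return-to-go, and action, trained so that dataset actions get low energy, and acted on by approximately minimizing energy over candidate actions. The sketch below is a minimal illustration in that spirit (an InfoNCE-style contrastive loss as in implicit BC, here conditioned on return), not the authors' implementation; all names, shapes, and hyperparameters are illustrative assumptions.

```python
# Minimal sketch of a return-conditioned implicit (energy-based) policy.
# Assumed, not from the paper: architecture, loss details, hyperparameters.
import torch
import torch.nn as nn

class EnergyPolicy(nn.Module):
    """Scores (state, return-to-go, action) triples; lower energy = better."""
    def __init__(self, state_dim, action_dim, hidden=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim + 1 + action_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, state, ret, action):
        return self.net(torch.cat([state, ret, action], dim=-1)).squeeze(-1)

def info_nce_loss(model, state, ret, action,
                  num_negatives=64, action_low=-1.0, action_high=1.0):
    """Contrastive loss: the dataset action should have lower energy than
    uniformly sampled counter-example actions (as in implicit BC)."""
    B, A = action.shape
    negatives = torch.rand(B, num_negatives, A) * (action_high - action_low) + action_low
    all_actions = torch.cat([action.unsqueeze(1), negatives], dim=1)  # (B, 1+N, A)
    s = state.unsqueeze(1).expand(-1, 1 + num_negatives, -1)
    g = ret.unsqueeze(1).expand(-1, 1 + num_negatives, -1)
    energies = model(s, g, all_actions)                               # (B, 1+N)
    # Treat -energy as a logit; the dataset action is class 0.
    labels = torch.zeros(B, dtype=torch.long)
    return nn.functional.cross_entropy(-energies, labels)

@torch.no_grad()
def act(model, state, ret, num_samples=512, action_dim=2):
    """Approximate argmin_a E(s, g, a) by sampling candidate actions."""
    candidates = torch.rand(num_samples, action_dim) * 2.0 - 1.0
    s = state.expand(num_samples, -1)   # state: (1, state_dim)
    g = ret.expand(num_samples, -1)     # ret:   (1, 1)
    energies = model(s, g, candidates)
    return candidates[energies.argmin()]

# Usage (hypothetical shapes):
# model = EnergyPolicy(state_dim=11, action_dim=3)
# loss = info_nce_loss(model, states, returns, actions)
#   # states: (B, 11), returns: (B, 1), actions: (B, 3)
```

Conditioning the energy on the return-to-go g is what lets the implicit policy exploit the return information that plain BC discards: at test time, one can request high-return behavior by setting g near the best returns observed in the dataset.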

Authors (5)
  1. Rafael Pardinas (6 papers)
  2. David Vazquez (73 papers)
  3. Igor Mordatch (66 papers)
  4. Chris Pal (37 papers)
  5. Alexandre Piche (4 papers)
Citations (4)
