Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Fully Autonomous Real-World Reinforcement Learning with Applications to Mobile Manipulation (2107.13545v3)

Published 28 Jul 2021 in cs.LG and cs.RO

Abstract: We study how robots can autonomously learn skills that require a combination of navigation and grasping. While reinforcement learning in principle provides for automated robotic skill learning, in practice reinforcement learning in the real world is challenging and often requires extensive instrumentation and supervision. Our aim is to devise a robotic reinforcement learning system for learning navigation and manipulation together, in an autonomous way without human intervention, enabling continual learning under realistic assumptions. Our proposed system, ReLMM, can learn continuously on a real-world platform without any environment instrumentation, without human intervention, and without access to privileged information, such as maps, objects positions, or a global view of the environment. Our method employs a modularized policy with components for manipulation and navigation, where manipulation policy uncertainty drives exploration for the navigation controller, and the manipulation module provides rewards for navigation. We evaluate our method on a room cleanup task, where the robot must navigate to and pick up items scattered on the floor. After a grasp curriculum training phase, ReLMM can learn navigation and grasping together fully automatically, in around 40 hours of autonomous real-world training.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Charles Sun (3 papers)
  2. Coline Devin (21 papers)
  3. Brian Yang (10 papers)
  4. Abhishek Gupta (226 papers)
  5. Glen Berseth (48 papers)
  6. Sergey Levine (531 papers)
  7. Jędrzej Orbik (1 paper)
Citations (44)

Summary

We haven't generated a summary for this paper yet.