
DROID: Minimizing the Reality Gap using Single-Shot Human Demonstration (2102.11003v2)

Published 22 Feb 2021 in cs.RO

Abstract: Reinforcement learning (RL) has demonstrated great success in the past several years. However, most of this work focuses on simulated environments. One of the main challenges of transferring a policy learned in a simulated environment to the real world is the discrepancy between the dynamics of the two environments. In prior works, Domain Randomization (DR) has been used to address the reality gap for both robotic locomotion and manipulation tasks. In this paper, we propose Domain Randomization Optimization IDentification (DROID), a novel framework that exploits a single-shot human demonstration to identify the simulator's distribution of dynamics parameters, and apply it to training a policy on a door opening task. Our results show that the proposed framework can identify the difference in dynamics between the simulated and the real worlds, and thus improve policy transfer by optimizing the simulator's randomization ranges. We further illustrate that, based on these same identified parameters, our method can generalize the learned policy to different but related tasks.

Authors (6)
  1. Ya-Yen Tsai (7 papers)
  2. Hui Xu (121 papers)
  3. Zihan Ding (38 papers)
  4. Chong Zhang (137 papers)
  5. Edward Johns (49 papers)
  6. Bidan Huang (8 papers)
Citations (25)
