Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

RetinaGAN: An Object-aware Approach to Sim-to-Real Transfer (2011.03148v2)

Published 6 Nov 2020 in cs.RO

Abstract: The success of deep reinforcement learning (RL) and imitation learning (IL) in vision-based robotic manipulation typically hinges on the expense of large scale data collection. With simulation, data to train a policy can be collected efficiently at scale, but the visual gap between sim and real makes deployment in the real world difficult. We introduce RetinaGAN, a generative adversarial network (GAN) approach to adapt simulated images to realistic ones with object-detection consistency. RetinaGAN is trained in an unsupervised manner without task loss dependencies, and preserves general object structure and texture in adapted images. We evaluate our method on three real world tasks: grasping, pushing, and door opening. RetinaGAN improves upon the performance of prior sim-to-real methods for RL-based object instance grasping and continues to be effective even in the limited data regime. When applied to a pushing task in a similar visual domain, RetinaGAN demonstrates transfer with no additional real data requirements. We also show our method bridges the visual gap for a novel door opening task using imitation learning in a new visual domain. Visit the project website at https://retinagan.github.io/

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Daniel Ho (18 papers)
  2. Kanishka Rao (31 papers)
  3. Zhuo Xu (82 papers)
  4. Eric Jang (19 papers)
  5. Mohi Khansari (18 papers)
  6. Yunfei Bai (21 papers)
Citations (88)
Github Logo Streamline Icon: https://streamlinehq.com

GitHub