
Domain Randomization and Generative Models for Robotic Grasping (1710.06425v2)

Published 17 Oct 2017 in cs.RO and cs.LG

Abstract: Deep learning-based robotic grasping has made significant progress thanks to algorithmic improvements and increased data availability. However, state-of-the-art models are often trained on as few as hundreds or thousands of unique object instances, and as a result generalization can be a challenge. In this work, we explore a novel data generation pipeline for training a deep neural network to perform grasp planning that applies the idea of domain randomization to object synthesis. We generate millions of unique, unrealistic procedurally generated objects, and train a deep neural network to perform grasp planning on these objects. Since the distribution of successful grasps for a given object can be highly multimodal, we propose an autoregressive grasp planning model that maps sensor inputs of a scene to a probability distribution over possible grasps. This model allows us to sample grasps efficiently at test time (or avoid sampling entirely). We evaluate our model architecture and data generation pipeline in simulation and the real world. We find we can achieve a $>$90% success rate on previously unseen realistic objects at test time in simulation despite having only been trained on random objects. We also demonstrate an 80% success rate on real-world grasp attempts despite having only been trained on random simulated objects.

Authors (11)
  1. Joshua Tobin (5 papers)
  2. Lukas Biewald (1 paper)
  3. Rocky Duan (1 paper)
  4. Marcin Andrychowicz (22 papers)
  5. Ankur Handa (39 papers)
  6. Vikash Kumar (70 papers)
  7. Bob McGrew (11 papers)
  8. Jonas Schneider (18 papers)
  9. Peter Welinder (15 papers)
  10. Wojciech Zaremba (34 papers)
  11. Pieter Abbeel (372 papers)
Citations (169)

Summary

Domain Randomization and Generative Models for Robotic Grasping

The paper "Domain Randomization and Generative Models for Robotic Grasping" addresses a critical challenge in robotic manipulation: getting deep-learning-based grasp planners to generalize beyond the objects they were trained on. The authors propose a novel approach that combines domain randomization with generative modeling to enhance the performance of grasp planning algorithms.

Overview

Robotic grasping traditionally relies on either analytical or empirical methods. Analytical approaches provide theoretical guarantees by optimizing grasp metrics but often fall short in real-world applications due to inaccuracies in models and sensors. Empirical methods attempt to circumvent these limitations by learning grasp strategies through data-driven techniques, including deep neural networks. However, these methods are constrained by the availability of diverse training data, as they typically rely on limited sets of realistic object meshes.

The paper introduces a sophisticated data generation pipeline using domain randomization, whereby millions of procedurally generated, unrealistic objects are synthesized for training purposes. The hypothesis is that exposure to a wide variety of object shapes enables deep neural networks to generalize effectively to real-world grasping tasks. The experiments confirm this hypothesis, demonstrating a 92% success rate on YCB dataset objects and an 80% success rate in real-world tests using models trained exclusively on simulated data.
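To make the data-generation idea concrete, here is a minimal sketch of procedurally composing objects from randomized primitives. This is an illustrative simplification, not the paper's actual pipeline (which attaches decomposed ShapeNet primitives and exports meshes for a physics simulator); the function name, shape set, and scale ranges are all assumptions.

```python
import random

def generate_random_object(rng, min_parts=2, max_parts=8):
    """Compose a synthetic object from randomly scaled, randomly placed
    primitives. Hypothetical stand-in for the paper's pipeline, which
    instead recombines decomposed ShapeNet primitives."""
    num_parts = rng.randint(min_parts, max_parts)
    parts = []
    for _ in range(num_parts):
        parts.append({
            "shape": rng.choice(["box", "cylinder", "sphere"]),
            # Independent per-axis scales make most objects look
            # unrealistic -- which is the point of domain randomization.
            "scale": [rng.uniform(0.01, 0.08) for _ in range(3)],
            # Offset of the primitive relative to the object origin.
            "offset": [rng.uniform(-0.05, 0.05) for _ in range(3)],
        })
    return parts

rng = random.Random(0)
dataset = [generate_random_object(rng) for _ in range(1000)]
```

Because each object is a cheap random composition, scaling to the millions of unique instances the paper uses is limited only by simulation time, not by mesh collection.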

Methodology and Results

The paper presents a detailed description of the data generation process, involving the synthesis of objects from decomposed ShapeNet primitives. The objects serve as the basis for training an autoregressive grasp planning model. This model employs deep neural architectures that map sensor inputs to a multimodal distribution over potential grasps. A unique feature of the model is its ability to efficiently sample grasps, leveraging the autoregressive approach to factorize complex probability distributions.
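The autoregressive factorization can be sketched as follows: a grasp g = (g_1, ..., g_D) is sampled one coordinate at a time, with each coordinate drawn from a discretized distribution conditioned on the scene and the coordinates chosen so far, i.e. p(g | s) = prod_i p(g_i | s, g_<i). In this sketch the neural network is replaced by a toy `logits_fn`; the real model conditions on learned image features, so the function shape and bin count here are assumptions.

```python
import numpy as np

def sample_grasp(logits_fn, num_dims, num_bins, rng):
    """Sample one grasp autoregressively. `logits_fn(prefix)` stands in
    for the learned network: it maps the already-chosen coordinates to
    unnormalized log-probabilities over `num_bins` discrete values for
    the next coordinate."""
    grasp = []
    for _ in range(num_dims):
        logits = logits_fn(grasp)
        probs = np.exp(logits - logits.max())
        probs /= probs.sum()                    # softmax over bins
        grasp.append(int(rng.choice(num_bins, p=probs)))
    return grasp

# Toy stand-in network: biases each coordinate toward the previous one,
# producing a smooth (but meaningless) conditional distribution.
def toy_logits(prefix, num_bins=16):
    center = prefix[-1] if prefix else num_bins // 2
    return -0.5 * (np.arange(num_bins) - center) ** 2

rng = np.random.default_rng(0)
samples = [sample_grasp(toy_logits, num_dims=5, num_bins=16, rng=rng)
           for _ in range(20)]
```

Factorizing this way lets the model represent highly multimodal grasp distributions while keeping sampling cheap: each draw is just D sequential categorical samples.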

Key findings include:

  • A >90% success rate in simulation on previously unseen realistic objects.
  • Comparable performance between models trained on unrealistic and realistic datasets.
  • The autoregressive model's ability to identify a successful grasp within the top 20 samples for 96% of test objects.
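The last finding suggests a simple test-time strategy: draw a small batch of candidate grasps from the generative model and execute the one it scores highest. The sketch below illustrates that sample-and-rank loop; `sample_fn` and `score_fn` are hypothetical hooks for the learned sampler and its likelihood or success score, not the paper's actual API.

```python
def plan_best_grasp(sample_fn, score_fn, num_samples=20):
    """Draw `num_samples` candidate grasps and return the highest-scoring
    one -- mirroring the evaluation in which a successful grasp appeared
    among the top 20 samples for 96% of test objects."""
    candidates = [sample_fn() for _ in range(num_samples)]
    return max(candidates, key=score_fn)
```

Because the autoregressive model exposes exact per-sample probabilities, ranking candidates costs only a forward pass each, and the sample budget directly trades planning time for reliability.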

Implications

The implications of this research are multifaceted. Practically, the approach can significantly reduce the resource-intensive task of collecting realistic object data by utilizing procedurally generated objects. Theoretically, it emphasizes the importance of diversity in training datasets for generalization tasks in robotics. By confirming that models trained with domain randomization can perform well on real-world tasks, the paper opens avenues for applying similar techniques to other robotic manipulation challenges, such as tool use and grasping in cluttered environments.

Future Directions

The paper suggests several directions for future research. Scaling up training sets and refining feedback mechanisms from failed grasps could enhance model performance further. Integrating additional sensor modalities like haptic feedback and exploring visual servoing to complement grasp planning are promising avenues. Moreover, the potential application of domain randomization across other robotic tasks offers an exciting prospect for researchers aiming to improve robot autonomy and adaptability.

In conclusion, the application of domain randomization and autoregressive models to robotic grasping presents a compelling method to address generalization challenges, reflecting a significant step forward in the pursuit of robust, adaptable robotic systems.