Analysis of "Jacquard: A Large Scale Dataset for Robotic Grasp Detection"
The paper "Jacquard: A Large Scale Dataset for Robotic Grasp Detection," authored by Amaury Depierre, Emmanuel Dellandrea, and Liming Chen, presents a comprehensive approach to circumventing the data constraints commonly encountered in robotic grasp detection research. The authors address a critical challenge in the field: deep neural networks (DNNs) for predicting graspable locations require large labeled datasets, which are costly and time-consuming to produce.
Introduction
Robotic grasp detection using deep learning has traditionally depended on manually labeled datasets or data gathered through physical trials, both of which are resource-intensive. Automated data generation through simulation, while scalable, often compromises on realism or diversity. The Jacquard dataset introduced in this paper bridges this gap by leveraging automated simulation while maintaining the diversity and realism needed for effective neural network training.
Dataset Generation Methodology
The Jacquard dataset is built from a subset of ShapeNet CAD models, from which over one million unique grasp annotations were generated. The authors annotate grasp attempts in a simulated environment that mimics real-world conditions, exploiting physics simulation for scalability and annotation diversity. The dataset comprises RGB-D images annotated with successful grasp locations produced through rigorous simulation protocols.
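The paper describes each grasp by an image-plane position, an orientation, and a gripper opening, so the simulation-generated annotations can be modeled as simple records. The sketch below assumes a hypothetical semicolon-separated line format and field set (center, angle, opening, jaw size) purely for illustration; the actual on-disk layout of the released dataset may differ.

```python
from dataclasses import dataclass

@dataclass
class Grasp:
    """One annotated grasp, modeled after the parameters the paper describes.

    Field names and the on-disk format are assumptions for illustration,
    not the dataset's official schema.
    """
    x: float         # grasp center, image x coordinate (pixels)
    y: float         # grasp center, image y coordinate (pixels)
    theta: float     # jaw orientation (degrees)
    opening: float   # gripper opening (pixels)
    jaw_size: float  # width of the gripper jaws (pixels)

def parse_grasp_line(line: str) -> Grasp:
    # Hypothetical record layout: "x;y;theta;opening;jaw_size"
    x, y, theta, opening, jaw = (float(v) for v in line.strip().split(";"))
    return Grasp(x, y, theta, opening, jaw)

# Example: one annotation line for a grasp at (312.5, 208.0), rotated 45 degrees
g = parse_grasp_line("312.5;208.0;45.0;60.0;20.0")
```

A flat per-image list of such records is enough to drive both training-label generation and the rectangle-based evaluation discussed later.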
Neural Network Training and Generalization
Building on AlexNet, the authors trained a convolutional neural network (CNN) on the Jacquard dataset and assessed its ability to generalize to real-world images, contrasting its performance with a network trained on the Cornell dataset, a well-established benchmark in the field. The Jacquard-trained network outperformed its counterpart, showing higher prediction accuracy and better generalization to previously unseen objects, as demonstrated through successful real-world robotic trials with diverse objects. Notably, in these real-world trials the Jacquard-trained model achieved a substantially higher grasp success rate than one trained solely on a real-world dataset such as Cornell.
Evaluation Criteria and Results
Alongside the traditional rectangle metric, the paper introduces a novel simulated grasp trial (SGT) evaluation criterion that aligns more closely with realistic robotic performance by replaying predicted grasps within the simulation environment. This criterion was instrumental in evaluating the generalization capabilities of neural networks trained on the Jacquard dataset.
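For context, the traditional rectangle metric counts a predicted grasp as correct when its rectangle overlaps a ground-truth rectangle with an intersection over union above 25% and the two orientations differ by less than 30 degrees. Below is a minimal sketch of that criterion, approximating the IoU of rotated rectangles by grid rasterization; the function names are illustrative, and the thresholds follow the standard Cornell-style convention rather than anything specific to this paper.

```python
import math

def rect_corners(x, y, w, h, theta_deg):
    """Corner points of a w-by-h rectangle centered at (x, y), rotated theta."""
    t = math.radians(theta_deg)
    c, s = math.cos(t), math.sin(t)
    pts = [(-w / 2, -h / 2), (w / 2, -h / 2), (w / 2, h / 2), (-w / 2, h / 2)]
    return [(x + c * px - s * py, y + s * px + c * py) for px, py in pts]

def inside(pt, x, y, w, h, theta_deg):
    """Test pt against the rectangle by rotating it into the rect's frame."""
    t = math.radians(theta_deg)
    dx, dy = pt[0] - x, pt[1] - y
    rx = math.cos(t) * dx + math.sin(t) * dy    # inverse rotation
    ry = -math.sin(t) * dx + math.cos(t) * dy
    return abs(rx) <= w / 2 and abs(ry) <= h / 2

def rect_iou(a, b, step=1.0):
    """Approximate IoU of two rotated rects (x, y, w, h, theta_deg)
    by sampling a grid over their joint bounding box."""
    xs = [p[0] for r in (a, b) for p in rect_corners(*r)]
    ys = [p[1] for r in (a, b) for p in rect_corners(*r)]
    inter = union = 0
    gy = min(ys)
    while gy <= max(ys):
        gx = min(xs)
        while gx <= max(xs):
            in_a = inside((gx, gy), *a)
            in_b = inside((gx, gy), *b)
            inter += in_a and in_b
            union += in_a or in_b
            gx += step
        gy += step
    return inter / union if union else 0.0

def rectangle_metric(pred, gt):
    """IoU > 0.25 and orientation difference < 30 degrees."""
    angle_diff = abs(pred[4] - gt[4]) % 180
    angle_diff = min(angle_diff, 180 - angle_diff)  # rects are 180-symmetric
    return angle_diff < 30 and rect_iou(pred, gt) > 0.25
```

The paper's point is that this geometric test can disagree with physical outcomes, which is what motivates replaying grasps in simulation (SGT) instead.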
Practical Implications and Future Directions
The implications of this work are significant for advancing practical robotic applications, as it demonstrates that a large simulated dataset can effectively train networks capable of reliable real-world performance. This advances automated grasp prediction strategies, potentially lowering the barriers to deploying robots in unstructured settings containing objects of diverse shapes.
As for future work, the authors propose expanding the dataset to encompass more complex scenes with multiple interacting objects and enhancing the grasp quality assessment metrics to refine the predictor's capability.
Conclusion
The paper contributes a substantial resource and methodology for robotic grasp detection research. By automating the data generation process while ensuring realistic and varied input, Jacquard provides a scalable alternative that retains the critical quality of training data, offering promising directions for further experimentation and deployment in real-world robotic systems.