Grasp Pose Detection in Point Clouds (1706.09911v1)

Published 29 Jun 2017 in cs.RO

Abstract: Recently, a number of grasp detection methods have been proposed that can be used to localize robotic grasp configurations directly from sensor data without estimating object pose. The underlying idea is to treat grasp perception analogously to object detection in computer vision. These methods take as input a noisy and partially occluded RGBD image or point cloud and produce as output pose estimates of viable grasps, without assuming a known CAD model of the object. Although these methods generalize grasp knowledge to new objects well, they have not yet been demonstrated to be reliable enough for wide use. Many grasp detection methods achieve grasp success rates (grasp successes as a fraction of the total number of grasp attempts) between 75% and 95% for novel objects presented in isolation or in light clutter. Not only are these success rates too low for practical grasping applications, but the light clutter scenarios that are evaluated often do not reflect the realities of real world grasping. This paper proposes a number of innovations that together result in a significant improvement in grasp detection performance. The specific improvement in performance due to each of our contributions is quantitatively measured either in simulation or on robotic hardware. Ultimately, we report a series of robotic experiments that average a 93% end-to-end grasp success rate for novel objects presented in dense clutter.

Authors (4)
  1. Andreas ten Pas (9 papers)
  2. Marcus Gualtieri (10 papers)
  3. Kate Saenko (178 papers)
  4. Robert Platt (70 papers)
Citations (510)

Summary

Grasp Pose Detection in Point Clouds: An Expert Overview

In "Grasp Pose Detection in Point Clouds," ten Pas et al. address a central challenge in robotic grasping: localizing viable grasp configurations directly from sensor data, specifically point clouds, without first estimating object pose. This shift treats grasp perception much like object detection in computer vision, removing the traditional dependence on CAD models of the objects to be grasped.

Contributions and Methodology

The paper details three primary algorithmic contributions that improve grasp detection accuracy, especially in cluttered environments:

  1. Grasp Hypothesis Generation: The proposed method generates grasp hypotheses without requiring precise object segmentation, sampling candidates on any visible surface of the cloud. This flexibility is critical for handling densely cluttered scenes (a minimal sketch of this step follows the list).
  2. Grasp Descriptor Incorporating Surface Normals: The new descriptor encodes surface normals and multiple viewing angles, better capturing the local geometric properties relevant for grasping, and improves grasp classification accuracy by approximately 10% over prior methods.
  3. Incorporation of Prior Knowledge: By integrating prior knowledge about object categories, the paper reports an additional 2% boost in grasp classification accuracy. This leverages pre-existing information to refine the classification process.
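
To make the candidate-generation step concrete, the sketch below samples seed points from the cloud and enumerates hand orientations around each. This is a minimal illustration in Python/NumPy under our own assumptions: the function name and the normal-aligned frame construction are hypothetical, and the paper's actual implementation estimates a local Darboux frame from surface curvature and additionally checks each hand pose for collisions and for points inside the closing region.

```python
import numpy as np

def sample_grasp_candidates(points, normals, num_seeds=100, num_rotations=8, rng=None):
    """Sample 6-DOF grasp hypotheses directly from a point cloud.

    points  : (N, 3) array of 3D points.
    normals : (N, 3) array of unit surface normals.
    Returns a list of (position, rotation) hand poses. Hypothetical helper
    illustrating the idea of sampling on any visible surface; it omits the
    collision and closing-region checks the paper's method performs.
    """
    rng = rng if rng is not None else np.random.default_rng(0)
    num_seeds = min(num_seeds, len(points))
    candidates = []
    for i in rng.choice(len(points), size=num_seeds, replace=False):
        p, n = points[i], normals[i]
        approach = -n / np.linalg.norm(n)      # hand approaches the surface
        up = np.array([0.0, 0.0, 1.0])
        if abs(approach @ up) > 0.99:          # avoid a degenerate cross product
            up = np.array([1.0, 0.0, 0.0])
        axis = np.cross(approach, up)
        axis /= np.linalg.norm(axis)
        binormal = np.cross(approach, axis)
        # Enumerate hand orientations by rotating about the approach axis.
        for theta in np.linspace(-np.pi / 2, np.pi / 2, num_rotations):
            c, s = np.cos(theta), np.sin(theta)
            # Rotation matrix whose columns are the hand frame axes.
            R = np.column_stack([approach,
                                 c * axis + s * binormal,
                                 -s * axis + c * binormal])
            candidates.append((p.copy(), R))
    return candidates
```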

To score candidates, the paper voxelizes the points within each candidate's hand closing region and projects them onto multiple 2D views, producing heightmap-style image channels augmented with surface normals; a convolutional neural network (CNN) then classifies each candidate as a viable grasp or not. This multi-view encoding captures the relevant local 3D geometry while keeping the classifier a standard image CNN, as sketched below.
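
The following sketch, using NumPy and PyTorch, shows one plausible version of this encoding and classifier. The channel layout (one heightmap plus three averaged-normal channels per view, over three orthogonal views, giving 12 channels) and the LeNet-style network are illustrative approximations: the paper's representation is a 15-channel, 60x60 multi-view image that also encodes occluded regions, and `encode_candidate` and `GraspCNN` are hypothetical names.

```python
import numpy as np
import torch
import torch.nn as nn

def encode_candidate(points, normals, grid=60):
    """Project points in a grasp candidate's closing region into image channels.

    Assumes `points` are already expressed in the hand frame and scaled to
    [0, 1]^3. Per view axis: one heightmap channel plus three channels of
    cell-averaged surface normals (4 channels x 3 views = 12 total).
    """
    channels = []
    for axis in range(3):                       # project along x, y, z in turn
        u, v = [a for a in range(3) if a != axis]
        img = np.zeros((4, grid, grid), dtype=np.float32)
        count = np.zeros((grid, grid), dtype=np.float32)
        iu = np.clip((points[:, u] * grid).astype(int), 0, grid - 1)
        iv = np.clip((points[:, v] * grid).astype(int), 0, grid - 1)
        for k in range(len(points)):
            # Heightmap: keep the tallest point per cell along the view axis.
            img[0, iu[k], iv[k]] = max(img[0, iu[k], iv[k]], points[k, axis])
            img[1:, iu[k], iv[k]] += normals[k] # accumulate normals per cell
            count[iu[k], iv[k]] += 1
        nz = count > 0
        img[1:, nz] /= count[nz]                # average normals per cell
        channels.append(img)
    return torch.from_numpy(np.concatenate(channels))   # (12, 60, 60)

class GraspCNN(nn.Module):
    """Small LeNet-style classifier over the 12-channel candidate image;
    the paper likewise uses a compact network of this flavor."""
    def __init__(self, in_ch=12):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, 20, 5), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(20, 50, 5), nn.ReLU(), nn.MaxPool2d(2),
            nn.Flatten(),
            nn.Linear(50 * 12 * 12, 500), nn.ReLU(),
            nn.Linear(500, 2),                  # grasp / no-grasp logits
        )
    def forward(self, x):
        return self.net(x)

# Usage: logits = GraspCNN()(encode_candidate(pts, nrm).unsqueeze(0))
```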

Evaluation and Results

The approach averages a 93% end-to-end grasp success rate on novel objects in dense clutter, a notable result given that prior methods reach 75% to 95% only for objects presented in isolation or in light clutter. The dense clutter benchmark task substantiates these results and demonstrates practical applicability.

The paper further examines how the candidate sampling strategy, the input representation, and classifier pretraining affect classification performance. These ablations highlight the system's ability to retain useful recall while operating at high precision, which is vital for minimizing false-positive grasps in practical deployments; a sketch of this operating-point selection follows.
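
One concrete way to realize such a high-precision operating point is to calibrate the classifier threshold on a validation set: choose the lowest score cutoff whose precision still meets a target, thereby maximizing recall subject to that constraint. This is our own illustration of the idea the paper motivates, not code from the paper; `threshold_for_precision` is a hypothetical helper.

```python
import numpy as np

def threshold_for_precision(scores, labels, target_precision=0.99):
    """Return the lowest score threshold meeting `target_precision` on a
    validation set, maximizing recall subject to that constraint.

    scores : classifier confidences for grasp candidates.
    labels : ground-truth 0/1 grasp outcomes for the same candidates.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels)
    order = np.argsort(-scores)                 # rank by descending confidence
    sorted_scores, sorted_labels = scores[order], labels[order]
    tp = np.cumsum(sorted_labels)               # true positives at each cutoff
    fp = np.cumsum(1 - sorted_labels)           # false positives at each cutoff
    precision = tp / np.maximum(tp + fp, 1)
    ok = np.where(precision >= target_precision)[0]
    if len(ok) == 0:
        return None                             # target unreachable on this set
    return sorted_scores[ok[-1]]                # deepest cutoff still precise enough

# Usage: thr = threshold_for_precision(val_scores, val_labels, 0.99)
#        accepted = [c for c, s in zip(candidates, test_scores) if s >= thr]
```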

Implications and Future Directions

The innovations presented have implications for both practical applications and the theoretical advancement of grasp detection. Practically, the enhanced grasp detection system can be integrated into robotic systems operating in less controlled, real-world environments, such as warehouses or manufacturing facilities where heavy clutter is the norm.

Theoretically, the work prompts further exploration into the balance between object-centric and grasp-centric approaches. As object category predictions inform grasp selection, integrating advanced object detection models holds promise for synergistic improvements in robotic manipulation.

Future research may explore the integration of semantic understanding, potentially using additional sensory modalities to enhance decision-making and further close the gap towards autonomous, versatile robotic systems.

By advancing grasp detection reliability without reliance on detailed object models or precise segmentation, this work makes a substantial contribution to the domain of robotic perception and manipulation, opening new avenues for robotics in complex environments.