Same Object, Different Grasps: Data and Semantic Knowledge for Task-Oriented Grasping
The paper "Same Object, Different Grasps: Data and Semantic Knowledge for Task-Oriented Grasping" introduces a novel approach to robotic grasping that integrates semantic knowledge to enhance task-oriented grasping capabilities. Despite the advancements made in robotic manipulation techniques, task-oriented grasping has lagged behind due to the limitations in existing datasets concerning the diversity of objects and tasks. This research seeks to bridge this gap with the introduction of the TaskGrasp dataset and the GCNGrasp framework, which is driven by semantic information encoded in knowledge graphs.
TaskGrasp Dataset
The TaskGrasp dataset comprises 250,000 task-oriented grasps covering 191 objects and 56 tasks, with accompanying RGB-D data. It substantially surpasses previous datasets in both size and diversity, enabling the study of generalization in task-oriented grasping. This breadth makes it possible to examine task-oriented grasping across object instances, categories, and tasks, addressing the limitations of scale and variety in earlier datasets.
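The paper does not prescribe a loading API, but the structure of a single example can be sketched. Below is a minimal Python illustration with hypothetical field names (the actual TaskGrasp release defines its own file layout):

```python
from dataclasses import dataclass
import numpy as np

@dataclass
class TaskGraspExample:
    """One task-oriented grasp example (hypothetical field names;
    the real TaskGrasp release defines its own format)."""
    point_cloud: np.ndarray   # (N, 3) object points from fused RGB-D views
    grasp_pose: np.ndarray    # (4, 4) homogeneous 6-DOF gripper pose
    object_name: str          # e.g. "spatula"
    task: str                 # e.g. "flip"
    label: bool               # True if this grasp suits this task

# A single synthetic example for illustration
example = TaskGraspExample(
    point_cloud=np.random.rand(4096, 3),
    grasp_pose=np.eye(4),
    object_name="spatula",
    task="flip",
    label=True,
)
```

The key point is that each record ties a grasp not just to an object but to a task label, so the same object appears with different positive grasps depending on the task.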
GCNGrasp Framework
Central to this research is the GCNGrasp framework, which uses Graph Convolutional Networks (GCNs) to exploit semantic relationships embedded in a knowledge graph. This allows the model to generalize to new object instances, classes, and tasks. By incorporating semantic knowledge, encoded as object-task relations and class hierarchies, the framework outperforms baseline models that do not use such knowledge.
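To make the mechanism concrete, here is a minimal sketch of GCN-style message passing over a toy object-task graph, followed by a grasp-suitability score. This illustrates the general idea rather than the authors' exact architecture; the toy graph, embedding dimensions, and the score-head fusion are invented for the example:

```python
import torch
import torch.nn as nn

def normalize_adj(adj):
    """Symmetrically normalize: A_hat = D^{-1/2} (A + I) D^{-1/2}."""
    a = adj + torch.eye(adj.size(0))
    d_inv_sqrt = a.sum(dim=1).pow(-0.5)
    return d_inv_sqrt[:, None] * a * d_inv_sqrt[None, :]

class GCNLayer(nn.Module):
    """One graph-convolution step: H' = ReLU(A_hat @ H @ W)."""
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.lin = nn.Linear(in_dim, out_dim)

    def forward(self, h, a_hat):
        return torch.relu(a_hat @ self.lin(h))

# Toy knowledge graph: nodes 0-1 are object classes ("spatula", "pan"),
# node 2 is a task ("flip"); edges encode object-task relations.
adj = torch.tensor([[0., 0., 1.],
                    [0., 0., 1.],
                    [1., 1., 0.]])
a_hat = normalize_adj(adj)

node_feat = torch.randn(3, 32)            # initial semantic node embeddings
gcn1, gcn2 = GCNLayer(32, 32), GCNLayer(32, 32)
h = gcn2(gcn1(node_feat, a_hat), a_hat)   # embeddings after two hops

# Fuse a grasp feature (stand-in for a point-cloud encoder output)
# with the object and task node embeddings, then score suitability.
grasp_feat = torch.randn(1, 32)
score_head = nn.Linear(32 * 3, 1)
fused = torch.cat([grasp_feat, h[0:1], h[2:3]], dim=1)
print(torch.sigmoid(score_head(fused)).item())  # P(grasp fits task)
```

Because the task node's representation is built from its graph neighborhood, an unseen object or task that is connected to familiar nodes inherits useful structure, which is what enables the generalization described above.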
Evaluation and Results
The efficacy of the proposed GCNGrasp framework is substantiated by a notable improvement in task-oriented grasping: approximately 12% better performance in held-out task settings than baselines lacking semantic integration. The model also exhibits strong zero-shot generalization to previously unseen tasks and object categories, underscoring the advantage of semantic knowledge integration.
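A held-out evaluation of this kind can be sketched as follows: entire tasks are excluded from training so that the test set probes generalization to tasks the model has never seen. The split function below is illustrative only, operating on records with a `.task` attribute like the example sketched earlier; the paper defines its own official splits:

```python
import random

def heldout_task_split(examples, heldout_frac=0.25, seed=0):
    """Put every example of a randomly chosen subset of tasks into
    the test set, so testing probes zero-shot task generalization.
    (Illustrative; not the paper's official split protocol.)"""
    tasks = sorted({ex.task for ex in examples})
    random.Random(seed).shuffle(tasks)
    n_test = max(1, int(len(tasks) * heldout_frac))
    test_tasks = set(tasks[:n_test])
    train = [ex for ex in examples if ex.task not in test_tasks]
    test = [ex for ex in examples if ex.task in test_tasks]
    return train, test
```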
Implications and Future Developments
The implications of this research are twofold. Practically, it enhances robotic manipulation capabilities, enabling robots to perform a wider array of tasks in real-world environments with limited prior information. Theoretically, it showcases the potential of semantic knowledge integration for advancing robotic learning paradigms. Future developments may include the expansion of task-oriented datasets, refined semantic graph constructions, and improved transfer learning techniques for robotic grasping applications.
In summary, the integration of semantic knowledge through the GCNGrasp framework represents a significant step forward in task-oriented grasping research. The results suggest that this approach is promising for achieving more efficient and generalized grasping strategies in varied environments, paving the way for advanced studies and applications in intelligent robotic systems.