Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Simultaneous Semantic and Collision Learning for 6-DoF Grasp Pose Estimation (2108.02425v2)

Published 5 Aug 2021 in cs.RO and cs.CV

Abstract: Grasping in cluttered scenes has always been a great challenge for robots, due to the requirement of the ability to well understand the scene and object information. Previous works usually assume that the geometry information of the objects is available, or utilize a step-wise, multi-stage strategy to predict the feasible 6-DoF grasp poses. In this work, we propose to formalize the 6-DoF grasp pose estimation as a simultaneous multi-task learning problem. In a unified framework, we jointly predict the feasible 6-DoF grasp poses, instance semantic segmentation, and collision information. The whole framework is jointly optimized and end-to-end differentiable. Our model is evaluated on large-scale benchmarks as well as the real robot system. On the public dataset, our method outperforms prior state-of-the-art methods by a large margin (+4.08 AP). We also demonstrate the implementation of our model on a real robotic platform and show that the robot can accurately grasp target objects in cluttered scenarios with a high success rate. Project link: https://openbyterobotics.github.io/sscl

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Yiming Li (200 papers)
  2. Tao Kong (49 papers)
  3. Ruihang Chu (18 papers)
  4. Yifeng Li (22 papers)
  5. Peng Wang (834 papers)
  6. Lei Li (1293 papers)
Citations (44)

Summary

We haven't generated a summary for this paper yet.