Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Part-Guided 3D RL for Sim2Real Articulated Object Manipulation (2404.17302v1)

Published 26 Apr 2024 in cs.RO, cs.AI, and cs.CV

Abstract: Manipulating unseen articulated objects through visual feedback is a critical but challenging task for real robots. Existing learning-based solutions mainly focus on visual affordance learning or other pre-trained visual models to guide manipulation policies, which face challenges for novel instances in real-world scenarios. In this paper, we propose a novel part-guided 3D RL framework, which can learn to manipulate articulated objects without demonstrations. We combine the strengths of 2D segmentation and 3D RL to improve the efficiency of RL policy training. To improve the stability of the policy on real robots, we design a Frame-consistent Uncertainty-aware Sampling (FUS) strategy to get a condensed and hierarchical 3D representation. In addition, a single versatile RL policy can be trained on multiple articulated object manipulation tasks simultaneously in simulation and shows great generalizability to novel categories and instances. Experimental results demonstrate the effectiveness of our framework in both simulation and real-world settings. Our code is available at https://github.com/THU-VCLab/Part-Guided-3D-RL-for-Sim2Real-Articulated-Object-Manipulation.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Pengwei Xie (53 papers)
  2. Rui Chen (310 papers)
  3. Siang Chen (10 papers)
  4. Yuzhe Qin (37 papers)
  5. Fanbo Xiang (14 papers)
  6. Tianyu Sun (14 papers)
  7. Jing Xu (244 papers)
  8. Guijin Wang (23 papers)
  9. Hao Su (219 papers)
Citations (4)

Summary

We haven't generated a summary for this paper yet.