Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation (2406.01586v1)

Published 3 Jun 2024 in cs.RO and cs.AI

Abstract: Diffusion models have been verified to be effective in generating complex distributions from natural images to motion trajectories. Recent diffusion-based methods show impressive performance in 3D robotic manipulation tasks, whereas they suffer from severe runtime inefficiency due to multiple denoising steps, especially with high-dimensional observations. To this end, we propose a real-time robotic manipulation model named ManiCM that imposes the consistency constraint to the diffusion process, so that the model can generate robot actions in only one-step inference. Specifically, we formulate a consistent diffusion process in the robot action space conditioned on the point cloud input, where the original action is required to be directly denoised from any point along the ODE trajectory. To model this process, we design a consistency distillation technique to predict the action sample directly instead of predicting the noise within the vision community for fast convergence in the low-dimensional action manifold. We evaluate ManiCM on 31 robotic manipulation tasks from Adroit and Metaworld, and the results demonstrate that our approach accelerates the state-of-the-art method by 10 times in average inference speed while maintaining competitive average success rate.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Guanxing Lu (7 papers)
  2. Zifeng Gao (2 papers)
  3. Tianxing Chen (12 papers)
  4. Wenxun Dai (9 papers)
  5. Ziwei Wang (128 papers)
  6. Yansong Tang (81 papers)
Citations (7)

Summary

We haven't generated a summary for this paper yet.