Balancing fast inference and multi-modality in diffusion-based robotic policies
Determine algorithmic strategies for diffusion-based visuomotor policies that enable fast inference while preserving and promoting multi-modal action distributions in complex robotic manipulation settings, resolving the trade-off between stochastic reverse-SDE sampling (diverse but slow) and deterministic probability-flow ODE distillation (fast but prone to mode collapse).
Sponsor
References
Consequently, an open problem is how diffusion-based methods can retain fast inference while preserving and promoting multi-modality in complex robotic settings.
— Hybrid Consistency Policy: Decoupling Multi-Modal Diversity and Real-Time Efficiency in Robotic Manipulation
(2510.26670 - Zhao et al., 30 Oct 2025) in Section 1 (Introduction)