Appropriate Role of Generative Video Models in Robotic Manipulation
Determine the appropriate functional role that large generative video models should serve within robotic manipulation systems to effectively leverage their visual predictions given the embodiment gap between humans and robots. The goal is to ascertain how such models should be integrated into manipulation pipelines so their imagined object motions can be translated into executable robot actions.
Sponsor
References
Despite their promise, it remains unclear what role such models should serve in a robot manipulation system.
— Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow
(2512.24766 - Dharmarajan et al., 31 Dec 2025) in Section 1 (Introduction)