Extending WorldPlay to longer durations, multi-agent interaction, complex physical dynamics, and broader action types
Investigate methods to extend the WorldPlay autoregressive streaming video diffusion framework to support generation of longer-duration videos, enable multi-agent interactions within the simulated environments, incorporate more complex physical dynamics, and expand the set of controllable action types beyond the current design.
Sponsor
References
While WorldPlay demonstrates strong performance, extending the framework to generate videos with longer durations, multi-agent interactions, and more complex physical dynamics still requires further investigation. Moreover, Expanding the action types to a broader set is another promising direction. These challenges remain open for future research.
— WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling
(2512.14614 - Sun et al., 16 Dec 2025) in Section: Limitations