Training to improve planning reliability and extend the effective horizon in video diffusion models
Determine whether training approaches can produce video diffusion models that plan more reliably and over longer horizons than currently observed, overcoming the sharp failure cliff on trajectories exceeding the single-generation window.
References
Whether early commitment and horizon limitations manifest similarly in non-spatial reasoning modalities, and whether training can produce models that plan more reliably or over longer horizons, are important open questions.
— Video Models Reason Early: Exploiting Plan Commitment for Maze Solving
(2603.30043 - Newman et al., 31 Mar 2026) in Section 8 (Conclusion), final paragraph