Long-horizon identity and dynamics stability in image-to-video generation
Establish image-to-video generation mechanisms that ensure persistent agent/object identity and stable motion dynamics across long-horizon interactive tasks in VBVR-Bench, specifically preventing duplication and flickering while maintaining coherent, legal paths; this is exemplified by failures observed for VBVR-Wan2.2 on the Multiple Keys for One Door maze task (G-47).
References
However, it can still suffer from control failures such as agent duplication/flickering when traversing a coherent path, indicating that maintaining identity and stable dynamics over long horizons remains an open problem.
— A Very Big Video Reasoning Suite
(2602.20159 - Wang et al., 23 Feb 2026) in Section 6.3 (Qualitative Analysis), Limitations and failure modes (VBVR-Wan2.2)