Simultaneous real-time efficiency and high quality in purely autoregressive long video generation
Develop purely autoregressive (AR) long video generation models and training procedures that simultaneously achieve real-time inference efficiency and maintain high visual quality over long horizons in text-to-video generation.
References
Despite the promise of purely AR for long video generation, achieving real-time efficiency and maintaining high quality simultaneously remains an open challenge.
— LongLive: Real-time Interactive Long Video Generation
(2509.22622 - Yang et al., 26 Sep 2025) in Appendix, Section "General Related Work", subsection "Autoregressive Long Video Generation"