Extending reinforcement learning to distilled autoregressive video models
Develop reinforcement learning techniques that can be effectively applied to highly efficient distilled autoregressive video models for streaming video generation alignment, rather than only to heavy pre-distilled teacher models.
References
Extending RL to highly efficient distilled AR video models remains an open problem.
— Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models
(2603.17051 - Zhang et al., 17 Mar 2026) in Section 2.3, Related Work — Reinforcement Learning for Generative Models