Learning Effective Physical Representations for Video Generation
Develop a principled methodology to learn an effective physical representation suitable for conditioning video generation models, given the absence of a well-established definition of physical representation and the lack of straightforward supervision signals for training such a representation.
References
However, how to learn an effective physical representation for video generation remains an open question.
— PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning
(2510.13809 - Ji et al., 15 Oct 2025) in Section 1 (Introduction)