Multi-Shot and Scene-Transition Video Generation
Develop text-to-video generation methods that can synthesize multi-shot videos with explicit scene transitions, extending beyond single-shot clips while preserving visual quality and temporal coherence.
References
As for limitations, our method is not designed to generate videos that consist of multiple shots, or that involve transitions between scenes. Generating such content remains an open challenge for future research.
— Lumiere: A Space-Time Diffusion Model for Video Generation
(2401.12945 - Bar-Tal et al., 23 Jan 2024) in Section 6, Conclusion