MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation (2401.04468v1)
Published 9 Jan 2024 in cs.CV and cs.AI
Abstract: The growing demand for high-fidelity video generation from textual descriptions has catalyzed significant research in this field. In this work, we introduce MagicVideo-V2, which integrates a text-to-image model, a video motion generator, a reference image embedding module, and a frame interpolation module into an end-to-end video generation pipeline. Benefiting from these architectural designs, MagicVideo-V2 can generate aesthetically pleasing, high-resolution videos with remarkable fidelity and smoothness. In a large-scale user evaluation, it demonstrates superior performance over leading text-to-video systems such as Runway, Pika 1.0, Morph, Moon Valley, and the Stable Video Diffusion model.
- Gen-2. https://research.runwayml.com/gen2. Accessed: 2023-11-16.
- MoonValley. https://moonvalley.ai/. Accessed: 2023-11-16.
- Morph. https://www.morphstudio.com/. Accessed: 2023-11-16.
- Pika 1.0. https://pika.art/. Accessed: 2023-12-26.
- SVD-XT. https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt. Accessed: 2023-11-27.
- Stable video diffusion: Scaling latent video diffusion models to large datasets, 2023.
- Multiple video frame interpolation via enhanced deformable separable convolution, 2021.
- Ldmvfi: Video frame interpolation with latent diffusion models, 2023.
- Emu video: Factorizing text-to-video generation by explicit image conditioning, 2023.
- Animatediff: Animate your personalized text-to-image diffusion models without specific tuning, 2023.
- Videopoet: A large language model for zero-shot video generation, 2023.
- High-resolution image synthesis with latent diffusion models. In CVPR, 2022.
- Extracting motion and appearance via inter-frame attention for efficient video frame interpolation. In CVPR, 2023a.
- Adding conditional control to text-to-image diffusion models, 2023b.
- Magicvideo: Efficient video generation with latent diffusion models, 2023.
- Weimin Wang
- Jiawei Liu
- Zhijie Lin
- Jiangqiao Yan
- Shuo Chen
- Chetwin Low
- Tuyen Hoang
- Jie Wu
- Jun Hao Liew
- Hanshu Yan
- Daquan Zhou
- Jiashi Feng