AnimateDiff-Lightning: Cross-Model Diffusion Distillation (2403.12706v1)
Published 19 Mar 2024 in cs.CV and cs.AI
Abstract: We present AnimateDiff-Lightning for lightning-fast video generation. Our model uses progressive adversarial diffusion distillation to achieve new state-of-the-art in few-step video generation. We discuss our modifications to adapt it for the video modality. Furthermore, we propose to simultaneously distill the probability flow of multiple base diffusion models, resulting in a single distilled motion module with broader style compatibility. We are pleased to release our distilled AnimateDiff-Lightning model for the community's use.
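The abstract describes distilling across several base diffusion models at once, so that a single motion module sees all of them during training. The sketch below is only a toy illustration of one way such training could be scheduled: each step pairs the shared module with one frozen base model, chosen round-robin. The function name, the base-model names, and the round-robin policy itself are assumptions made for illustration; the abstract does not specify how the models are interleaved.

```python
from collections import Counter
from itertools import cycle


def assign_base_models(base_models, num_steps):
    """Return the base model paired with each training step (round-robin).

    Hypothetical helper: the scheduling policy is an assumption, not the
    paper's stated procedure.
    """
    if not base_models:
        raise ValueError("need at least one base model")
    picker = cycle(base_models)
    return [next(picker) for _ in range(num_steps)]


# Placeholder base-model names and step count, for illustration only.
schedule = assign_base_models(["realistic_base", "anime_base", "cartoon_base"], 9)

# Count how many steps each base model contributes to the shared module.
counts = Counter(schedule)
```

With a step count that is a multiple of the number of base models, every base model contributes the same number of updates, which is the property a shared module with broad style compatibility would plausibly want.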
Authors: Shanchuan Lin, Xiao Yang