DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps (2206.00927v3)

Published 2 Jun 2022 in cs.LG and stat.ML

Abstract: Diffusion probabilistic models (DPMs) are emerging powerful generative models. Despite their high-quality generation performance, DPMs still suffer from their slow sampling as they generally need hundreds or thousands of sequential function evaluations (steps) of large neural networks to draw a sample. Sampling from DPMs can be viewed alternatively as solving the corresponding diffusion ordinary differential equations (ODEs). In this work, we propose an exact formulation of the solution of diffusion ODEs. The formulation analytically computes the linear part of the solution, rather than leaving all terms to black-box ODE solvers as adopted in previous works. By applying change-of-variable, the solution can be equivalently simplified to an exponentially weighted integral of the neural network. Based on our formulation, we propose DPM-Solver, a fast dedicated high-order solver for diffusion ODEs with the convergence order guarantee. DPM-Solver is suitable for both discrete-time and continuous-time DPMs without any further training. Experimental results show that DPM-Solver can generate high-quality samples in only 10 to 20 function evaluations on various datasets. We achieve 4.70 FID in 10 function evaluations and 2.87 FID in 20 function evaluations on the CIFAR10 dataset, and a $4\sim 16\times$ speedup compared with previous state-of-the-art training-free samplers on various datasets.

Authors (6)

Cheng Lu (70 papers)
Yuhao Zhou (78 papers)
Fan Bao (30 papers)
Jianfei Chen (63 papers)
Chongxuan Li (75 papers)
Jun Zhu (424 papers)

Citations (1,042)

View on Semantic Scholar

Summary

DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps

The paper "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" explores enhancing the efficiency of sampling in Diffusion Probabilistic Models (DPMs). While DPMs are lauded for their high-quality generative capabilities across various tasks such as image and video generation, the existing methodologies suffer from a crippling inefficiency—requiring hundreds or thousands of sequential function evaluations. This work introduces DPM-Solver, a specialized ODE-based solver designed to significantly expedite the sampling process.

Background

Diffusion Probabilistic Models are generative models that progressively refine noisy data samples back to the desired distribution through a forward and reverse process defined by either discrete-time or continuous-time stochastic differential equations (SDEs). However, the conventional sampling methods for DPMs demand numerous function evaluations of the underlying models, making them prohibitively slow.

Key Contributions

Exact Formulation for Diffusion ODE Solutions: The paper proposes a novel mathematical formulation to solve diffusion ODEs. This formulation analytically decomposes the linear component of the ODE solution, bypassing the need for black-box ODE solvers traditionally used in previous works. By transforming the solution into an exponentially weighted integral of the model's neural network, the computational efficiency is significantly improved.
High-Order Dedicated Solver: Based on the new formulation, the authors introduce DPM-Solver, which includes first-order, second-order, and third-order variants to ensure high-order convergence. This dedicated solver leverages exponential integrators' numerical methods to approximate the aforementioned integral efficiently.
Adaptive Step Size Schedule: The proposed DPM-Solver also incorporates an adaptive step size mechanism. This method dynamically adjusts the step size, ensuring optimal sampling efficiency and quality. This adaptive approach further consolidates the solver's applicability across various DPM scenarios.

Experimental Results

The empirical evaluation demonstrates that DPM-Solver can produce high-quality samples within 10–20 function evaluations across multiple datasets such as CIFAR-10, CelebA, and ImageNet. Specifically, results on the CIFAR-10 dataset show that DPM-Solver achieves an FID of 4.70 with just 10 function evaluations, a substantial improvement compared to existing solvers that require hundreds of steps for similar performance levels. For instance, the paper highlights a $4\sim 16\times$ speedup in sampling time over previous state-of-the-art methods.

Implications

Practical Implications

Efficiency: The proposed solver drastically reduces the computational cost, making DPMs more viable for real-time applications.
Applicability: DPM-Solver is directly applicable to both discrete-time and continuous-time DPMs without requiring any additional training. This makes it versatile across various practical generative tasks, from image synthesis to text-to-image conversions.
Ease of Integration: The solver can be integrated as a plug-and-play component in existing models, facilitating swift transitions to faster sampling without a complete overhaul of the model architecture.

Theoretical Implications

Enhanced Understanding of Diffision ODEs: By providing a more granular solution formulation for diffusion ODEs, the paper paves the way for a deeper understanding of DPM dynamics and the role of neural networks in generative processes.
Framework for Further Optimizations: The introduction of high-order methods and adaptive step sizing furnishes a framework that can be expanded for even more sophisticated DPMs in the future.

Future Developments

The paper opens several avenues for future exploration:

Extending the solver's applicability to other types of foundational models beyond DPMs, such as GANs or VAEs.
Investigating even higher-order solvers that might bring additional speed and accuracy benefits.
Developing hybrid solvers that combine the strengths of DPM-Solver with existing methods like training-based approaches for even faster and flexible generative models.

In conclusion, "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" is a significant contribution that addresses the critical bottleneck of sampling efficiency in DPMs. By introducing a specialized, high-order solver with adaptive step sizing, it marks a pivotal step forward in making generative models both high-performing and computationally efficient.

PDF Markdown

Related Papers

Find Related Papers