Third-Order DPM-Solver++ Overview
- Third-order DPM-Solver++ is a conceptual extension of the DPM-Solver++ framework, pushing beyond the second-order limit to achieve higher global accuracy in ODE solvers for diffusion models.
- The methodology builds on high-order Taylor expansions and adaptive step size control, similar to DPM-Solver-23, to potentially reduce function evaluations while maintaining theoretical accuracy.
- Practical implementations recommend second-order methods for guided sampling due to observed instabilities at high guidance scales, leaving third-order derivatives as a promising open research problem.
Third-order DPM-Solver++ refers to the (hypothetical) extension of the DPM-Solver++ algorithm, a class of ODE solvers for efficient sampling in diffusion probabilistic models (DPMs), to third-order global accuracy. While DPM-Solver++ is designed to accelerate both unguided and guided sampling from DPMs by leveraging high-order numerical integration, the DPM-Solver++ framework as described in the foundational literature only explicitly establishes and analyzes solvers up to second order (k=2), with the authors cautioning that solvers of order k ≥ 3 remain an open research problem, particularly due to observed instabilities at high guidance scales (Lu et al., 2022). In contrast, the third-order high-accuracy adaptive solver is fully detailed for the original DPM-Solver family (Lu et al., 2022), encompassing both theoretical derivation and implementation aspects.
1. Theoretical Foundation and ODE Formulation
DPM-Solver++ operates on the continuous-time denoising ODE associated with diffusion probabilistic models. This ODE is parameterized either by a noise prediction model or a data prediction model :
- Noise prediction parameterization:
- Data prediction parameterization:
Here, is the diffusion time, and relate to the variance-preserving (VP) SDE parameters, and (SNR) is monotonically decreasing (Lu et al., 2022). The ODE admits an exact integral solution that forms the basis for constructing high-order numerical solvers (Lu et al., 2022).
2. High-Order ODE Solvers: DPM-Solver and DPM-Solver++
The DPM-Solver framework provides a general recipe for constructing explicit high-order, stiffly accurate exponential integrators for the denoising ODE. For order-, this involves a Taylor expansion of the neural network prediction () up to derivatives in a log-SNR parameterization ():
- General update:
where are exponential integrator kernels, , and denotes the -th derivative at (Lu et al., 2022).
DPM-Solver++ adapts these ideas, employing the data-prediction parameterization, dynamic thresholding (clipping to bounded ranges), and multistep formulations to address instabilities observed in guided sampling (Lu et al., 2022). However, the literature provides explicit derivations only for (second order), not .
3. Third-Order Methods: DPM-Solver vs. DPM-Solver++
The third-order DPM-Solver (DPM-Solver-3) is fully developed in (Lu et al., 2022), offering explicit update formulas, rigorous global accuracy proofs, and practical pseudocode. The solver uses intermediate sub-steps at and of each time interval to avoid explicit computation of neural network derivatives, and these sub-steps are analytically constructed to satisfy all order conditions. The essential steps are summarized in the following table:
| Algorithmic Component | DPM-Solver-3 | DPM-Solver++ (k=3) |
|---|---|---|
| Update formula | Explicit, with two intermediate sub-steps | Not present in literature |
| Pseudocode | Fully specified | Not specified for |
| Global accuracy proof | Not available for | |
| Guidance/thresholding | Works for basic and moderate guidance | Instability at large guidance, not pursued (Lu et al., 2022) |
A plausible implication is that, although the generalized Taylor-expansion template for arbitrary is shown for DPM-Solver++ [(Lu et al., 2022), Eq. (7)], explicit construction of a third-order method remains prospective, and there are no published algorithms, error analyses, or empirical studies for DPM-Solver++ at .
4. Stability and Instability for High-Order Solvers
Empirical evidence in (Lu et al., 2022) demonstrates that high-order (k ≥ 3) solvers—whether from DPM-Solver, DEIS, or other families—exhibit severe numerical instability when applied to guided sampling with large classifier or classifier-free guidance scales (e.g., ). In such regimes, the solvers can produce divergence or blows up in practice, especially when attempting aggressive NFE (number of function evaluations) reduction to . This is attributed to the Lipschitz constants and local nonlinearity being amplified by the guidance signal and cannot be simply remedied via thresholding or naive step size reduction. Thus, DPM-Solver++ restricts its practical implementations and theoretical development to (Lu et al., 2022).
5. Adaptive Step Size and “++” Enhancements
Adaptive step size control (“++” enhancement) is constructed by embedding lower- and higher-order solvers (e.g., orders 2 and 3) in a stepwise local error estimate. DPM-Solver-23 performs both second- and third-order updates, forming an error estimate and updating . Steps are accepted when ; otherwise, is reduced and retried [(Lu et al., 2022), Appendix D.2]. This method yields 20–50% reductions in total NFE while maintaining full third-order accuracy. DPM-Solver++ in its current published form does not extend this adaptive machinery to third order due to the aforementioned stability problems for guidance and lack of an explicit scheme (Lu et al., 2022).
6. Practical Implications and Implementation Recommendations
- DPM-Solver++ as scientifically established supports single- and multistep methods up to . For classifier-free guidance scales , multistep second-order integration is recommended as it reduces effective step size per empirical findings (Lu et al., 2022).
- Uniform (time) scheduling outperforms uniform (log-SNR) based grids in practical DPM-Solver++ usage; when thresholding is used, data-prediction models () should be preferred.
- For the original DPM-Solver framework, both fixed and adaptive third-order integration works robustly in basic DPM sampling, but guidance scenarios at high scales are not addressed.
- There is no published algorithm, pseudocode, or error analysis for DPM-Solver++ at .
7. Comparison with Prior High-Order Solvers and Open Directions
Comparative benchmarking in (Lu et al., 2022) establishes that second-order DPM-Solver++ consistently outperforms first-order methods (e.g., DDIM) at low NFE regimes ( steps) and matches or exceeds the sample quality of PNDM and similar high-order solvers, particularly when used with multistep control and dynamic thresholding. In contrast, third-order methods from prior work (DPM-Solver, DEIS) underperform at large guidance amplitudes, sometimes trailing behind even the basic first-order samplers. The extension of DPM-Solver++ to third-order and beyond, especially with empirical stability for guided DPM sampling, is explicitly left open for future research (Lu et al., 2022).