
Denoising Diffusion Implicit Models (DDIM)

Updated 19 January 2026
  • DDIM is a deterministic, non-Markovian sampler that reparameterizes diffusion models to accelerate reverse generation while preserving sample fidelity.
  • It reuses the standard DDPM training objective and sets the per-step sampling noise scale to zero, yielding reproducible outputs and flexible conditioning without retraining.
  • Empirical studies show DDIM achieves up to 100× speedup in tasks like image synthesis and tabular imputation with only marginal quality degradation.

Denoising Diffusion Implicit Models (DDIM) define a class of non-Markovian generative samplers for diffusion-based probabilistic modeling. Introduced to accelerate the sampling process in denoising diffusion probabilistic models (DDPM), DDIM achieves high-quality sample generation at a fraction of the computational cost required by Markovian reverse chains, while retaining the original training paradigms and offering new algorithmic flexibility. The deterministic, implicit nature of DDIM updates produces efficient and reproducible generation, enabling applications across domains such as image synthesis, tabular data imputation, inverse problems, and structural design.

1. Theoretical Formulation and Sampling Principle

DDIM emerges from a re-examination of the DDPM framework, which defines a forward (noising) process

q(x_t \mid x_{t-1}) = \mathcal{N}\!\left(x_t;\ \sqrt{\alpha_t}\,x_{t-1},\ (1-\alpha_t)I\right)

and a reverse (denoising) process for sample generation. By focusing on the marginals q(x_t \mid x_0) rather than only the stepwise transitions, DDIM reparameterizes the reverse process as an implicit, possibly non-Markovian chain. For the deterministic case (noise scale \sigma_t = 0), the update is

x_{t-1} = \sqrt{\bar{\alpha}_{t-1}}\,\hat{x}_0(x_t) + \sqrt{1-\bar{\alpha}_{t-1}}\,\epsilon_\theta(x_t, t)

with

\hat{x}_0(x_t) = \frac{x_t - \sqrt{1-\bar{\alpha}_t}\,\epsilon_\theta(x_t, t)}{\sqrt{\bar{\alpha}_t}}

where \epsilon_\theta is a neural network trained exactly as in DDPM, and \bar{\alpha}_t = \prod_{i=1}^t \alpha_i.

By appropriately subsampling the time sequence \{t\}, DDIM can generate samples in S \ll T steps, typically achieving orders-of-magnitude speedups in wall-clock inference time relative to standard DDPM sampling (Song et al., 2020, Shah et al., 2024).
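The deterministic update and step subsampling can be summarized in a short sketch. Here eps_model stands in for the trained noise-prediction network, and the linear β schedule and step counts are illustrative assumptions rather than values from the cited papers.

```python
# Minimal sketch of a deterministic DDIM sampler (sigma_t = 0).
# Assumptions: `eps_model(x_t, t) -> eps_hat` stands in for the trained
# noise-prediction network; the linear beta schedule and step counts are
# illustrative defaults, not values taken from the cited papers.
import torch

T = 1000                                        # training-time diffusion steps
betas = torch.linspace(1e-4, 0.02, T)           # assumed linear schedule
alpha_bar = torch.cumprod(1.0 - betas, dim=0)   # \bar{alpha}_t = prod_i alpha_i

@torch.no_grad()
def ddim_sample(eps_model, shape, num_steps=50):
    """Draw samples in num_steps << T deterministic DDIM steps."""
    ts = torch.linspace(T - 1, 0, num_steps).long()   # coarse reverse-time grid
    x = torch.randn(shape)                            # x_T ~ N(0, I)
    for i, t in enumerate(ts):
        eps = eps_model(x, t)
        # Predicted clean sample \hat{x}_0(x_t).
        x0_hat = (x - torch.sqrt(1.0 - alpha_bar[t]) * eps) / torch.sqrt(alpha_bar[t])
        # \bar{alpha} at the next (earlier) grid point; -> 1 as t -> 0.
        ab_prev = alpha_bar[ts[i + 1]] if i + 1 < len(ts) else torch.tensor(1.0)
        # Deterministic update: no fresh noise is injected at any step.
        x = torch.sqrt(ab_prev) * x0_hat + torch.sqrt(1.0 - ab_prev) * eps
    return x
```

Because no noise is injected after x_T is drawn, re-running the loop from the same x_T reproduces the output exactly, which is the reproducibility property discussed in the next section.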

2. Deterministic and Non-Markovian Generative Trajectories

A central feature of DDIM is its non-Markovian implicit sampler, obtained by setting the noise scale to zero (\sigma_t = 0), which yields a completely deterministic trajectory for a given x_T. The resulting backward maps x_{t-1} = f_\theta(x_t, t) no longer require the stochastic sampling at each step that characterizes DDPM chains. This property enables precise sample reproducibility and makes the method suitable for applications requiring stable outputs, such as tabular imputation and data augmentation (Zhou et al., 5 Aug 2025, Zhang et al., 2024).

When subsampling is introduced (i.e., skipping steps according to a coarser time grid), the DDIM update remains valid, preserving the correct marginal structure and further accelerating sampling. The update formula is robust to both linear and nonlinear diffusion schedules, and is recovered from continuous SDE/ODE perspectives as a first-order exponential integrator of the so-called probability flow ODE (Zhang et al., 2022, Permenter et al., 2023).
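To make the integrator view concrete, a standard change of variables (a sketch; the rescaled coordinates \bar{x}_t = x_t / \sqrt{\bar{\alpha}_t} and \bar{\sigma}_t = \sqrt{1-\bar{\alpha}_t} / \sqrt{\bar{\alpha}_t} are introduced here only for illustration) rewrites the deterministic update as

\bar{x}_{t-1} = \hat{x}_0(x_t) + \bar{\sigma}_{t-1}\,\epsilon_\theta(x_t, t) = \bar{x}_t + \left(\bar{\sigma}_{t-1} - \bar{\sigma}_t\right)\epsilon_\theta(x_t, t)

i.e., a single Euler step of d\bar{x}/d\bar{\sigma} = \epsilon_\theta; the linear drift of the probability flow ODE is absorbed exactly by the rescaling, which is the exponential-integrator aspect.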

3. Training Paradigm and Conditional Extensions

DDIM relies on the standard DDPM training paradigm, in which \epsilon_\theta is optimized via a denoising score-matching loss:

\mathcal{L}(\theta) = \mathbb{E}_{x_0, \epsilon, t}\left[\|\epsilon - \epsilon_\theta(x_t, t)\|_2^2\right], \qquad x_t = \sqrt{\bar{\alpha}_t}\, x_0 + \sqrt{1 - \bar{\alpha}_t}\, \epsilon

No retraining or modification of the objective is needed to deploy DDIM as a replacement for ancestral DDPM sampling (Song et al., 2020). Extensions to the conditional setting are direct: conditioning information (e.g., class labels, observed feature vectors, or physical parameters) is fed to \epsilon_\theta at both training and inference time.
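A compact sketch of this objective (the loss itself is the standard ε-prediction MSE; eps_model and the batching conventions are assumptions for illustration):

```python
# Standard DDPM noise-prediction loss; a model trained this way can later be
# sampled with either ancestral DDPM steps or deterministic DDIM steps.
# `eps_model` is an assumed trainable network; `alpha_bar` is the same
# cumulative-product schedule used at sampling time.
import torch

def ddpm_loss(eps_model, x0, alpha_bar):
    """Denoising score-matching loss E[ ||eps - eps_theta(x_t, t)||^2 ]."""
    t = torch.randint(0, len(alpha_bar), (x0.shape[0],), device=x0.device)
    eps = torch.randn_like(x0)
    ab = alpha_bar[t].view(-1, *([1] * (x0.ndim - 1)))       # broadcast over data dims
    x_t = torch.sqrt(ab) * x0 + torch.sqrt(1.0 - ab) * eps   # forward noising of x0
    return torch.mean((eps - eps_model(x_t, t)) ** 2)
```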

This framework underpins scalable tabular data imputers, where missing values are denoised deterministically while observed features are clamped throughout the reverse chain, as in MissDDIM (Zhou et al., 5 Aug 2025). For classifier-free guided generation, a convex combination of conditional/unconditional noise predictions is employed to amplify the influence of specific conditioning variables during inference (Shah et al., 2024).
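The two conditioning mechanisms can be sketched as follows. The clamping scheme shown here (resetting observed entries to their known values after each step) is one simple deterministic variant in the spirit of MissDDIM, not necessarily the exact procedure of the paper; the guidance weight w and the function names are likewise illustrative.

```python
# Hedged sketches of (a) classifier-free-style guidance and (b) imputation by
# clamping observed features during the deterministic reverse chain.
# `eps_model`, the mask convention, and the weight w are illustrative assumptions.
import torch

def guided_eps(eps_model, x_t, t, cond, w=2.0):
    """Combine conditional and unconditional noise predictions to amplify conditioning."""
    eps_cond = eps_model(x_t, t, cond)
    eps_uncond = eps_model(x_t, t, None)
    return eps_uncond + w * (eps_cond - eps_uncond)

def clamped_ddim_step(eps_model, x_t, t, ab_t, ab_prev, x_obs, obs_mask):
    """One sigma_t = 0 DDIM step that holds observed entries at their known values.

    ab_t and ab_prev are the scalar tensors \bar{alpha}_t and \bar{alpha}_{t-1};
    obs_mask is True where a feature is observed. Other published schemes instead
    re-noise the observed block to the current noise level before clamping.
    """
    eps = eps_model(x_t, t)
    x0_hat = (x_t - torch.sqrt(1.0 - ab_t) * eps) / torch.sqrt(ab_t)
    x_prev = torch.sqrt(ab_prev) * x0_hat + torch.sqrt(1.0 - ab_prev) * eps
    return torch.where(obs_mask, x_obs, x_prev)
```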

4. Computational Efficiency, Latency, and Sample Quality

The principal motivation for DDIM is acceleration of the reverse generation process. Empirical studies across image and tabular benchmarks report the following representative figures:

| Sampler | Steps per sample | Samples averaged | Typical quality | Wall-clock speedup |
|---|---|---|---|---|
| DDPM | 1000 | 1 | FID 10–50 (ImageNet) | baseline (1×) |
| DDIM | 10–50 | 1 | FID 10–14 (ImageNet, 10 steps) | 10×–100× faster |
| GMM-DDIM | 10–50 | 1 | FID 6.94 (ImageNet, 10 steps) | 0.9×–1.1× of DDIM |
| DDIM-based imputer | 20–100 | 1 | RMSE 0.3051 (tabular, 30% missing) | 5×–10× faster than stochastic methods |

Reducing the number of reverse steps from T to S can degrade sample quality, but the deterministic structure of DDIM keeps this degradation modest; DDPM-style stochastic sampling degrades far more severely when S \ll T (Song et al., 2020, Gabbur, 2023, Zhou et al., 5 Aug 2025).

In tabular imputation, MissDDIM matches or exceeds the imputation performance (RMSE, weighted F1, or MAE) of stochastic DDPM-style imputers, while eliminating sampling variance and reducing inference time by an order of magnitude (Zhou et al., 5 Aug 2025). In generative design optimization (e.g., AIBIM), DDIM enables a 100× reduction in design-generation time while preserving output quality (He et al., 2024).

5. Algorithmic Extensions and Multi-Domain Applications

Recent research extends the DDIM principle in multiple directions:

  • Moment-Matching Reverse Kernels: Replacing the single Gaussian transition in DDIM with a Gaussian mixture model (GMM) whose parameters are chosen by moment-matching yields improved sample quality, particularly in few-step regimes (FID: 6.94 with GMM-DDIM vs. 10.15 for DDIM at 10 steps on ImageNet 256x256) (Gabbur, 2023).
  • Shortest-Path Residual Optimization: Framing the denoising process as a shortest path in residual error space, ShortDF uses relaxation steps akin to Bellman–Ford updates to minimize accumulated reconstruction error and achieve further reduction in required steps (e.g., 2 steps with FID=9.08 on CIFAR-10, vs. 10 steps, FID=11.14 for vanilla DDIM) (Chen et al., 5 Mar 2025).
  • Conditional and Constrained Generation: DDIM underpins efficient solvers for linear inverse problems, multivariate imputation, and image inpainting, where its deterministic nature is leveraged to enforce constraints on outputs either exactly in noiseless settings or as moment/KL-matched distributions in the presence of noise (Jayaram et al., 2024, Zhang et al., 2023).
  • Generalized Diffusions and ODE Solvers: The gDDIM framework enables deterministic acceleration for non-isotropic diffusion models, including those with nonlinear SDE structure, via exponential-integrator updates and matrix-reparameterized scores (e.g., blurring and critically-damped Langevin processes) (Zhang et al., 2022).
  • Spiking Neural Network Realizations: FSDDIM adapts DDIM updates for spiking neural network architectures, combining synaptic current learning with deterministic denoising to yield the first fully-spiking diffusion generator, surpassing all prior spiking baselines in FID/FAD and reducing arithmetic cost by more than 70% (Watanabe et al., 2023).

6. Empirical Results, Benchmarks, and Practical Insights

Extensive benchmarking on image generation (CIFAR-10, ImageNet, FFHQ), tabular UCI/Kaggle datasets, medical imaging (MS lesion filling/synthesis), and inverse problems demonstrates that DDIM delivers an optimal trade-off between speed and sample fidelity under deterministic or near-deterministic settings.

  • In image generation, DDIM efficiently traverses S \ll T steps with only a marginal increase in FID (e.g., FID ≈ 4.2 for S = 100 and ≈ 6.8 for S = 20 on CIFAR-10 (Song et al., 2020)), with further gains under moment-matched or shortest-path relaxation strategies (Gabbur, 2023, Chen et al., 5 Mar 2025).
  • In tabular imputation, MissDDIM achieves RMSE 0.3051 (30% missing) and weighted F1 outperforming stochastic CSDI/TabCSDI/MissDiff, with a 10× runtime reduction (Zhou et al., 5 Aug 2025).
  • In conditional and constrained settings, DDIM variants support exact constraint satisfaction, coherent inpainting, or hybrid plug-and-play restoration, providing performance superior to or on par with ancestral or Langevin-based approaches at a small fraction of the computation (Jayaram et al., 2024, Mbakam et al., 2024, Zhang et al., 2023).

A key practical recommendation is to choose the smallest number of sampling steps S that yields the target fidelity for the downstream task, to use the deterministic mode (i.e., zero additional stochasticity), and to pick noise schedules (e.g., cosine or learned schedules) suited to the modality and the desired trade-off (Shah et al., 2024, He et al., 2024).
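As one concrete example of such a schedule choice, a sketch of the widely used cosine \bar{\alpha} schedule (an assumption for illustration; the cited works may use different schedules and offsets):

```python
# Cosine \bar{alpha} schedule (Nichol & Dhariwal style) as one example of a
# modality-dependent schedule choice; s = 0.008 is the common default offset,
# not a value taken from the cited papers.
import torch

def cosine_alpha_bar(T, s=0.008):
    t = torch.arange(T + 1) / T
    f = torch.cos((t + s) / (1 + s) * torch.pi / 2) ** 2
    return f[1:] / f[0]              # \bar{alpha}_t for t = 1..T, decreasing toward 0

alpha_bar = cosine_alpha_bar(1000)   # pair with eta = 0 and the smallest S that
                                     # meets the target fidelity for the task
```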

7. Limitations and Future Research Directions

Despite its acceleration and flexibility, DDIM retains some limitations:

  • It is not a likelihood model; \log p_\theta(x_0) is generally intractable, precluding direct use in maximum-likelihood pipelines (Song et al., 2020).
  • Deterministic sampling can reduce output diversity unless the terminal noise x_T is randomized or partial noise is reintroduced (\eta > 0; see the update written out after this list) (Shah et al., 2024).
  • In few-step regimes, certain high-order statistics or perceptual features may degrade unless extended with moment-matching, distillation, or relaxation steps (Gabbur, 2023, Berthelot et al., 2023, Chen et al., 5 Mar 2025).
  • For nonlinear or structural constraints beyond linear inverse problems, efficient integration in a deterministic sampler remains an open area (Jayaram et al., 2024).
  • Extension to non-isotropic or SDE-augmented processes requires specialized reparameterization, as in gDDIM (Zhang et al., 2022).
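For reference, the \eta-parameterized family of samplers from Song et al. (2020), referenced in the second bullet above, interpolates between the two regimes (written here as a sketch in the notation of Section 1):

\sigma_t(\eta) = \eta\,\sqrt{\frac{1-\bar{\alpha}_{t-1}}{1-\bar{\alpha}_t}}\,\sqrt{1 - \frac{\bar{\alpha}_t}{\bar{\alpha}_{t-1}}}, \qquad x_{t-1} = \sqrt{\bar{\alpha}_{t-1}}\,\hat{x}_0(x_t) + \sqrt{1-\bar{\alpha}_{t-1}-\sigma_t^2}\,\epsilon_\theta(x_t, t) + \sigma_t z, \quad z \sim \mathcal{N}(0, I)

Setting \eta = 0 recovers the deterministic DDIM update, while \eta = 1 recovers DDPM-like ancestral stochasticity.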

Active research explores structure-aware schedules, meta-learned jump layouts, hybrid moment/constraint-enforcing kernels, and domain-adapted DDIM variants, reflecting DDIM's centrality as a computational backbone for modern generative modeling across data types and problem settings.
