Patch-Wise Blurring Diffusion

Updated 20 March 2026

Patch-wise blurring diffusion is a framework that restores images by applying independent diffusion processes to overlapping patches, addressing localized noise and artifacts.
It employs a denoising diffusion probabilistic model with a guided reverse diffusion step that integrates measurement conditioning for unified tasks including denoising, deblurring, and super-resolution.
The method is particularly effective in thermal imaging, managing challenges like resolution loss and fixed pattern noise while utilizing limited and non-diverse training data.

Patch-wise blurring diffusion is a framework for image restoration that applies diffusion processes independently to spatially overlapping patches of an image, enabling localized modeling of noise and degradation artifacts. The approach was formalized for thermal imaging applications in the TDiff method, which addresses resolution loss, fixed pattern noise, and localized artifacts commonly found in thermal images from low-cost cameras. By modeling the diffusion prior at the patch level and integrating mechanisms for inverse problem guidance, patch-wise blurring diffusion achieves unified restoration across denoising, deblurring, and super-resolution tasks while managing limited and non-diverse training data (Dashpute et al., 7 Oct 2025).

1. Patch-based Diffusion Process

Patch-wise blurring diffusion decomposes a high-resolution image $x_0 \in \mathbb{R}^{H \times W}$ into a set of overlapping patches using an extraction operator $P_k$ . Each patch is of size $ps \times ps$ and is processed independently through a denoising diffusion probabilistic model (DDPM) framework. For each patch $k$ , the forward diffusion process is defined as a Markov chain: $q(P_k(x_t) \mid P_k(x_{t-1})) = \mathcal{N}\left(P_k(x_t); \sqrt{\alpha_t} P_k(x_{t-1}), \beta_t I_{ps^2}\right)$ where the schedule $\{\beta_t\}$ defines the noise variance and $\alpha_t = 1 - \beta_t$ , $\bar{\alpha}_t = \prod_{i=1}^t \alpha_i$ . The process runs from $t = 1$ to $T$ , and the perturbed patch at time $P_k$ 0 can be sampled directly from $P_k$ 1: $P_k$ 2 The diffusion process is applied to each patch independently, leveraging the localized nature of distortions commonly observed in thermal imaging (Dashpute et al., 7 Oct 2025).

2. Reverse Diffusion and Denoising

The reverse diffusion process, or denoising step, aims to recover clean patches from noisy observations by modeling: $P_k$ 3 where the mean $P_k$ 4 depends on a learned noise predictor $P_k$ 5 realized by a time-conditional U-Net: $P_k$ 6 The objective for training $P_k$ 7 on each patch is: $P_k$ 8 This approach enables learning a prior over small localized regions, allowing effective denoising and restoration of localized degradations (Dashpute et al., 7 Oct 2025).

3. Patch Extraction, Tiling, and Reconstruction

The image is partitioned into a tiled grid of overlapping patches:

For an image of width $P_k$ 9 and height $ps \times ps$ 0, with patch size $ps \times ps$ 1 and stride $ps \times ps$ 2, the number of patches horizontally is $ps \times ps$ 3, vertically $ps \times ps$ 4, and total patches $ps \times ps$ 5.
Each patch $ps \times ps$ 6 is indexed by starting coordinates $ps \times ps$ 7, determined by:

$ps \times ps$ 8

Patches are extracted such that $ps \times ps$ 9, for $k$ 0.

Overlapping denoised patches are reassembled into a full-resolution image via smooth windowing and normalization to avoid seams. The window function is the 2D raised-cosine (Hann) window: $k$ 1 Reconstruction is performed by weighted averaging: $k$ 2

$k$ 3

(Dashpute et al., 7 Oct 2025)

4. Inverse Problem Guidance and Measurement Conditioning

For applications such as deblurring or general inverse imaging, patch-wise blurring diffusion integrates explicit measurement guidance during reverse diffusion. Given degradation $k$ 4 (with $k$ 5 representing a linear degradation, e.g., blur or downsampling), at each time $k$ 6 an estimated clean patch is computed: $k$ 7 Measurement-consistent updates are enforced per patch via:

Back-projection: $k$ 8
Least squares correction: $k$ 9

The guided reverse step updates each $q(P_k(x_t) \mid P_k(x_{t-1})) = \mathcal{N}\left(P_k(x_t); \sqrt{\alpha_t} P_k(x_{t-1}), \beta_t I_{ps^2}\right)$ 0 by incorporating a weighted combination of $q(P_k(x_t) \mid P_k(x_{t-1})) = \mathcal{N}\left(P_k(x_t); \sqrt{\alpha_t} P_k(x_{t-1}), \beta_t I_{ps^2}\right)$ 1 and $q(P_k(x_t) \mid P_k(x_{t-1})) = \mathcal{N}\left(P_k(x_t); \sqrt{\alpha_t} P_k(x_{t-1}), \beta_t I_{ps^2}\right)$ 2, controlled by a time-dependent parameter $q(P_k(x_t) \mid P_k(x_{t-1})) = \mathcal{N}\left(P_k(x_t); \sqrt{\alpha_t} P_k(x_{t-1}), \beta_t I_{ps^2}\right)$ 3: $q(P_k(x_t) \mid P_k(x_{t-1})) = \mathcal{N}\left(P_k(x_t); \sqrt{\alpha_t} P_k(x_{t-1}), \beta_t I_{ps^2}\right)$ 4 This mechanism allows the framework to act as a plug-and-play prior for inverse problems within the diffusion process (Dashpute et al., 7 Oct 2025).

5. Architectural and Training Details

The core denoiser network $q(P_k(x_t) \mid P_k(x_{t-1})) = \mathcal{N}\left(P_k(x_t); \sqrt{\alpha_t} P_k(x_{t-1}), \beta_t I_{ps^2}\right)$ 5 is a grayscale, time-conditional U-Net, parameterized as follows:

Base channel count: $q(P_k(x_t) \mid P_k(x_{t-1})) = \mathcal{N}\left(P_k(x_t); \sqrt{\alpha_t} P_k(x_{t-1}), \beta_t I_{ps^2}\right)$ 6 for $q(P_k(x_t) \mid P_k(x_{t-1})) = \mathcal{N}\left(P_k(x_t); \sqrt{\alpha_t} P_k(x_{t-1}), \beta_t I_{ps^2}\right)$ 7, $q(P_k(x_t) \mid P_k(x_{t-1})) = \mathcal{N}\left(P_k(x_t); \sqrt{\alpha_t} P_k(x_{t-1}), \beta_t I_{ps^2}\right)$ 8 for $q(P_k(x_t) \mid P_k(x_{t-1})) = \mathcal{N}\left(P_k(x_t); \sqrt{\alpha_t} P_k(x_{t-1}), \beta_t I_{ps^2}\right)$ 9
Channel multipliers: $\{\beta_t\}$ 0
Sinusoidal timestep embeddings are added at every resolution

For deblurring, the blur kernel or its frequency response is included as a second input channel, or provided via cross-attention at the bottleneck. The forward and reverse diffusion employ a schedule $\{\beta_t\}$ 1, $\{\beta_t\}$ 2, $\{\beta_t\}$ 3 steps. Training uses the Adam optimizer with learning rate $\{\beta_t\}$ 4 and batch size approximately $\{\beta_t\}$ 5 patches (Dashpute et al., 7 Oct 2025).

6. Inference Pipeline

At inference, the full restoration process proceeds as:

Initialize $\{\beta_t\}$ 6 as independent Gaussian noise.
For $\{\beta_t\}$ ${β_{t}}$ 7:
- Extract patches $\{\beta_t\}$ 8 for all $\{\beta_t\}$ 9.
- For each $\alpha_t = 1 - \beta_t$ 0, compute $\alpha_t = 1 - \beta_t$ 1 and $\alpha_t = 1 - \beta_t$ 2.
- Compute $\alpha_t = 1 - \beta_t$ 3, $\alpha_t = 1 - \beta_t$ 4 for each patch using local measurements.
- Update $\alpha_t = 1 - \beta_t$ 5 via the guided reverse step.
- Merge patches by windowed average to obtain $\alpha_t = 1 - \beta_t$ 6.
After $\alpha_t = 1 - \beta_t$ 7, the restored image $\alpha_t = 1 - \beta_t$ 8 is obtained.

This patch-based diffusion with smooth blending is directly implementable in PyTorch or TensorFlow and realizes the TDiff restoration pipeline (Dashpute et al., 7 Oct 2025).

7. Significance and Applications

Patch-wise blurring diffusion, as instantiated in TDiff, is the first framework to apply a learned diffusion prior at the patch level for thermal image restoration across multiple tasks and measurement settings. The approach leverages local structure for robust restoration on limited training data, provides a unified pipeline for denoising, deblurring, and super-resolution, and enables consistent restoration even under real measurement conditions. Its generality suggests applicability beyond thermal to other imaging modalities exhibiting local, patch-dependent degradation (Dashpute et al., 7 Oct 2025).

Markdown Report Issue Upgrade to Chat

References (1)

TDiff: Thermal Plug-And-Play Prior with Patch-Based Diffusion (2025)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Patch-Wise Blurring Diffusion.