Patch-Wise Blurring Diffusion
- Patch-wise blurring diffusion is a framework that restores images by applying independent diffusion processes to overlapping patches, addressing localized noise and artifacts.
- It employs a denoising diffusion probabilistic model with a guided reverse diffusion step that integrates measurement conditioning for unified tasks including denoising, deblurring, and super-resolution.
- The method is particularly effective in thermal imaging, managing challenges like resolution loss and fixed pattern noise while utilizing limited and non-diverse training data.
Patch-wise blurring diffusion is a framework for image restoration that applies diffusion processes independently to spatially overlapping patches of an image, enabling localized modeling of noise and degradation artifacts. The approach was formalized for thermal imaging applications in the TDiff method, which addresses resolution loss, fixed pattern noise, and localized artifacts commonly found in thermal images from low-cost cameras. By modeling the diffusion prior at the patch level and integrating mechanisms for inverse problem guidance, patch-wise blurring diffusion achieves unified restoration across denoising, deblurring, and super-resolution tasks while managing limited and non-diverse training data (Dashpute et al., 7 Oct 2025).
1. Patch-based Diffusion Process
Patch-wise blurring diffusion decomposes a high-resolution image into a set of overlapping patches using an extraction operator . Each patch is of size and is processed independently through a denoising diffusion probabilistic model (DDPM) framework. For each patch , the forward diffusion process is defined as a Markov chain: where the schedule defines the noise variance and , . The process runs from to , and the perturbed patch at time 0 can be sampled directly from 1: 2 The diffusion process is applied to each patch independently, leveraging the localized nature of distortions commonly observed in thermal imaging (Dashpute et al., 7 Oct 2025).
2. Reverse Diffusion and Denoising
The reverse diffusion process, or denoising step, aims to recover clean patches from noisy observations by modeling: 3 where the mean 4 depends on a learned noise predictor 5 realized by a time-conditional U-Net: 6 The objective for training 7 on each patch is: 8 This approach enables learning a prior over small localized regions, allowing effective denoising and restoration of localized degradations (Dashpute et al., 7 Oct 2025).
3. Patch Extraction, Tiling, and Reconstruction
The image is partitioned into a tiled grid of overlapping patches:
- For an image of width 9 and height 0, with patch size 1 and stride 2, the number of patches horizontally is 3, vertically 4, and total patches 5.
- Each patch 6 is indexed by starting coordinates 7, determined by:
8
- Patches are extracted such that 9, for 0.
Overlapping denoised patches are reassembled into a full-resolution image via smooth windowing and normalization to avoid seams. The window function is the 2D raised-cosine (Hann) window: 1 Reconstruction is performed by weighted averaging: 2
3
4. Inverse Problem Guidance and Measurement Conditioning
For applications such as deblurring or general inverse imaging, patch-wise blurring diffusion integrates explicit measurement guidance during reverse diffusion. Given degradation 4 (with 5 representing a linear degradation, e.g., blur or downsampling), at each time 6 an estimated clean patch is computed: 7 Measurement-consistent updates are enforced per patch via:
- Back-projection: 8
- Least squares correction: 9
The guided reverse step updates each 0 by incorporating a weighted combination of 1 and 2, controlled by a time-dependent parameter 3: 4 This mechanism allows the framework to act as a plug-and-play prior for inverse problems within the diffusion process (Dashpute et al., 7 Oct 2025).
5. Architectural and Training Details
The core denoiser network 5 is a grayscale, time-conditional U-Net, parameterized as follows:
- Base channel count: 6 for 7, 8 for 9
- Channel multipliers: 0
- Sinusoidal timestep embeddings are added at every resolution
For deblurring, the blur kernel or its frequency response is included as a second input channel, or provided via cross-attention at the bottleneck. The forward and reverse diffusion employ a schedule 1, 2, 3 steps. Training uses the Adam optimizer with learning rate 4 and batch size approximately 5 patches (Dashpute et al., 7 Oct 2025).
6. Inference Pipeline
At inference, the full restoration process proceeds as:
- Initialize 6 as independent Gaussian noise.
- For 7:
- Extract patches 8 for all 9.
- For each 0, compute 1 and 2.
- Compute 3, 4 for each patch using local measurements.
- Update 5 via the guided reverse step.
- Merge patches by windowed average to obtain 6.
- After 7, the restored image 8 is obtained.
This patch-based diffusion with smooth blending is directly implementable in PyTorch or TensorFlow and realizes the TDiff restoration pipeline (Dashpute et al., 7 Oct 2025).
7. Significance and Applications
Patch-wise blurring diffusion, as instantiated in TDiff, is the first framework to apply a learned diffusion prior at the patch level for thermal image restoration across multiple tasks and measurement settings. The approach leverages local structure for robust restoration on limited training data, provides a unified pipeline for denoising, deblurring, and super-resolution, and enables consistent restoration even under real measurement conditions. Its generality suggests applicability beyond thermal to other imaging modalities exhibiting local, patch-dependent degradation (Dashpute et al., 7 Oct 2025).