BlindDiff: Empowering Degradation Modelling in Diffusion Models for Blind Image Super-Resolution (2403.10211v1)
Abstract: Diffusion models (DM) have achieved remarkable promise in image super-resolution (SR). However, most of them are tailored to solving non-blind inverse problems with fixed known degradation settings, limiting their adaptability to real-world applications that involve complex unknown degradations. In this work, we propose BlindDiff, a DM-based blind SR method to tackle the blind degradation settings in SISR. BlindDiff seamlessly integrates the MAP-based optimization into DMs, which constructs a joint distribution of the low-resolution (LR) observation, high-resolution (HR) data, and degradation kernels for the data and kernel priors, and solves the blind SR problem by unfolding MAP approach along with the reverse process. Unlike most DMs, BlindDiff firstly presents a modulated conditional transformer (MCFormer) that is pre-trained with noise and kernel constraints, further serving as a posterior sampler to provide both priors simultaneously. Then, we plug a simple yet effective kernel-aware gradient term between adjacent sampling iterations that guides the diffusion model to learn degradation consistency knowledge. This also enables to joint refine the degradation model as well as HR images by observing the previous denoised sample. With the MAP-based reverse diffusion process, we show that BlindDiff advocates alternate optimization for blur kernel estimation and HR image restoration in a mutual reinforcing manner. Experiments on both synthetic and real-world datasets show that BlindDiff achieves the state-of-the-art performance with significant model complexity reduction compared to recent DM-based methods. Code will be available at \url{https://github.com/lifengcs/BlindDiff}
- Blind super-resolution kernel estimation using an internal-gan. In NeurIPS, 2019.
- Low-complexity single-image super-resolution based on non-negative neighbour embedding. In BMVC, pages 135.1–135.10, 2012.
- Toward real-world single image super-resolution: A new benchmark and a new model. In ICCV, pages 3086–3095, 2019.
- Activating more pixels in image super-resolution transformer. In CVPR, pages 22367–22377, 2023.
- Improving diffusion models for inverse problems using manifold constraints. In NeurIPS, 2022.
- Parallel diffusion models of operator and image for blind inverse problems. In CVPR, pages 6059–6069, 2023a.
- Diffusion posterior sampling for general noisy inverse problems. In ICLR, 2023b.
- Imagenet: A large-scale hierarchical image database. PAMI, 43(12):4217–4228, 2015.
- Image super-resolution using deep convolutional networks. PAMI, 38(2):295–307, 2015.
- Frido: Feature pyramid diffusion for complex scene image synthesis. In AAAI, pages 579–587, 2023.
- Uncertainty learning in kernel estimation for multi-stage blind image super-resolution. In ECCV, pages 144–161, 2022.
- Generative diffusion prior for unified image restoration and enhancement. In CVPR, pages 9935–9946, 2023.
- Kxnet: A model-driven deep neural network for blind super-resolution. In ECCV, pages 235–253, 2022.
- Feature distillation interaction weighting network for lightweight image super-resolution. In AAAI, pages 661–669, 2022.
- Blind super-resolution with iterative kernel correction. In CVPR, pages 1604–1613, 2019.
- Gans trained by a two time-scale update rule converge to a local nash equilibrium. In NeurIPS, 2017.
- Denoising diffusion probabilistic models. In NeurIPS, 2020.
- Tackling the ill-posedness of super-resolution through adaptive target generation. In CVPR, pages 16236–16245, 2021.
- A style-based generator architecture for generative adversarial networks. In CVPR, pages 4401–4410, 2019.
- Denoising diffusion restoration models. In ICLRW, 2022.
- Adam: A method for stochastic optimization. In ICLR, 2015.
- Learning detail-structure alternative optimization for blind super-resolution. TMM, 25:2825–2838, 2023.
- Srdiff: Single image super-resolution with diffusion probabilistic models. Neurocomputing, 479:47–59, 2022.
- Swinir: Image restoration using swin transformer. In ICCVW, pages 1833–1844, 2021a.
- Mutual affine network for spatially variant kernel estimation in blind image super-resolution. In ICCV, pages 4076–4085, 2021b.
- Flow-based kernel prior with application to blind super-resolution. In CVPR, pages 10601–10610, 2021c.
- Unfolding the alternating optimization for blind super-resolution. In NeurIPS, 2020.
- Deep constrained least squares for blind image super-resolution. In CVPR, pages 17642–17652, 2022.
- A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In ICCV, page 416, 2001.
- Ntire 2017 challenge on single image super-resolution: Methods and results. In CVPRW, pages 1110–1121, 2017.
- Hierarchical text-conditional image generation with clip latents. arXiv: 2204.06125, 2022.
- High-resolution image synthesis with latent diffusion models. In CVPR, pages 10674–0685, 2022.
- Image super-resolution via iterative refinement. PAMI, 45(04):4713–4726, 2023.
- “zero-shot” super-resolution using deep internal learning. In CVPR, pages 3118–3126, 2018.
- Deep unsupervised learning using nonequilibrium thermodynamics. In ICML, pages 2256–2265, 2015.
- Score-based generative modeling through stochastic differential equations. In ICLR, 2021.
- Spectrum-to-kernel translation for accurate blind image super-resolution. In NeurIPS, 2021.
- Unsupervised degradation representation learning for blind super-resolution. In CVPR, pages 10581–10590, 2021a.
- Esrgan: Enhanced super-resolution generative adversarial networks. In ECCVW, pages 63–79, 2018.
- Real-esrgan: Training real-world blind super-resolution with pure synthetic data. In ICCVW, pages 1905–1914, 2021b.
- Zero-shot image restoration using denoising diffusion null-space model. In ICML, 2022.
- Unpaired remote sensing image super-resolution with content-preserving weak supervision neural network. Sci. China Inf. Sci., 66(1):119105:1–119105:2, 2023.
- Kernel distillation based degradation estimation for blind super-resolution. In ICLR, 2023.
- Blind image super-resolution with elaborate degradation modeling on noise and kernel. In CVPR, pages 2128–2138, 2022.
- Multi-stage progressive image restoration. In CVPR, pages 14816–14826, 2021.
- Restormer: Efficient transformer for high-resolution image restoration. In CVPR, pages 5728–5739, 2022.
- Learning a single convolutional super-resolution network for multiple degradations. In CVPR, pages 3262–3271, 2018a.
- Deep unfolding network for image super-resolution. In CVPR, pages 3217–3226, 2020.
- Designing a practical degradation model for deep blind image super-resolution. In ICCV, pages 4771–4780, 2021.
- The unreasonable effectiveness of deep features as a perceptual metric. In CVPR, pages 586–595, 2018b.
- Image super-resolution using very deep residual channel attention networks. In ECCV, pages 294–310, 2018c.
- Learning correction filter via degradation-adaptive regression for blind single image super-resolution. In ICCV, pages 12365–12375, 2023.
- Denoising diffusion models for plug-and-play image restoration. In CVPRW, pages 1219–1229, 2023.
- Feng Li (286 papers)
- Yixuan Wu (35 papers)
- Zichao Liang (2 papers)
- Runmin Cong (59 papers)
- Huihui Bai (28 papers)
- Yao Zhao (272 papers)
- Meng Wang (1063 papers)