PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance (2309.10810v1)
Abstract: Exploiting pre-trained diffusion models for restoration has recently become a favored alternative to the traditional task-specific training approach. Previous works have achieved noteworthy success by limiting the solution space using explicit degradation models. However, these methods often fall short when faced with complex degradations, which generally cannot be precisely modeled. In this paper, we propose PGDiff by introducing partial guidance, a fresh perspective that is more adaptable to real-world degradations than existing works. Rather than specifically defining the degradation process, our approach models the desired properties of high-quality images, such as image structure and color statistics, and applies this guidance during the reverse diffusion process. These properties are readily available and make no assumptions about the degradation process. When combined with a diffusion prior, this partial guidance can deliver appealing results across a range of restoration tasks. Additionally, PGDiff can be extended to handle composite tasks by integrating the guidance from the respective tasks, thereby consolidating multiple high-quality image properties. Experimental results demonstrate that our method not only outperforms existing diffusion-prior-based approaches but also competes favorably with task-specific models.
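To make the idea of guiding on a property rather than on a degradation model concrete, here is a minimal toy sketch in NumPy. It is not the authors' implementation and contains no diffusion model: `x0_hat` merely stands in for a denoiser's clean-image estimate at one reverse step, the property is assumed to be the per-channel color mean, and the names `color_mean_loss_and_grad`, `apply_partial_guidance`, `target_mean`, and `scale` are hypothetical.

```python
import numpy as np


def color_mean_loss_and_grad(x0_hat, target_mean):
    """Squared error between per-channel means and a target, plus its gradient.

    x0_hat: array of shape (H, W, 3), a stand-in for the clean-image estimate.
    target_mean: array of shape (3,), the desired color statistic (assumed).
    """
    diff = x0_hat.mean(axis=(0, 1)) - target_mean          # shape (3,)
    loss = float(np.sum(diff ** 2))
    n_pixels = x0_hat.shape[0] * x0_hat.shape[1]
    # d(loss)/d(x[i, j, c]) = 2 * diff[c] / n_pixels for every pixel (i, j).
    grad = np.broadcast_to(2.0 * diff / n_pixels, x0_hat.shape).copy()
    return loss, grad


def apply_partial_guidance(x0_hat, target_mean, scale):
    """One gradient step that nudges only the guided property toward the target."""
    loss, grad = color_mean_loss_and_grad(x0_hat, target_mean)
    return x0_hat - scale * grad, loss


rng = np.random.default_rng(0)
x = rng.uniform(0.0, 1.0, size=(8, 8, 3))                   # toy "image"
structure_before = x - x.mean(axis=(0, 1))                  # per-pixel deviations
target = np.array([0.6, 0.5, 0.4])
scale = 0.25 * x.shape[0] * x.shape[1]                      # halves the mean gap per step

losses = []
for _ in range(5):                                          # stand-in for reverse steps
    x, loss = apply_partial_guidance(x, target, scale)
    losses.append(loss)

structure_after = x - x.mean(axis=(0, 1))
```

In this toy setting the guidance constrains only the color mean, so the per-pixel deviations from the mean are left untouched. That is the sense in which the guidance is partial: in the method described above, the remaining content would be supplied by the diffusion prior rather than by the guidance.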
Authors:
- Peiqing Yang
- Shangchen Zhou
- Qingyi Tao
- Chen Change Loy