Robust Unsupervised StyleGAN Image Restoration (2302.06733v2)
Abstract: GAN-based image restoration inverts the generative process to repair images corrupted by known degradations. Existing unsupervised methods must be carefully tuned for each task and degradation level. In this work, we make StyleGAN image restoration robust: a single set of hyperparameters works across a wide range of degradation levels. This makes it possible to handle combinations of several degradations without the need to retune. Our proposed approach relies on a three-phase progressive latent space extension and a conservative optimizer, which avoids the need for any additional regularization terms. Extensive experiments demonstrate robustness on inpainting, upsampling, denoising, and deartifacting at varying degradation levels, outperforming other StyleGAN-based inversion techniques. Our approach also compares favorably to diffusion-based restoration by yielding much more realistic inversion results. Code is available at https://lvsn.github.io/RobustUnsupervised/.
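To make the first sentence of the abstract concrete, here is a minimal sketch of restoration by inversion through a known degradation. It is not the paper's method. A toy linear "generator" stands in for StyleGAN, plain gradient descent stands in for the paper's conservative optimizer, and the three-phase latent extension is not modelled. All names (`generator`, `degrade`, `restore`, `BASIS`, `MEAN`) are hypothetical and chosen only for illustration.

```python
# Toy illustration of restoration by inversion (assumed setup, not the paper's code).
# We search for a latent w such that degrade(generator(w)) matches the observation y,
# then return generator(w) as the restored signal.

MEAN = [0.5] * 8
BASIS = [
    [1, 1, 1, 1, 0, 0, 0, 0],
    [0, 0, 1, 1, 1, 1, 0, 0],
]

def generator(w, basis=BASIS, mean=MEAN):
    """Toy linear 'generator': mean plus a weighted sum of basis signals."""
    return [m + sum(wk * b[i] for wk, b in zip(w, basis))
            for i, m in enumerate(mean)]

def degrade(x):
    """Known degradation: 2x downsampling by averaging adjacent samples."""
    return [(x[i] + x[i + 1]) / 2.0 for i in range(0, len(x), 2)]

def restore(y, basis=BASIS, mean=MEAN, steps=300, lr=0.1):
    """Minimise ||degrade(generator(w)) - y||^2 over w by gradient descent."""
    w = [0.0] * len(basis)
    # degrade() is linear, so d(prediction)/d(w[k]) is the degraded k-th basis signal.
    degraded_basis = [degrade(b) for b in basis]
    for _ in range(steps):
        pred = degrade(generator(w, basis, mean))
        resid = [p - t for p, t in zip(pred, y)]
        grad = [2.0 * sum(r * d for r, d in zip(resid, db))
                for db in degraded_basis]
        w = [wk - lr * g for wk, g in zip(w, grad)]
    return w, generator(w, basis, mean)

# Simulate a clean signal, degrade it, then restore it from the degraded observation.
w_true = [0.7, -0.3]
x_true = generator(w_true)
y = degrade(x_true)
w_hat, x_hat = restore(y)
print("recovered latent:", w_hat)
```

In this toy setting the clean signal lies exactly in the generator's range and the degradation keeps enough information to identify the latent, so the restoration matches the original. With a real generator and stronger degradations neither assumption holds in general, which is the regime where the choice of latent space and optimizer becomes important.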