ENTED: Enhanced Neural Texture Extraction and Distribution for Reference-based Blind Face Restoration (2401.06978v1)
Abstract: We present ENTED, a new framework for blind face restoration that aims to restore high-quality and realistic portrait images. Our method involves repairing a single degraded input image using a high-quality reference image. We utilize a texture extraction and distribution framework to transfer high-quality texture features between the degraded input and reference image. However, the StyleGAN-like architecture in our framework requires high-quality latent codes to generate realistic images. The latent code extracted from the degraded input image often contains corrupted features, making it difficult to align the semantic information from the input with the high-quality textures from the reference. To overcome this challenge, we employ two special techniques. The first technique, inspired by vector quantization, replaces corrupted semantic features with high-quality code words. The second technique generates style codes that carry photorealistic texture information from a more informative latent space developed using the high-quality features in the reference image's manifold. Extensive experiments conducted on synthetic and real-world datasets demonstrate that our method produces results with more realistic contextual details and outperforms state-of-the-art methods. A thorough ablation study confirms the effectiveness of each proposed module.
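The first technique in the abstract is described only as "inspired by vector quantization", so the following is a minimal sketch of the generic idea, not the authors' implementation. It assumes a small fixed codebook and uses a nearest-neighbour lookup under squared Euclidean distance. The names `quantize`, `codebook`, and `degraded` are hypothetical, and the values are toy data. In a real system the codebook would be learned from high-quality faces, and the features would come from an encoder.

```python
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def squared_distance(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def quantize(
    features: Sequence[Vector], codebook: Sequence[Vector]
) -> Tuple[List[int], List[List[float]]]:
    """Replace each feature vector with its nearest code word.

    Returns the chosen codebook indices and the substituted vectors.
    Illustrative only: the codebook here is fixed, not learned.
    """
    indices: List[int] = []
    quantized: List[List[float]] = []
    for feature in features:
        # Pick the index of the code word closest to this feature.
        idx = min(
            range(len(codebook)),
            key=lambda k: squared_distance(feature, codebook[k]),
        )
        indices.append(idx)
        quantized.append(list(codebook[idx]))
    return indices, quantized


# Toy "high-quality" code words (assumed values, 2-D for readability).
codebook = [[0.0, 0.0], [1.0, 1.0], [0.0, 1.0]]
# Toy "corrupted" features, each a noisy version of one code word.
degraded = [[0.1, -0.2], [0.9, 1.3], [0.2, 0.8]]

indices, restored = quantize(degraded, codebook)
print(indices)
print(restored)
```

Each noisy vector is snapped to a clean code word, which is the sense in which corrupted semantic features can be replaced by high-quality ones. How ENTED selects and trains its code words is not stated in the abstract.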