- The paper demonstrates that untrained ConvNet architectures act as effective handcrafted priors for image restoration tasks.
- It solves inverse problems such as denoising, super-resolution, and inpainting by reparameterizing the image as the output of a randomly initialized network and minimizing a task-specific energy.
- The study challenges traditional training paradigms by revealing that network architecture alone encapsulates key image statistics for unsupervised restoration.
Deep Image Prior: Evaluating the Implicit Power of ConvNet Architectures
The paper "Deep Image Prior" by Ulyanov, Vedaldi, and Lempitsky introduces a novel perspective on the utilization of deep convolutional networks (ConvNets) for image restoration and generation tasks. The authors challenge the conventional understanding that the efficacy of ConvNets is primarily due to learning from large datasets. Instead, they argue that the inherent structure of ConvNet architectures itself captures significant low-level image statistics, providing an inductive bias that can be leveraged independently of learned weights.
The authors first investigate the hypothesis that a randomly initialized ConvNet can serve as an effective handcrafted prior. They demonstrate that such a network, with weights fitted only to the single degraded image and never trained on external data, can successfully tackle standard inverse problems such as denoising, super-resolution, and inpainting. This finding underscores the powerful inductive biases embedded in deep ConvNet architectures.
Methodology and Experiments
To substantiate their claims, the authors formulate each restoration task as an energy minimization problem and solve it by reparameterizing the image as the output of a ConvNet: the restored image is x* = f_θ*(z), where z is a fixed random input and θ* = argmin_θ E(f_θ(z); x0) minimizes a task-specific energy E against the corrupted observation x0. Only the weights are optimized, starting from a random initialization and stopping early. Because the network fits natural image structure much faster than it fits noise, this early-stopped optimization implicitly regularizes the solution space, introducing a prior without any explicit external learning.
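As a concrete illustration, here is a minimal PyTorch sketch of this loop for the denoising energy E(x; x0) = ||x - x0||². The small generator `SmallHourglass`, the input depth, and the step count are illustrative placeholders; the paper's generator is a much deeper U-Net-style hourglass with skip connections and task-tuned schedules.

```python
import torch
import torch.nn as nn

class SmallHourglass(nn.Module):
    """Hypothetical stand-in for the paper's encoder-decoder generator."""
    def __init__(self, in_ch=32):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, 64, 3, padding=1), nn.BatchNorm2d(64), nn.LeakyReLU(),
            nn.Conv2d(64, 64, 3, padding=1), nn.BatchNorm2d(64), nn.LeakyReLU(),
            nn.Conv2d(64, 3, 3, padding=1), nn.Sigmoid(),  # pixel values in [0, 1]
        )

    def forward(self, z):
        return self.net(z)

def dip_denoise(x0, num_steps=2400, lr=0.01):
    """Fit a randomly initialized ConvNet to a noisy image x0 of shape (1, 3, H, W).

    The random code z stays fixed; only the weights move. Early stopping
    (num_steps) is the implicit regularizer: natural image structure is
    fitted long before the noise is.
    """
    _, _, h, w = x0.shape
    z = 0.1 * torch.rand(1, 32, h, w)          # fixed random input
    f = SmallHourglass(in_ch=32)               # random initialization is the prior
    opt = torch.optim.Adam(f.parameters(), lr=lr)

    for _ in range(num_steps):
        opt.zero_grad()
        loss = ((f(z) - x0) ** 2).mean()       # E(x; x0) = ||x - x0||^2
        loss.backward()
        opt.step()

    return f(z).detach()                       # restored image x* = f_theta*(z)
```

The reason this works as a denoiser, per the paper, lies in the optimization trajectory: the loss toward a clean image descends quickly, while fitting i.i.d. noise takes far longer, so stopping at a moderate iteration count leaves the noise largely unreproduced.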
Key Experiments and Results:
- Denoising:
- The authors apply their method to images corrupted with Gaussian noise, demonstrating competitive results against state-of-the-art techniques like CBM3D and Non-Local Means (NLM).
- Interestingly, the approach also handles real-world noise, with a reported PSNR (Peak Signal-to-Noise Ratio) of 41.95 dB on one dataset, highlighting its robustness.
- Super-Resolution:
- For upscaling low-resolution images, the method closely approaches the performance of trained models such as SRResNet and LapSRN without ever seeing external training data; here the energy compares a downsampled rendering of the generated image to the low-resolution input, E(x; x0) = ||d(x) - x0||², where d is a fixed downsampling operator.
- The visual quality of the upsampled images, characterized by sharp edges and reduced artifacts, is particularly noteworthy.
- Inpainting:
- The approach is employed to fill in missing regions of an image, with the data term evaluated only on the known pixels via a binary mask (see the sketch after this list). Both small-hole and large-hole inpainting results exhibit high fidelity to the surrounding context, outperforming methods such as Shepard Convolutional Networks and convolutional sparse coding in many cases.
- Other Applications:
- The method is also applied to high-frequency image enhancement and flash-no-flash reconstruction, where it successfully reduces noise while preserving the desired lighting conditions.
- Additionally, the authors explore the use of their deep image prior for natural pre-image and activation maximization tasks, providing interpretable visualizations of deep network representations.
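To make the task-specific energies concrete, the sketch below shows how denoising, inpainting, and super-resolution plug into the same optimization loop from the earlier sketch by swapping only the data term E(x; x0). The bilinear downsampler and the mask convention (shape (1, 1, H, W), 1 = known pixel) are simplifying assumptions, not the paper's exact operators.

```python
import torch
import torch.nn.functional as F

def energy(x, x0, task, mask=None, factor=4):
    """Task-specific data terms E(x; x0) for the deep image prior loop.

    x    : current generator output f_theta(z), shape (1, 3, H, W)
    x0   : corrupted observation (noisy, masked, or low-resolution image)
    mask : binary mask of known pixels (inpainting only), shape (1, 1, H, W)
    """
    if task == "denoise":
        # Fit the noisy image directly; early stopping filters out the noise.
        return ((x - x0) ** 2).mean()
    if task == "inpaint":
        # Penalize disagreement only on known pixels; the prior fills the hole.
        return (((x - x0) * mask) ** 2).sum() / (mask.sum() * x.shape[1])
    if task == "superres":
        # Compare a downsampled rendering of x to the low-resolution input.
        # (The paper uses a fixed downsampling operator; bilinear
        # interpolation here is an assumption for brevity.)
        d_x = F.interpolate(x, scale_factor=1.0 / factor, mode="bilinear",
                            align_corners=False)
        return ((d_x - x0) ** 2).mean()
    raise ValueError(f"unknown task: {task}")
```

A notable design point follows from this separation: the generator never sees which pixels are missing or how the image was degraded. All task knowledge lives in the energy, which is what lets a single untrained architecture act as a generic prior across tasks.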
Theoretical and Practical Implications
The authors make a bold claim: the structure of generator ConvNets alone encapsulates enough of the statistics of natural images to function as a prior for restoration, without any learned weights. This challenges the prevalent view that ConvNet success is largely due to extensive training on large datasets; the findings suggest that the architecture itself is a crucial component of that success.
Implications:
- Theoretical: The discovery that ConvNet architectures alone can encode significant priors prompts a reevaluation of what constitutes "learning" in deep learning, hinting at a deeper, more fundamental relationship between architectural design and the representation of data.
- Practical: The method is computationally intensive, since each image requires optimizing a network from scratch, so it is not practical for real-time applications. Nonetheless, it opens avenues for unsupervised and self-supervised learning techniques, especially where annotated data is scarce or nonexistent.
Future Developments:
- The paper suggests that further exploration of novel network architectures could yield even stronger implicit priors, potentially improving the performance of deep networks in sparse- or limited-data settings.
- It also raises the possibility of combining these implicit priors with learning-based approaches to further boost performance in image restoration tasks.
In conclusion, "Deep Image Prior" by Ulyanov et al. provides a compelling argument for the significant role of ConvNet architectures as powerful priors for image restoration. This insight deepens the understanding of ConvNets and extends their applicability across restoration tasks without extensive pretraining.