Deep Unfolding Network for Image Super-Resolution

Published 23 Mar 2020 in eess.IV and cs.CV | (2003.10428v1)

Abstract: Learning-based single image super-resolution (SISR) methods are continuously showing superior effectiveness and efficiency over traditional model-based methods, largely due to the end-to-end training. However, different from model-based methods that can handle the SISR problem with different scale factors, blur kernels and noise levels under a unified MAP (maximum a posteriori) framework, learning-based methods generally lack such flexibility. To address this issue, this paper proposes an end-to-end trainable unfolding network which leverages both learning-based methods and model-based methods. Specifically, by unfolding the MAP inference via a half-quadratic splitting algorithm, a fixed number of iterations consisting of alternately solving a data subproblem and a prior subproblem can be obtained. The two subproblems then can be solved with neural modules, resulting in an end-to-end trainable, iterative network. As a result, the proposed network inherits the flexibility of model-based methods to super-resolve blurry, noisy images for different scale factors via a single model, while maintaining the advantages of learning-based methods. Extensive experiments demonstrate the superiority of the proposed deep unfolding network in terms of flexibility, effectiveness and also generalizability.

Abstract PDF Upgrade to Chat

Citations (501)

View on Semantic Scholar

Summary

The paper’s main contribution is the development of USRNet, an end-to-end trainable deep unfolding network that integrates data and prior modules for versatile image super-resolution.
The method leverages a closed-form FFT-based solution and a ResUNet denoiser to handle various degradation conditions with a unified model.
Experimental results demonstrate that USRNet outperforms traditional approaches in PSNR and visual quality, simplifying practical image restoration.

Deep Unfolding Network for Image Super-Resolution: A Critical Analysis

The paper "Deep Unfolding Network for Image Super-Resolution" extends the capabilities of traditional image super-resolution (SISR) methods by proposing a novel deep unfolding network, termed USRNet. This network aims to bridge the gap between learning-based methods and model-based approaches, maintaining the efficiency of the former while incorporating the flexibility of the latter.

Core Contributions

The principal innovation of this paper lies in the development of an end-to-end trainable unfolding network for SISR, which combines aspects of both traditional and contemporary techniques. The network is constructed by unfolding the MAP inference via the half-quadratic splitting algorithm, resulting in a fixed number of iterations. Each iteration involves alternately solving two subproblems with neural modules: a data subproblem and a prior subproblem. This design allows USRNet to super-resolve low-resolution, blurry, and noisy images across different scale factors using a single unified model.

Key contributions of the paper include:

Unified Model Handling of Various Degradation Factors: USRNet is capable of addressing the classical SISR degradation model for different blur kernels, scale factors, and noise levels, unlike most traditional methods that require separate models for each setting.
End-to-End Trainability: By leveraging a deep learning approach, the network can be trained holistically, ensuring both the efficiency of runtime operations and the fidelity of image reconstruction.
Degradation and Prior Constraints Incorporation: The method explicitly integrates degradation and prior constraints into the network's structure, ensuring more reliable restoration outcomes.

Methodology

The USRNet architecture is built upon the unfolding optimization framework, which originates from the traditional MAP estimation methods. By unraveling this optimization procedure, the research proposes a network that alternates between solving the data term and the prior term using specific neural modules:

Data Module: Solves the data term through a closed-form solution facilitated by the fast Fourier transform (FFT), thereby maintaining degradation consistency.
Prior Module: Employs a novel CNN-based denoiser, termed ResUNet, which effectively restores image details by integrating residual learning mechanisms.
Hyper-Parameter Module: Facilitates dynamic control over module outputs, adapting to the scale factor and noise levels, thus enriching the model's flexibility and robustness.

Experimental Findings

The paper provides extensive experimental results, showcasing the superior performance of USRNet in comparison to contemporary methods across a range of degradation scenarios. Key outcomes from these experiments include:

Robust Performance: USRNet consistently yields higher PSNR values than alternatives across various kernel and noise configurations, indicating its adaptability and effectiveness.
Generalizability: The network performs favorably on a wide array of blur kernels and noise levels, demonstrating flexibility suitable for practical applications.
High-Quality Visual Results: Experiments illustrate that USRNet can recover high-resolution images with sharper details and improved perceptual quality, especially when trained for perceptual optimization using a combination of adversarial and perceptual loss terms.

Practical and Theoretical Implications

USRNet presents significant implications for both theoretical research and practical applications in computer vision:

Practical Applications: The ability to use a single trained model across various settings simplifies deployment in real-world applications where image degradations can be unpredictable.
Future Research Directions: The delineation of how hyper-parameters influence the unfolding process opens avenues for further refinement of adaptive parameter selection mechanisms.
Framework Expansion: The deep unfolding approach could be further explored in other inverse problem domains, potentially benefiting broader fields such as deblurring, denoising, and inpainting.

Conclusion

By dovetailing the flexibility of model-based methods with the robust, data-driven learning of neural networks, USRNet represents a significant advancement in the domain of SISR. The paper not only delivers a robust solution capable of handling diverse image degradation challenges through a unified model but also sets a foundational precedent for incorporating traditional optimization insight into modern deep learning architectures. Future explorations may focus on further tuning hyper-parameter modules for enhanced adaptability, underscoring the dynamic nature of this promising approach.

Markdown