A Deep Journey into Super-resolution: A survey (1904.07523v3)

Published 16 Apr 2019 in cs.CV

Abstract: Deep convolutional networks based super-resolution is a fast-growing field with numerous practical applications. In this exposition, we extensively compare 30+ state-of-the-art super-resolution Convolutional Neural Networks (CNNs) over three classical and three recently introduced challenging datasets to benchmark single image super-resolution. We introduce a taxonomy for deep-learning based super-resolution networks that groups existing methods into nine categories including linear, residual, multi-branch, recursive, progressive, attention-based and adversarial designs. We also provide comparisons between the models in terms of network complexity, memory footprint, model input and output, learning details, the type of network losses and important architectural differences (e.g., depth, skip-connections, filters). The extensive evaluation performed, shows the consistent and rapid growth in the accuracy in the past few years along with a corresponding boost in model complexity and the availability of large-scale datasets. It is also observed that the pioneering methods identified as the benchmark have been significantly outperformed by the current contenders. Despite the progress in recent years, we identify several shortcomings of existing techniques and provide future research directions towards the solution of these open problems.

Authors (3)

Saeed Anwar (64 papers)
Salman Khan (244 papers)
Nick Barnes (81 papers)

Citations (365)

View on Semantic Scholar

Summary

A Comprehensive Overview of Super-Resolution via Deep Learning

The paper "A Deep Journey into Super-resolution: A Survey" provides a detailed exposition on single-image super-resolution (SISR) leveraging deep convolutional networks. The authors meticulously review over 30 state-of-the-art SISR Convolutional Neural Networks (CNNs), assessing their performance across several datasets, both classical and recently introduced. This survey underscores the burgeoning progress in SISR accuracy accompanied by an uptick in model complexity and dataset availability, underscoring substantial advancements in the field.

Methodological Categorization and Analysis

The paper introduces a novel taxonomy for SISR methods, categorizing them into nine distinct types based on architectural features:

Linear Networks: These employ a single path for information flow without complex skip connections. SRCNN and FSRCNN epitomize this category, focusing on early or late upsampling designs to balance computational efficiency and accuracy.
Residual Networks: These incorporate skip connections to facilitate gradient flow and enable deeper network structures. EDSR and CARN are key examples, showcasing the benefits of both single-stage and multi-stage designs.
Recursive Networks: Utilizing recursive structures, these methods like DRCN and DRRN recursively apply network layers to enhance the learning depth without increasing parameter count.
Progressive Reconstruction Designs: For larger scaling factors, methods like LapSRN use progressive prediction to iteratively improve resolution, capitalizing on a pyramidal network structure.
Densely Connected Networks: Inspired by DenseNet architecture, models such as SRDenseNet provide rich feature propagation via dense connections, facilitating enhanced SR outcomes.
Multi-Branch Designs: These architectures, exemplified by CNF and CMSC, employ multiple parallel branches to obtain varied feature perspectives, merging them to improve final results.
Attention-Based Networks: Recent models like RCAN leverage attention mechanisms to selectively focus on pertinent features, demonstrating improved outcomes in SR tasks.
Handling Multiple Degradations: Models such as ZSSR and SRMD address the reality of multiple degradations in input images, offering versatile solutions without overfitting to specific degradations.
Generative Adversarial Networks (GANs): Approaches like SRGAN and ESRGAN use adversarial loss to produce sharper images, prioritizing perceptual quality over conventional quantitative metrics like PSNR.

Results and Implications

The paper presents a rigorous evaluative framework, benchmarking these models across different datasets, including Set5, Set14, BSD100, Urban100, DIV2K, and Manga109. It is evidenced that while GANs generally enhance perceptual image quality, traditional CNNs focusing on pixel-level accuracy yield higher PSNR values. Moreover, it is noted that deeper networks typically achieve superior results, though at the expense of increased computational complexity.

Open Challenges and Future Directions

The survey identifies several open research avenues within SISR:

Incorporation of Priors: Leveraging domain-specific knowledge could bolster the performance of SISR models in contexts where training data is scarce.
Objective Functions and Metrics: There is a need for developing more refined metrics that align closely with human perceptual evaluations.
Unified Solutions for Multiple Degradations: Designing SR models capable of simultaneously handling diverse degradations remains a complex challenge.
Unsupervised SISR: Exploring approaches for SISR without paired high-resolution images could offer practical benefits in real-world applications.
Higher and Arbitrary SR Rates: Tackling SR beyond 4x upsampling and arbitrary scaling remains a largely uncharted area needing innovative techniques.
Handling Real-World Degradations: Addressing the discrepancies between synthetic and real-world degradations is pertinent for the deployment of SISR in practical scenarios.

Conclusion

This comprehensive survey highlights substantial leaps in SISR achieved through deep learning, while also identifying critical gaps needing further exploration. As this field continues to evolve, addressing these challenges will undoubtedly pave the way for more robust and versatile SISR solutions, benefiting applications ranging from medical imaging to consumer electronics.

PDF Markdown

Related Papers

Find Related Papers