DeblurGAN-v2: Deblurring (Orders-of-Magnitude) Faster and Better
The paper "DeblurGAN-v2: Deblurring (Orders-of-Magnitude) Faster and Better" introduces a novel approach to single-image motion deblurring based on generative adversarial networks (GANs). The proposed method, DeblurGAN-v2, significantly improves on its predecessor, DeblurGAN, in both effectiveness and efficiency. The authors present a framework that integrates a Feature Pyramid Network (FPN) into the generator and pairs it with a relativistic double-scale discriminator, delivering superior deblurring performance while remaining flexible across a range of computational budgets.
Summary of Key Innovations
- Framework Enhancements:
- The authors incorporate a Feature Pyramid Network (FPN), an architecture originally developed for object detection, into their deblurring generator, which they note is the first use of FPN in image restoration. This enables the network to aggregate multi-scale features without the computational burden typically associated with multi-scale convolutional networks.
- The enhanced discriminator operates on both global and local scales, allowing for robust distinction between real and deblurred images by considering spatially contextual information.
- Backbone Flexibility:
- DeblurGAN-v2 is designed to support various backbones, providing a unique balance between deblurring quality and computational efficiency. High-performance backbones like Inception-ResNet-v2 are employed for superior results, whereas lightweight alternatives like MobileNet and MobileNet-DSC facilitate real-time execution and reduced computational cost.
- Loss Function Optimization:
- The authors replace the WGAN-GP used in DeblurGAN with a Relativistic Average GAN (RaGAN) combined with Least-Squares loss (LS), yielding RaGAN-LS. This new objective function is noted for improving training stability and accelerating convergence. Additionally, a hybrid loss function combining perceptual and pixel-space losses ensures both high perceptual quality and accurate color and texture reconstruction.
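To make the FPN bullet above concrete, here is a toy NumPy sketch of the top-down pathway with lateral connections that an FPN-style generator relies on. The real network uses learned convolutions, a deeper pyramid, and a trained backbone; `upsample2x` and `fpn_top_down` are illustrative names of my own, not the authors' code.

```python
import numpy as np

def upsample2x(x):
    # Nearest-neighbour 2x upsampling, a stand-in for the learned
    # upsampling used in a real FPN.
    return x.repeat(2, axis=0).repeat(2, axis=1)

def fpn_top_down(features):
    """Merge a backbone feature pyramid top-down with lateral additions.

    `features` is ordered fine-to-coarse (e.g. strides 4, 8, 16, 32),
    with each level already projected to the same channel width.
    """
    merged = [features[-1]]                      # start from the coarsest level
    for lateral in reversed(features[:-1]):
        merged.append(lateral + upsample2x(merged[-1]))  # lateral + top-down
    return list(reversed(merged))                # back to fine-to-coarse order
```

The point of the sketch is the data flow: coarse, semantically rich features are progressively upsampled and fused with finer levels, so every output scale sees both local detail and global context at roughly the cost of a single pass.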
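The RaGAN-LS objective mentioned above can be sketched in a few lines; this is a minimal NumPy rendering of the relativistic average least-squares losses (in the paper these are applied to discriminator outputs at both scales; the function names here are illustrative, not taken from the authors' implementation).

```python
import numpy as np

def ragan_ls_d_loss(d_real, d_fake):
    # Relativistic average least-squares discriminator loss:
    # real samples should score one unit above the average fake score,
    # fake samples one unit below the average real score.
    return (np.mean((d_real - np.mean(d_fake) - 1.0) ** 2)
            + np.mean((d_fake - np.mean(d_real) + 1.0) ** 2))

def ragan_ls_g_loss(d_real, d_fake):
    # The generator optimises the same form with the targets reversed,
    # pushing fake scores above the average real score.
    return (np.mean((d_real - np.mean(d_fake) + 1.0) ** 2)
            + np.mean((d_fake - np.mean(d_real) - 1.0) ** 2))
```

Because each sample is judged relative to the average score of the opposite class, the discriminator cannot "win" outright, which is the intuition behind the training-stability and convergence-speed benefits the authors report.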
Experimental Results
The proposed DeblurGAN-v2 architecture demonstrates compelling performance across several benchmarks:
- GoPro Dataset: DeblurGAN-v2 with the Inception-ResNet-v2 backbone achieves PSNR and SSIM on par with or exceeding state-of-the-art models such as SRN, while achieving a 78% reduction in inference time. The lightweight MobileNet-DSC variant offers a remarkable 100-fold speed improvement over DeepDeblur, with competitive SSIM values.
- Köhler Dataset: The framework maintains top-tier performance, with DeblurGAN-v2 (Inception-ResNet-v2) and SRN roughly tied for the highest PSNR and SSIM. Importantly, DeblurGAN-v2 achieves this while being computationally less intensive.
- Real-world Applications: On the Lai dataset, which includes real-world blurry images with no available ground truth, subjective evaluations rank DeblurGAN-v2 (Inception-ResNet-v2) highest in perceptual quality, surpassing even the SRN model. This showcases its practical effectiveness in handling diverse and complex blurring artifacts.
Implications and Future Directions
The innovations presented in DeblurGAN-v2 have substantial practical and theoretical implications. Practically, the ability to deblur images in near real-time opens up new possibilities for deploying this technology in mobile applications and low-power edge devices. The framework's flexibility in trading off between performance and efficiency makes it adaptable across various resource-constrained scenarios.
Theoretically, incorporating FPN into deblurring tasks and employing a double-scale discriminator illustrates an effective strategy for leveraging hierarchical feature representations and multi-scale contextual information. The introduction of RaGAN-LS presents a promising direction for stabilizing GAN training in high-stakes restoration tasks and could catalyze further research into loss function formulations.
Conclusion
DeblurGAN-v2 demonstrates a significant step forward in the field of image deblurring by addressing both speed and quality. Future research could extend this framework to video deblurring and broader image restoration problems, fostering advancements across both application-specific and fundamental research avenues in computer vision and deep learning. The contributions of this work not only advance the state-of-the-art in deblurring efficiency but also open new research paths for improving and optimizing image restoration techniques.