GAMA-IR: Global Additive Multidimensional Averaging for Fast Image Restoration (2404.00807v1)
Abstract: Deep learning-based methods have shown remarkable success in various image restoration tasks such as denoising and deblurring. The current state-of-the-art networks are relatively deep and utilize (variants of) self-attention mechanisms. These networks are significantly slower than shallow convolutional networks, which, however, perform worse. In this paper, we introduce an image restoration network that is both fast and yields excellent image quality. The network is designed to minimize latency and memory consumption when executed on a standard GPU, while maintaining state-of-the-art performance. It is a simple, shallow network with an efficient block that implements global additive multidimensional averaging operations. This block captures global information and enables a large receptive field with minimal computational overhead, even when used in shallow networks. Through extensive experiments and evaluations on diverse tasks, we demonstrate that our network achieves comparable or even superior results to existing state-of-the-art image restoration networks at lower latency. For instance, we exceed the state-of-the-art result on real-world SIDD denoising by 0.11 dB, while being 2 to 10 times faster.
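The abstract does not spell out the block itself, so the following is only a minimal sketch of what "global additive multidimensional averaging" could look like, not the authors' implementation. It assumes a single feature map of shape (C, H, W), takes the mean along each of the three dimensions, and adds the broadcast means back onto the input. The function name `additive_multidim_average` and the `scales` parameter are hypothetical and introduced purely for illustration.

```python
import numpy as np


def additive_multidim_average(x, scales=(1.0, 1.0, 1.0)):
    """Add per-dimension means back onto a (C, H, W) feature map.

    Illustrative assumption only: the paper's actual block may differ
    (for example in how the averages are weighted or combined).
    """
    x = np.asarray(x, dtype=np.float64)
    if x.ndim != 3:
        raise ValueError("expected a feature map of shape (C, H, W)")
    s_c, s_h, s_w = scales

    # Each mean collapses one full dimension, so it summarizes values
    # that may lie arbitrarily far away from a given position.
    mean_c = x.mean(axis=0, keepdims=True)  # shape (1, H, W)
    mean_h = x.mean(axis=1, keepdims=True)  # shape (C, 1, W)
    mean_w = x.mean(axis=2, keepdims=True)  # shape (C, H, 1)

    # "Additive": the averages are broadcast and summed with the input,
    # which costs only a few reductions and element-wise additions.
    return x + s_c * mean_c + s_h * mean_h + s_w * mean_w


if __name__ == "__main__":
    feat = np.zeros((2, 3, 4))
    feat[0, 0, 0] = 1.0  # a single active value in one corner
    out = additive_multidim_average(feat)
    # The far end of the same row now carries information from the corner.
    print(out[0, 0, 3])
```

In this sketch, one application lets every output position see all values that share its row, column, or channel line, and a second application spreads that information across the whole tensor. This is the sense in which such averaging can provide a large receptive field at low cost in a shallow network.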