Practical Blind Image Denoising via Swin-Conv-UNet and Data Synthesis (2203.13278v4)
Abstract: While recent years have witnessed a dramatic upsurge of exploiting deep neural networks toward solving image denoising, existing methods mostly rely on simple noise assumptions, such as additive white Gaussian noise (AWGN), JPEG compression noise and camera sensor noise, and a general-purpose blind denoising method for real images remains unsolved. In this paper, we attempt to solve this problem from the perspective of network architecture design and training data synthesis. Specifically, for the network architecture design, we propose a swin-conv block to incorporate the local modeling ability of residual convolutional layer and non-local modeling ability of swin transformer block, and then plug it as the main building block into the widely-used image-to-image translation UNet architecture. For the training data synthesis, we design a practical noise degradation model which takes into consideration different kinds of noise (including Gaussian, Poisson, speckle, JPEG compression, and processed camera sensor noises) and resizing, and also involves a random shuffle strategy and a double degradation strategy. Extensive experiments on AGWN removal and real image denoising demonstrate that the new network architecture design achieves state-of-the-art performance and the new degradation model can help to significantly improve the practicability. We believe our work can provide useful insights into current denoising research.
- Fast image recovery using variable splitting and constrained optimization. IEEE transactions on image processing, 19(9):2345–2356, 2010.
- Ntire 2017 challenge on single image super-resolution: Dataset and study. In IEEE Conference on Computer Vision and Pattern Recognition Workshops, volume 3, pages 126–135, July 2017.
- Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends in Machine Learning, 3(1):1–122, 2011.
- Unprocessing images for learned raw denoising. In IEEE Conference on Computer Vision and Pattern Recognition, pages 11036–11045, 2019.
- A non-local algorithm for image denoising. In IEEE Conference on Computer Vision and Pattern Recognition, volume 2, pages 60–65, 2005.
- Learning how to combine internal and external denoising methods. In German Conference on Pattern Recognition, pages 121–130, 2013.
- Swin-unet: Unet-like pure transformer for medical image segmentation. In European Conference on Computer Vision Workshops, pages 205–218, 2023.
- Is denoising dead? IEEE Transactions on Image Processing, 19(4):895–911, 2009.
- Pre-trained image processing transformer. In IEEE Conference on Computer Vision and Pattern Recognition, pages 12299–12310, 2021.
- Image blind denoising with generative adversarial network based noise modeling. In IEEE Conference on Computer Vision and Pattern Recognition, pages 3155–3164, 2018.
- Trainable nonlinear reaction diffusion: A flexible framework for fast and effective image restoration. IEEE transactions on Pattern Analysis and Machine Intelligence, 39(6):1256–1272, 2016.
- Image denoising by sparse 3-D transform-domain collaborative filtering. IEEE Transactions on Image Processing, 16(8):2080–2095, 2007.
- Rich Franzen. Kodak lossless true color image suite. source: http://r0k. us/graphics/kodak, 4(2), 1999.
- What is the space of camera response functions? In IEEE Conference on Computer Vision and Pattern Recognition, pages II–602, 2003.
- Weighted nuclear norm minimization with application to image denoising. In IEEE Conference on Computer Vision and Pattern Recognition, pages 2862–2869, 2014.
- CMT: Convolutional neural networks meet vision transformers. In IEEE Conference on Computer Vision and Pattern Recognition, pages 12175–12185, 2022.
- Toward convolutional blind denoising of real photographs. In IEEE Conference on Computer Vision and Pattern Recognition, pages 1712–1722, 2019.
- Samuel W Hasinoff. Photon, poisson noise.
- Identity mappings in deep residual networks. arXiv preprint arXiv:1603.05027, 2016.
- Single image super-resolution from transformed self-exemplars. In IEEE Conference on Computer Vision and Pattern Recognition, pages 5197–5206, 2015.
- Focnet: A fractional optimal control network for image denoising. In IEEE Conference on Computer Vision and Pattern Recognition, pages 6054–6063, 2019.
- Towards flexible blind JPEG artifacts removal. In IEEE International Conference on Computer Vision, pages 4997–5006, 2021.
- A plug-and-play priors approach for solving nonlinear imaging inverse problems. IEEE Signal Processing Letters, 24(12):1872–1876, 2017.
- Adam: A method for stochastic optimization. In International Conference for Learning Representations, 2015.
- Noise2void-learning denoising from single noisy images. In IEEE Conference on Computer Vision and Pattern Recognition, pages 2129–2137, 2019.
- The noise clinic: a blind image denoising algorithm. Image Processing On Line, 5:1–54, 2015.
- Stamatios Lefkimmiatis. Non-local color image denoising with convolutional neural networks. In IEEE Conference on Computer Vision and Pattern Recognition, pages 3587–3596, 2017.
- Bossnas: Exploring hybrid cnn-transformers with block-wisely self-supervised neural architecture search. In IEEE International Conference on Computer Vision, pages 12281–12291, 2021.
- LocalViT: Bringing locality to vision transformers. arXiv preprint arXiv:2104.05707, 2021.
- Swinir: Image restoration using swin transformer. In IEEE International Conference on Computer Vision Workshops, pages 1833–1844, 2021.
- Enhanced deep residual networks for single image super-resolution. In IEEE Conference on Computer Vision and Pattern Recognition Workshops, pages 136–144, 2017.
- Non-local recurrent network for image restoration. In Advances in Neural Information Processing Systems, pages 1673–1682, 2018.
- Swin transformer: Hierarchical vision transformer using shifted windows. In IEEE International Conference on Computer Vision, 2021.
- Learning a no-reference quality metric for single-image super-resolution. Computer Vision and Image Understanding, 158:1–16, 2017.
- Waterloo exploration database: New challenges for image quality assessment models. IEEE Transactions on Image Processing, 26(2):1004–1016, 2017.
- Non-local sparse models for image restoration. In IEEE International Conference on Computer Vision, pages 2272–2279, 2009.
- A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In IEEE International Conference on Computer Vision, volume 2, pages 416–423, July 2001.
- Making a “completely blind” image quality analyzer. IEEE Signal Processing Letters, 20(3):209–212, 2012.
- Dynamic attentive graph learning for image restoration. In IEEE International Conference on Computer Vision, pages 4328–4337, 2021.
- A holistic approach to cross-channel image noise modeling and its application to image denoising. In IEEE Conference on Computer Vision and Pattern Recognition, pages 1683–1691, 2016.
- Dilated residual networks with symmetric skip connection for image denoising. Neurocomputing, 345:67–76, 2019.
- Benchmarking denoising algorithms with real photographs. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 1586–1595, 2017.
- Neural nearest neighbors networks. In Advances in Neural Information Processing Systems, pages 1087–1098, 2018.
- Speckle noise and the detection of faint companions. Publications of the Astronomical Society of the Pacific, 111(759):587, 1999.
- Adaptive consistency prior based deep network for image denoising. In IEEE Conference on Computer Vision and Pattern Recognition, pages 8596–8606, 2021.
- U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical Image Computing and Computer-Assisted Intervention, pages 234–241, 2015.
- Fields of experts. International Journal of Computer Vision, 82(2):205–229, 2009.
- Shrinkage fields for effective image restoration. In IEEE Conference on Computer Vision and Pattern Recognition, pages 2774–2781, 2014.
- Learning non-local range markov random field for image restoration. In IEEE Conference on Computer Vision and Pattern Recognition, pages 2745–2752, 2011.
- Image denoising using deep cnn with batch renormalization. Neural Networks, 121:461–473, 2020.
- When is speckle noise multiplicative? Applied optics, 21(7):1157–1159, 1982.
- Blind image quality evaluation using perception based features. In Twenty First National Conference on Communications, pages 1–6, 2015.
- Real-ESRGAN: Training real-world blind super-resolution with pure synthetic data. In IEEE International Conference on Computer Vision Workshops, 2021.
- Uformer: A general u-shaped transformer for image restoration. In IEEE Conference on Computer Vision and Pattern Recognition, pages 17683–17693, 2022.
- Incorporating convolution designs into visual transformers. In IEEE International Conference on Computer Vision, pages 579–588, 2021.
- Variational denoising network: Toward blind noise modeling and removal. In NeurIPS, 2019.
- Restormer: Efficient transformer for high-resolution image restoration. In IEEE Conference on Computer Vision and Pattern Recognition, pages 5728–5739, 2022.
- Plug-and-play image restoration with deep denoiser prior. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021.
- Designing a practical degradation model for deep blind image super-resolution. In IEEE International Conference on Computer Vision, pages 4791–4800, 2021.
- Beyond a gaussian denoiser: Residual learning of deep CNN for image denoising. IEEE Transactions on Image Processing, pages 3142–3155, 2017.
- FFDNet: Toward a fast and flexible solution for cnn-based image denoising. IEEE Transactions on Image Processing, 27(9):4608–4622, 2018.
- Color demosaicking by local directional interpolation and nonlocal adaptive thresholding. Journal of Electronic Imaging, 20(2):023016, 2011.
- Residual non-local attention networks for image restoration. In International Conference on Learning Representations, 2019.
- Residual dense network for image restoration. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(7):2480–2495, 2020.
- Kai Zhang (543 papers)
- Yawei Li (72 papers)
- Jingyun Liang (24 papers)
- Jiezhang Cao (38 papers)
- Yulun Zhang (168 papers)
- Hao Tang (379 papers)
- Deng-Ping Fan (88 papers)
- Radu Timofte (299 papers)
- Luc Van Gool (570 papers)