Simple Image Signal Processing using Global Context Guidance (2404.11569v2)
Abstract: In modern smartphone cameras, the Image Signal Processor (ISP) is the core element that converts the RAW readings from the sensor into perceptually pleasant RGB images for the end users. The ISP is typically proprietary and handcrafted and consists of several blocks such as white balance, color correction, and tone mapping. Deep learning-based ISPs aim to transform RAW images into DSLR-like RGB images using deep neural networks. However, most learned ISPs are trained using patches (small regions) due to computational limitations. Such methods lack global context, which limits their efficacy on full-resolution images and harms their ability to capture global properties such as color constancy or illumination. First, we propose a novel module that can be integrated into any neural ISP to capture the global context information from the full RAW images. Second, we propose an efficient and simple neural ISP that utilizes our proposed module. Our model achieves state-of-the-art results on different benchmarks using diverse and real smartphone images.
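The abstract's claim that patch-based processing loses global properties such as color constancy can be illustrated with a toy example. The sketch below is not the module proposed in the paper. It uses the classic gray-world heuristic as a stand-in illuminant estimator, and the synthetic scene, the illuminant values, and the function names are all assumptions made for illustration. It compares white-balance gains estimated from a single patch with gains estimated from the full image.

```python
import numpy as np

def gray_world_gains(rgb):
    """Gray-world white-balance gains, normalized so the green gain is 1."""
    means = rgb.reshape(-1, 3).mean(axis=0)  # mean R, G, B over the region
    return means[1] / means

# Hypothetical scene: neutral gray reflectances plus one saturated red object.
rng = np.random.default_rng(0)
scene = rng.uniform(0.2, 0.8, size=(256, 256, 1)).repeat(3, axis=2)
scene[:64, :64] = [0.8, 0.1, 0.1]  # red object in the top-left corner

# Assumed warm illuminant; the ideal gains undo it exactly.
illuminant = np.array([1.0, 0.8, 0.6])
image = scene * illuminant
true_gains = illuminant[1] / illuminant

# Estimate the gains from the full image and from one 64x64 patch.
global_gains = gray_world_gains(image)
patch_gains = gray_world_gains(image[:64, :64])

global_err = np.abs(global_gains - true_gains).max()
patch_err = np.abs(patch_gains - true_gains).max()
print(f"full-image error: {global_err:.3f}, patch error: {patch_err:.3f}")
```

In this toy setup the patch covers only the red object, so its estimate mistakes object color for illuminant color, while the full-image estimate stays close to the true gains. A learned ISP trained on patches faces the same ambiguity unless some summary of the full RAW image is made available to it.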
Authors:
- Omar Elezabi
- Marcos V. Conde
- Radu Timofte