Underwater Variable Zoom: Depth-Guided Perception Network for Underwater Image Enhancement (2404.17883v4)
Abstract: Underwater scenes intrinsically suffer from degradation caused by heterogeneous ocean elements. Prevailing underwater image enhancement (UIE) methods rely on straightforward feature modeling to learn the mapping function, which limits the visual gain because it lacks explicit physical cues (e.g., depth). In this work, we investigate injecting a depth prior into a deep UIE model for more precise scene enhancement. To this end, we present a novel depth-guided perception UIE framework, dubbed underwater variable zoom (UVZ). Specifically, UVZ uses a two-stage pipeline. First, a depth estimation network generates depth maps, and an auxiliary supervision network is introduced to suppress estimation differences during training. Second, UVZ parses near and far scenes using the predicted depth maps, enabling local and non-local perception in different regions. Extensive experiments on five benchmark datasets demonstrate that UVZ achieves superior visual gain and delivers promising quantitative metrics. In addition, UVZ generalizes well to some visual tasks, especially under unusual lighting conditions. The code, models and results are available at: https://github.com/WindySprint/UVZ.
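The second stage described in the abstract, where a predicted depth map decides where local and non-local perception apply, can be illustrated with a toy sketch. This is not the UVZ implementation: the function names (`local_branch`, `non_local_branch`, `depth_guided_blend`) are hypothetical, a 3x3 mean filter and a global mean stand in for the learned local and non-local operators, and two conventions are assumed for illustration only, namely that depth is normalized to [0, 1] with larger values meaning farther, and that near pixels favor the local branch while far pixels favor the non-local one. The abstract does not state which region receives which kind of perception.

```python
from typing import List

Grid = List[List[float]]  # single-channel image or depth map, row-major


def local_branch(img: Grid) -> Grid:
    """3x3 mean filter; a stand-in for a local (convolution-like) operator.

    Border pixels average only the neighbours that fall inside the image.
    """
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            total, count = 0.0, 0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    yy, xx = y + dy, x + dx
                    if 0 <= yy < h and 0 <= xx < w:
                        total += img[yy][xx]
                        count += 1
            out[y][x] = total / count
    return out


def non_local_branch(img: Grid) -> Grid:
    """Global mean copied to every pixel; a stand-in for a non-local operator."""
    h, w = len(img), len(img[0])
    mean = sum(sum(row) for row in img) / (h * w)
    return [[mean] * w for _ in range(h)]


def depth_guided_blend(img: Grid, depth: Grid) -> Grid:
    """Blend the two branches per pixel, using depth as a soft far-region weight.

    ASSUMPTIONS (not from the paper): depth lies in [0, 1], larger means
    farther, and far pixels lean on the non-local branch.
    """
    loc = local_branch(img)
    non = non_local_branch(img)
    h, w = len(img), len(img[0])
    return [
        [(1.0 - depth[y][x]) * loc[y][x] + depth[y][x] * non[y][x] for x in range(w)]
        for y in range(h)
    ]


if __name__ == "__main__":
    image = [[0.0, 0.0, 1.0]]
    depth = [[0.0, 0.5, 1.0]]  # would come from a depth estimator in stage one
    print(depth_guided_blend(image, depth))
```

With a depth map of all zeros the output equals the local branch, and with all ones it equals the non-local branch, so the depth map acts as a per-pixel switch between the two kinds of perception. A learned model would replace both stand-in operators and the linear blend with trainable modules.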
Authors: Zhixiong Huang, Xinying Wang, Jinjiang Li, Lin Feng, Chengpei Xu