Diffusion Enhancement for Cloud Removal in Ultra-Resolution Remote Sensing Imagery (2401.15105v1)
Abstract: The presence of cloud layers severely compromises the quality and effectiveness of optical remote sensing (RS) images. However, existing deep-learning (DL)-based Cloud Removal (CR) techniques encounter difficulties in accurately reconstructing the original visual authenticity and detailed semantic content of the images. To tackle this challenge, this work proposes to encompass enhancements at the data and methodology fronts. On the data side, an ultra-resolution benchmark named CUHK Cloud Removal (CUHK-CR) of 0.5m spatial resolution is established. This benchmark incorporates rich detailed textures and diverse cloud coverage, serving as a robust foundation for designing and assessing CR models. From the methodology perspective, a novel diffusion-based framework for CR called Diffusion Enhancement (DE) is proposed to perform progressive texture detail recovery, which mitigates the training difficulty with improved inference accuracy. Additionally, a Weight Allocation (WA) network is developed to dynamically adjust the weights for feature fusion, thereby further improving performance, particularly in the context of ultra-resolution image generation. Furthermore, a coarse-to-fine training strategy is applied to effectively expedite training convergence while reducing the computational complexity required to handle ultra-resolution images. Extensive experiments on the newly established CUHK-CR and existing datasets such as RICE confirm that the proposed DE framework outperforms existing DL-based methods in terms of both perceptual quality and signal fidelity.
- X. Zhang, W. Yu, and M.-O. Pun, “Multilevel deformable attention-aggregated networks for change detection in bitemporal remote sensing imagery,” IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1–18, 2022.
- X. Yuan, J. Shi, and L. Gu, “A review of deep learning methods for semantic segmentation of remote sensing imagery,” Expert Systems with Applications, vol. 169, p. 114417, 2021.
- K. Li, G. Wan, G. Cheng, L. Meng, and J. Han, “Object detection in optical remote sensing images: A survey and a new benchmark,” ISPRS Journal of Photogrammetry and Remote Sensing, vol. 159, pp. 296–307, 2020.
- M. Xu, X. Jia, M. Pickering, and S. Jia, “Thin cloud removal from optical remote sensing images using the noise-adjusted principal components transform,” ISPRS Journal of Photogrammetry and Remote Sensing, vol. 149, pp. 215–225, 2019.
- G. Hu, X. Li, and D. Liang, “Thin cloud removal from remote sensing images using multidirectional dual-tree complex wavelet transform and transfer least square support vector regression,” Journal of Applied Remote Sensing, vol. 9, no. 1, pp. 095 053–095 053, 2015.
- T.-Y. Ji, D. Chu, X.-L. Zhao, and D. Hong, “A unified framework of cloud detection and removal based on low-rank and group sparse regularizations for multitemporal multispectral images,” IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1–15, 2022.
- M. Xu, M. Pickering, A. J. Plaza, and X. Jia, “Thin cloud removal based on signal transmission principles and spectral mixture analysis,” IEEE Transactions on Geoscience and Remote Sensing, vol. 54, no. 3, pp. 1659–1669, 2015.
- Y. Chen, W. He, N. Yokoya, and T.-Z. Huang, “Blind cloud and cloud shadow removal of multitemporal images based on total variation regularized low-rank sparsity decomposition,” ISPRS Journal of Photogrammetry and Remote Sensing, vol. 157, pp. 93–107, 2019.
- J. Wang, P. A. Olsen, A. R. Conn, and A. C. Lozano, “Removing clouds and recovering ground observations in satellite image sequences via temporally contiguous robust matrix completion,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2016, pp. 2754–2763.
- H. Ding, Y. Zi, and F. Xie, “Uncertainty-based thin cloud removal network via conditional variational autoencoders,” in Proceedings of the Asian Conference on Computer Vision, 2022, pp. 469–485.
- I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio, “Generative adversarial nets,” Advances in neural information processing systems, vol. 27, 2014.
- P. Singh and N. Komodakis, “Cloud-Gan: Cloud removal for sentinel-2 imagery using a cyclic consistent generative adversarial networks,” in Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2018, pp. 1772–1775.
- R. Jing, F. Duan, F. Lu, M. Zhang, and W. Zhao, “Denoising diffusion probabilistic feature-based network for cloud removal in sentinel-2 imagery,” Remote Sensing, vol. 15, no. 9, p. 2217, 2023.
- A. Meraner, P. Ebel, X. X. Zhu, and M. Schmitt, “Cloud removal in sentinel-2 imagery using a deep residual neural network and sar-optical data fusion,” ISPRS Journal of Photogrammetry and Remote Sensing, vol. 166, pp. 333–346, 2020.
- D. Ma, R. Wu, D. Xiao, and B. Sui, “Cloud removal from satellite images using a deep learning model with the cloud-matting method,” Remote Sensing, vol. 15, no. 4, p. 904, 2023.
- J. Ho, A. Jain, and P. Abbeel, “Denoising diffusion probabilistic models,” Advances in neural information processing systems, vol. 33, pp. 6840–6851, 2020.
- J. Sui, X. Ma, X. Zhang, and M.-O. Pun, “GCRDN: Global context-driven residual dense network for remote sensing image super-resolution,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2023.
- J. Sui, X. Ma, X. Zhang, and M.-O. Pun, “DTRN: Dual transformer residual network for remote sensing super-resolution,” in Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2023, pp. 6041–6044.
- Y. Ma, H. Yang, W. Yang, J. Fu, and J. Liu, “Solving diffusion odes with optimal boundary conditions for better image super-resolution,” arXiv preprint arXiv:2305.15357, 2023.
- X. Tao, H. Gao, X. Shen, J. Wang, and J. Jia, “Scale-recurrent network for deep image deblurring,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018, pp. 8174–8182.
- S.-J. Cho, S.-W. Ji, J.-P. Hong, S.-W. Jung, and S.-J. Ko, “Rethinking coarse-to-fine approach in single image deblurring,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 4641–4650.
- J. Yu, Z. Lin, J. Yang, X. Shen, X. Lu, and T. S. Huang, “Free-form image inpainting with gated convolution,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019, pp. 4471–4480.
- P. Wang, B. Bayram, and E. Sertel, “A comprehensive review on deep learning based remote sensing image super-resolution methods,” Earth-Science Reviews, pp. 104–110, 2022.
- C. Thomas, T. Ranchin, L. Wald, and J. Chanussot, “Synthesis of multispectral images to high spatial resolution: A critical review of fusion methods based on remote sensing physics,” IEEE Transactions on Geoscience and Remote Sensing, vol. 46, no. 5, pp. 1301–1312, 2008.
- X. Zhang, W. Yu, M.-O. Pun, and W. Shi, “Cross-domain landslide mapping from large-scale remote sensing images using prototype-guided domain-aware progressive representation learning,” ISPRS Journal of Photogrammetry and Remote Sensing, vol. 197, pp. 1–17, 2023.
- D. Lin, G. Xu, X. Wang, Y. Wang, X. Sun, and K. Fu, “A remote sensing image dataset for cloud removal,” arXiv preprint arXiv:1901.00600, 2019.
- J. Li, Z. Wu, Z. Hu, Z. Li, Y. Wang, and M. Molinier, “Deep learning based thin cloud removal fusing vegetation red edge and short wave infrared spectral information for sentinel-2a imagery,” Remote Sensing, vol. 13, no. 1, p. 157, 2021.
- P. Ebel, Y. Xu, M. Schmitt, and X. X. Zhu, “SEN12MS-CR-TS: A remote-sensing data set for multimodal multitemporal cloud removal,” IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1–14, 2022.
- S. Ji, P. Dai, M. Lu, and Y. Zhang, “Simultaneous cloud detection and removal from bitemporal remote sensing images using cascade convolutional neural networks,” IEEE Transactions on Geoscience and Remote Sensing, vol. 59, no. 1, pp. 732–748, 2020.
- H. Pan, “Cloud removal for remote sensing imagery via spatial attention generative adversarial network,” arXiv preprint arXiv:2009.13015, 2020.
- M. Xu, F. Deng, S. Jia, X. Jia, and A. J. Plaza, “Attention mechanism-based generative adversarial networks for cloud removal in landsat images,” Remote sensing of environment, vol. 271, p. 112902, 2022.
- R. Rombach, A. Blattmann, D. Lorenz, P. Esser, and B. Ommer, “High-resolution image synthesis with latent diffusion models,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 10 684–10 695.
- Y. Benny and L. Wolf, “Dynamic dual-output diffusion models,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 11 482–11 491.
- X. Zou, K. Li, J. Xing, Y. Zhang, S. Wang, L. Jin, and P. Tao, “DiffCR: A fast conditional diffusion framework for cloud removal from optical satellite images,” arXiv preprint arXiv:2308.04417, 2023.
- X. Zhao and K. Jia, “Cloud removal in remote sensing using sequential-based diffusion models,” Remote Sensing, vol. 15, no. 11, p. 2861, 2023.
- B. Fei, Z. Lyu, L. Pan, J. Zhang, W. Yang, T. Luo, B. Zhang, and B. Dai, “Generative diffusion prior for unified image restoration and enhancement,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 9935–9946.
- D. Zhou, Z. Yang, and Y. Yang, “Pyramid diffusion models for low-light image enhancement,” arXiv preprint arXiv:2305.10028, 2023.
- S. Mohajerani and P. Saeedi, “Cloud-Net: An end-to-end cloud detection algorithm for landsat 8 imagery,” in Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, July 2019, pp. 1029–1032.
- S. Mohajerani, T. A. Krammer, and P. Saeedi, “”A Cloud Detection Algorithm for Remote Sensing Images Using Fully Convolutional Neural Networks”,” in Proceedings of IEEE International Workshop on Multimedia Signal Processing (MMSP), Aug 2018, pp. 1–5.
- P. Dhariwal and A. Nichol, “Diffusion models beat GANs on image synthesis,” in Proceedings of Advances in Neural Information Processing Systems, vol. 34, 2021, pp. 8780–8794.
- R. Zhang, P. Isola, A. A. Efros, E. Shechtman, and O. Wang, “The unreasonable effectiveness of deep features as a perceptual metric,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018, pp. 586–595.
- J. Song, C. Meng, and S. Ermon, “Denoising diffusion implicit models,” in Proceedings of International Conference on Learning Representations, 2020.
- X. F. Zhang, C. C. Gu, and S. Y. Zhu, “Memory augment is all you need for image restoration,” arXiv preprint arXiv:2309.01377, 2023.
- W. Yu, X. Zhang, and M.-O. Pun, “Cloud removal in optical remote sensing imagery using multiscale distortion-aware networks,” IEEE Geoscience and Remote Sensing Letters, vol. 19, pp. 1–5, 2022.
- Jialu Sui (6 papers)
- Yiyang Ma (15 papers)
- Wenhan Yang (96 papers)
- Xiaokang Zhang (42 papers)
- Man-On Pun (28 papers)
- Jiaying Liu (99 papers)