MCSDNet: Mesoscale Convective System Detection Network via Multi-scale Spatiotemporal Information (2404.17186v1)
Abstract: Accurate detection of Mesoscale Convective Systems (MCS) is crucial for meteorological monitoring because they can cause significant destruction through severe weather phenomena such as hail, thunderstorms, and heavy rainfall. However, existing methods for MCS detection mostly target single-frame detection, which considers only static characteristics and ignores the temporal evolution over the life cycle of an MCS. In this paper, we propose a novel encoder-decoder neural network for MCS detection (MCSDNet). MCSDNet has a simple architecture and is easy to extend. Unlike previous models, MCSDNet targets multi-frame detection and leverages multi-scale spatiotemporal information to detect MCS regions in remote sensing imagery (RSI). To the best of our knowledge, it is the first work to use multi-scale spatiotemporal information to detect MCS regions. First, we design a multi-scale spatiotemporal information module that extracts multi-level semantics from different encoder levels, enabling the model to extract more detailed spatiotemporal features. Second, we introduce a Spatiotemporal Mix Unit (STMU) into MCSDNet to capture both intra-frame features and inter-frame correlations. The STMU is a scalable module that can be replaced by other spatiotemporal modules, e.g., CNN, RNN, Transformer, or our proposed Dual Spatiotemporal Attention (DSTA), so future work on spatiotemporal modules can be easily integrated into our model. Finally, we present MCSRSI, the first publicly available dataset for multi-frame MCS detection, based on visible channel images from the FY-4A satellite. We also conduct several experiments on MCSRSI and find that the proposed MCSDNet achieves the best performance on the MCS detection task compared with the baseline methods.
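The data flow described in the abstract (features taken at several encoder scales, a swappable unit that mixes information across frames, and a decoder that fuses the scales) can be illustrated with a toy NumPy sketch. This is not the MCSDNet architecture: it has no learned weights, average pooling stands in for the encoder, and every name here (`downsample`, `upsample`, `temporal_mean_mix`, `detect`, the `factors` and `threshold` parameters) is a hypothetical chosen for illustration only.

```python
import numpy as np


def downsample(x, factor):
    """Average-pool a clip of shape (T, H, W) by `factor` along H and W."""
    t, h, w = x.shape
    return x.reshape(t, h // factor, factor, w // factor, factor).mean(axis=(2, 4))


def upsample(x, factor):
    """Nearest-neighbour upsampling of a (T, H, W) clip along H and W."""
    return x.repeat(factor, axis=1).repeat(factor, axis=2)


def temporal_mean_mix(x):
    """Toy stand-in for a spatiotemporal unit: blend each frame with the clip mean."""
    return 0.5 * x + 0.5 * x.mean(axis=0, keepdims=True)


def detect(clip, mix_unit=temporal_mean_mix, factors=(1, 2, 4), threshold=0.5):
    """Return one boolean mask per frame from multi-scale, temporally mixed features.

    Assumes H and W are divisible by every entry of `factors`.
    """
    fused = np.zeros_like(clip, dtype=float)
    for f in factors:
        feats = downsample(clip, f)   # one "encoder level" per scale
        feats = mix_unit(feats)       # pluggable inter-frame mixing
        fused += upsample(feats, f)   # bring the level back to full resolution
    fused /= len(factors)             # simple average as the fusion step
    return fused > threshold


# Three identical 8x8 frames with a bright 4x4 square in the centre.
clip = np.zeros((3, 8, 8))
clip[:, 2:6, 2:6] = 1.0
mask = detect(clip)
```

Because `mix_unit` is just a callable from a `(T, H, W)` array to an array of the same shape, a different mixing rule can be passed in without touching the rest of the pipeline, which mirrors the replaceability the abstract attributes to the STMU.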