Multi-modal learning for geospatial vegetation forecasting (2303.16198v2)
Abstract: The innovative application of precise geospatial vegetation forecasting holds immense potential across diverse sectors, including agriculture, forestry, humanitarian aid, and carbon accounting. To leverage the vast availability of satellite imagery for this task, various works have applied deep neural networks for predicting multispectral images in photorealistic quality. However, the important area of vegetation dynamics has not been thoroughly explored. Our study breaks new ground by introducing GreenEarthNet, the first dataset specifically designed for high-resolution vegetation forecasting, and Contextformer, a novel deep learning approach for predicting vegetation greenness from Sentinel 2 satellite images with fine resolution across Europe. Our multi-modal transformer model Contextformer leverages spatial context through a vision backbone and predicts the temporal dynamics on local context patches incorporating meteorological time series in a parameter-efficient manner. The GreenEarthNet dataset features a learned cloud mask and an appropriate evaluation scheme for vegetation modeling. It also maintains compatibility with the existing satellite imagery forecasting dataset EarthNet2021, enabling cross-dataset model comparisons. Our extensive qualitative and quantitative analyses reveal that our methods outperform a broad range of baseline techniques. This includes surpassing previous state-of-the-art models on EarthNet2021, as well as adapted models from time series forecasting and video prediction. To the best of our knowledge, this work presents the first models for continental-scale vegetation modeling at fine resolution able to capture anomalies beyond the seasonal cycle, thereby paving the way for predicting vegetation health and behaviour in response to climate variability and extremes.
- A machine-learning based ConvLSTM architecture for NDVI forecasting. International Transactions in Operational Research, 30(4):2025–2048, 2023.
- Geomorpho90m, empirical evaluation and accuracy assessment of global high-resolution geomorphometric layers. Scientific Data, 7(1):162, 2020.
- Beyond rgb: Very high resolution urban remote sensing with multimodal deep networks. ISPRS journal of photogrammetry and remote sensing, 140:20–32, 2018.
- CloudSEN12, a global dataset for semantic understanding of cloud and cloud shadow in Sentinel-2. Scientific Data, 9(1):782, 2022.
- Layer normalization. arXiv, 1607.06450, 2016.
- FitVid: Overfitting in Pixel-Level Video Prediction. arxiv, 2106.13195, 2021.
- Evaluation of daily precipitation analyses in e-obs (v19. 0e) and era5 by comparison to regional high-resolution datasets in european regions. International Journal of Climatology, 42(2):727–747, 2022.
- Narrow but robust advantages in two-big-leaf light use efficiency models over big-leaf light use efficiency models at ecosystem level. Agricultural and Forest Meteorology, 326:109185, 2022.
- Forecasting vegetation condition for drought early warning systems in pastoral communities in Kenya. Remote Sensing of Environment, 248:111886, 2020.
- Relational inductive biases, deep learning, and graph networks. arxiv, 1806.01261, 2018.
- Vitus Benson. Code and pre-trained model weights for benson et. al., CVPR (2024) - multi-modal learning for geospatial vegetation forecasting. Zenodo, 10.5281/zenodo.10793870, 2024.
- Accurate medium-range global weather forecasting with 3d neural networks. Nature, 619(7970):533–538, 2023.
- Satmae: Pre-training transformers for temporal and multi-spectral satellite imagery. Advances in Neural Information Processing Systems, 35:197–211, 2022.
- An Ensemble Version of the E-OBS Temperature and Precipitation Data Sets. Journal of Geophysical Research: Atmospheres, 123(17):9391–9409, 2018.
- NASADEM global elevation model: Methods and progress. In The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, pages 125–128. Copernicus GmbH, 2016.
- Forecasting ndvi in multiple complex areas using neural network techniques combined feature engineering. International Journal of Digital Earth, 13(12):1733–1749, 2020.
- Challenges and opportunities of multimodality and data fusion in remote sensing. Proceedings of the IEEE, 103(9):1585–1601, 2015.
- Understanding the Role of Weather Data for Earth Surface Forecasting Using a ConvLSTM-Based Model. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1362–1371, 2022.
- KappaMask: AI-Based Cloudmask Processor for Sentinel-2. Remote Sensing, 13(20):4100, 2021.
- An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. In International Conference on Learning Representations, 2020.
- ESA. Copernicus DEM - Global and European Digital Elevation Model (COP-DEM). 2021.
- Forecasting vegetation indices from spatio-temporal remotely sensed data using deep learning-based approaches: A systematic literature review. Ecological Informatics, page 101552, 2022.
- Earthformer: Exploring Space-Time Transformers for Earth System Forecasting. In Advances in Neural Information Processing Systems, 2022a.
- SimVP: Simpler Yet Better Video Prediction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3170–3180, 2022b.
- Maskvit: Masked visual pre-training for video prediction. In The Eleventh International Conference on Learning Representations, 2023.
- Masked Autoencoders Are Scalable Vision Learners. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 16000–16009, 2022.
- Gaussian error linear units (gelus). arXiv, 1606.08415, 2023.
- Darts: User-Friendly Modern Machine Learning for Time Series. Journal of Machine Learning Research, 23(124):1–6, 2022.
- Long Short-Term Memory. Neural Computation, 9(8):1735–1780, 1997.
- ICOS RI. Ecosystem final quality (l2) product in etc-archive format - release 2022-1. station de-gri. 2022.
- Perceiver: General perception with iterative attention. In International conference on machine learning, pages 4651–4664. PMLR, 2021.
- Forecasting vegetation greenness with satellite and climate data. IEEE Geoscience and Remote Sensing Letters, 1(1):3–6, 2004.
- Forecasting vegetation index based on vegetation-meteorological factor interactions with artificial neural network. In 2016 Fifth International Conference on Agro-Geoinformatics (Agro-Geoinformatics), pages 1–6. IEEE, 2016.
- LightGBM: A Highly Efficient Gradient Boosting Decision Tree. In Advances in Neural Information Processing Systems. Curran Associates, Inc., 2017.
- Enhanced prediction of vegetation responses to extreme drought using deep learning and earth observation data. Ecological Informatics, 80:102474, 2024.
- Wildfire Danger Prediction and Understanding With Deep Learning. Geophysical Research Letters, 49(17):e2022GL099368, 2022.
- Identifying Dynamic Memory Effects on Vegetation State Using Recurrent Neural Networks. Frontiers in Big Data, 2, 2019.
- Learning skillful medium-range global weather forecasting. Science, page eadi2336, 2023.
- Stochastic Adversarial Video Prediction. arxiv, 1804.01523, 2018.
- Deep Learning for Vegetation Health Forecasting: A Case Study in Kenya. Remote Sensing, 14(3):698, 2022a.
- Deep learning for vegetation health forecasting: a case study in kenya. Remote Sensing, 14(3):698, 2022b.
- Deep learning in multimodal remote sensing data fusion: A comprehensive review. International Journal of Applied Earth Observation and Geoinformation, 112:102926, 2022.
- MMST-ViT: Climate Change-aware Crop Yield Prediction via Multi-Modal Spatial-Temporal Vision Transformer. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 5774–5784, 2023.
- Data Cubes for Earth System Research: Challenges Ahead. EarthArXiv, 5649, 2023.
- Decoupled Weight Decay Regularization. In International Conference on Learning Representations, 2022.
- SENTINEL-2 SEN2COR: L2A Processor for Users. In Proceedings Living Planet Symposium 2016, pages 1–8, Prague, Czech Republic, 2016. Spacebooks Online.
- A crossmodal multiscale fusion network for semantic segmentation of remote sensing data. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 15:3463–3474, 2022a.
- Forecasting vegetation dynamics in an open ecosystem by integrating deep learning and environmental variables. International Journal of Applied Earth Observation and Geoinformation, 114:103060, 2022b.
- Learning extreme vegetation response to climate forcing: A comparison of recurrent neural network architectures. EGUsphere, 2023:1–32, 2023.
- Towards Geospatial Foundation Models via Continual Pretraining. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 16806–16816, 2023.
- Cloud removal in Sentinel-2 imagery using a deep residual neural network and SAR-optical data fusion. ISPRS Journal of Photogrammetry and Remote Sensing, 166:333–346, 2020.
- Prediction of grass biomass from satellite imagery in Somali regional state, eastern Ethiopia. Heliyon, 6(10), 2020.
- Transframer: Arbitrary frame prediction with generative models. Trans. Mach. Learn. Res., 2023, 2022.
- River flow forecasting through conceptual models part I — A discussion of principles. Journal of Hydrology, 10(3):282–290, 1970.
- Climax: A foundation model for weather and climate. In 1st Workshop on the Synergy of Scientific and Machine Learning Modeling @ ICML2023, 2023.
- On the potential of Sentinel-2 for estimating Gross Primary Production. IEEE Transactions on Geoscience and Remote Sensing, pages 1–1, 2022.
- FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural Operators. AI for Earth and Space Science, workshop at ICLR 2022, 2202.11214, 2022.
- FiLM: Visual Reasoning with a General Conditioning Layer. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1), 2018.
- Fmask 4.0: Improved cloud and cloud shadow detection in Landsats 4–8 and Sentinel-2 imagery. Remote Sensing of Environment, 231:111205, 2019.
- WeatherBench: A Benchmark Data Set for Data-Driven Weather Forecasting. Journal of Advances in Modeling Earth Systems, 12(11):e2020MS002203, 2020.
- Skilful precipitation nowcasting using deep generative models of radar. Nature, 597(7878):672–677, 2021.
- Scale-MAE: A Scale-Aware Masked Autoencoder for Multiscale Geospatial Representation Learning. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 4088–4099, 2023.
- Predicting Landscapes from Environmental Conditions Using Generative Networks. In Pattern Recognition, pages 203–217, Cham, 2019. Springer International Publishing.
- EarthNet2021: A large-scale dataset and challenge for Earth surface forecasting as a guided video prediction task. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pages 1132–1142, 2021.
- Learning to forecast vegetation greenness at fine resolution over Africa with ConvLSTMs. Climate Change AI workshop at NeurIPS 2022, 2022.
- High-Resolution Image Synthesis With Latent Diffusion Models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 10684–10695, 2022.
- Phenological maps of Europe. Climate Research, 18(3):249–257, 2001.
- MobileNetV2: Inverted Residuals and Linear Bottlenecks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 4510–4520, 2018.
- Focal-TSMP: Deep learning for vegetation health prediction and agricultural drought assessment from a regional climate simulation. EGUsphere, pages 1–50, 2023.
- Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting. In Advances in Neural Information Processing Systems. Curran Associates, Inc., 2015.
- Deep Learning for Precipitation Nowcasting: A Benchmark and A New Model. In Advances in Neural Information Processing Systems. Curran Associates, Inc., 2017.
- EarthPT: A foundation model for Earth Observation. arxiv, 2309.07207, 2023.
- Combining sentinel-1 and sentinel-2 data for improved land use and land cover mapping of monsoon regions. International journal of applied earth observation and geoinformation, 73:595–604, 2018.
- U-TILISE: A Sequence-to-Sequence Model for Cloud Removal in Optical Satellite Time Series. 61:1–16.
- Satellite data reveal differential responses of Swiss forests to unprecedented 2018 drought. Global Change Biology, 28(9):2956–2978, 2022.
- Generation of the 30 m mesh global digital surface model by ALOS Prism. ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, XLI-B4:157–162, 2016.
- SimVP: Towards Simple yet Powerful Spatiotemporal Predictive Learning. arxiv, 2211.12509, 2023.
- Forecasting at Scale. The American Statistician, 72(1):37–45, 2018.
- Lightweight, Pre-trained Transformers for Remote Sensing Timeseries. arxiv, 2304.14065, 2023.
- Attention is All you Need. In Advances in Neural Information Processing Systems. Curran Associates, Inc., 2017.
- SAR-to-Optical Image Translation Using Supervised Cycle-Consistent Adversarial Networks. IEEE Access, 7:129136–129149, 2019.
- Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction Without Convolutions. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 568–578, 2021.
- PVT v2: Improved baselines with Pyramid Vision Transformer. Computational Visual Media, 8(3):415–424, 2022.
- PredRNN: Recurrent Neural Networks for Predictive Learning using Spatiotemporal LSTMs. In Advances in Neural Information Processing Systems. Curran Associates, Inc., 2017.
- PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(2):2208–2225, 2023.
- A new synergistic approach for monitoring wetlands using sentinels-1 and 2 data with object-based machine learning algorithms. Environmental Modelling & Software, 104:40–54, 2018.
- Estimating crop primary productivity with Sentinel-2 and Landsat 8 using machine learning methods trained with radiative transfer simulations. Remote Sensing of Environment, 225:441–457, 2019.
- Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting. In Advances in Neural Information Processing Systems, pages 22419–22430. Curran Associates, Inc., 2021a.
- MotionRNN: A Flexible Model for Video Prediction With Spacetime-Varying Motions. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 15435–15444, 2021b.
- Group Normalization. In Proceedings of the European Conference on Computer Vision (ECCV), pages 3–19, 2018.
- EarthNets: Empowering AI in Earth Observation. arxiv, 2210.04936, 2022.
- Empirical Evaluation of Rectified Activations in Convolutional Network. arxiv, 1505.00853, 2015.
- Deep Residual Network with Multi-Image Attention for Imputing Under Clouds in Satellite Imagery. In 2022 26th International Conference on Pattern Recognition (ICPR), pages 643–649, 2022.
- RingMo-Sense: Remote Sensing Foundation Model for Spatiotemporal Prediction via Spatiotemporal Evolution Disentangling. IEEE Transactions on Geoscience and Remote Sensing, pages 1–1, 2023.
- ESA WorldCover 10 m 2020 v100. 2021.
- Are Transformers Effective for Time Series Forecasting? Proceedings of the AAAI Conference on Artificial Intelligence, 37(9):11121–11128, 2023.
- Optical vegetation indices for monitoring terrestrial ecosystems globally. Nature Reviews Earth & Environment, pages 1–17, 2022.