Deep Generative Data Assimilation in Multimodal Setting (2404.06665v3)
Abstract: Robust integration of physical knowledge and data is key to improve computational simulations, such as Earth system models. Data assimilation is crucial for achieving this goal because it provides a systematic framework to calibrate model outputs with observations, which can include remote sensing imagery and ground station measurements, with uncertainty quantification. Conventional methods, including Kalman filters and variational approaches, inherently rely on simplifying linear and Gaussian assumptions, and can be computationally expensive. Nevertheless, with the rapid adoption of data-driven methods in many areas of computational sciences, we see the potential of emulating traditional data assimilation with deep learning, especially generative models. In particular, the diffusion-based probabilistic framework has large overlaps with data assimilation principles: both allows for conditional generation of samples with a Bayesian inverse framework. These models have shown remarkable success in text-conditioned image generation or image-controlled video synthesis. Likewise, one can frame data assimilation as observation-conditioned state calibration. In this work, we propose SLAMS: Score-based Latent Assimilation in Multimodal Setting. Specifically, we assimilate in-situ weather station data and ex-situ satellite imagery to calibrate the vertical temperature profiles, globally. Through extensive ablation, we demonstrate that SLAMS is robust even in low-resolution, noisy, and sparse data settings. To our knowledge, our work is the first to apply deep generative framework for multimodal data assimilation using real-world datasets; an important step for building robust computational simulators, including the next-generation Earth system models. Our code is available at: https://github.com/yongquan-qu/SLAMS
- Deep data assimilation: integrating deep learning with data assimilation. Applied Sciences, 11(3):1114, 2021.
- Layer normalization. arXiv preprint arXiv:1607.06450, 2016.
- Variational autoencoder or generative adversarial networks? a comparison of two deep learning methods for flow and transport data assimilation. Mathematical Geosciences, 54(6):1017–1042, 2022.
- Conditional image generation with score-based diffusion models. arXiv preprint arXiv:2111.13606, 2021.
- History-based, bayesian, closure for stochastic parameterization: Application to lorenz’96. arXiv preprint arXiv:2210.14488, 2022.
- Combining data assimilation and machine learning to infer unresolved scale parametrization. Philosophical Transactions of the Royal Society A, 379(2194):20200086, 2021.
- Data learning: Integrating data assimilation and machine learning. Journal of Computational Science, 58:101525, 2022.
- Data assimilation in the geosciences: An overview of methods, issues, and perspectives. Wiley Interdisciplinary Reviews: Climate Change, 9(5):e535, 2018.
- Assessing objective techniques for gauge-based analyses of global daily precipitation. Journal of Geophysical Research: Atmospheres, 113(D4), 2008.
- Autodifferentiable ensemble kalman filters. SIAM Journal on Mathematics of Data Science, 4(2):801–833, 2022.
- Generalised latent assimilation in heterogeneous reduced spaces with machine learning surrogate models. Journal of Scientific Computing, 94(1):11, 2023.
- Diffusion posterior sampling for general noisy inverse problems. arXiv preprint arXiv:2209.14687, 2022.
- Bradley Efron. Tweedie’s formula and selection bias. Journal of the American Statistical Association, 106(496):1602–1614, 2011.
- Sigmoid-weighted linear units for neural network function approximation in reinforcement learning. Neural networks, 107:3–11, 2018.
- Online model error correction with neural networks in the incremental 4d-var framework. Journal of Advances in Modeling Earth Systems, 15(9):e2022MS003474, 2023.
- Towards artificial general intelligence via a multimodal foundation model. Nature Communications, 13(1):3094, 2022.
- Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016.
- The era5 global reanalysis. Quarterly Journal of the Royal Meteorological Society, 146(730):1999–2049, 2020.
- Diffda: a diffusion model for weather-scale data assimilation. arXiv preprint arXiv:2401.05932, 2024.
- An investigation of the sensitivity of the clear-sky outgoing longwave radiation to atmospheric temperature and water vapor. Journal of Geophysical Research: Atmospheres, 112(D5), 2007.
- Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International conference on machine learning, pages 448–456. pmlr, 2015.
- Position, padding and predictions: A deeper look at position information in cnns. arXiv preprint arXiv:2101.12322, 2021.
- On the representation error in data assimilation. Quarterly Journal of the Royal Meteorological Society, 144(713):1257–1278, 2018.
- Neural general circulation models. arXiv preprint arXiv:2311.07222, 2023.
- Data assimilation: making sense of earth observation. Frontiers in Environmental Science, 2:16, 2014.
- A machine learning approach to the observation operator for satellite radiance data assimilation. Journal of the Meteorological Society of Japan. Ser. II, 101(1):79–95, 2023.
- Description of a complete (interpolated) outgoing longwave radiation dataset. Bulletin of the American Meteorological Society, 77(6):1275–1277, 1996.
- Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101, 2017.
- Towards a foundation model for geospatial artificial intelligence (vision paper). In Proceedings of the 30th International Conference on Advances in Geographic Information Systems, pages 1–4, 2022.
- On the opportunities and challenges of foundation models for geospatial artificial intelligence. arXiv preprint arXiv:2304.06798, 2023.
- Thermal equilibrium of the atmosphere with a convective adjustment. Journal of the Atmospheric Sciences, 21(4):361–385, 1964.
- Thermal equilibrium of the atmosphere with a given distribution of relative humidity. 1967.
- Deep learning for latent space data assimilation in subsurface flow systems. SPE Journal, 27(05):2820–2840, 2022.
- Gaspard Monge. Mémoire sur la théorie des déblais et des remblais. Mem. Math. Phys. Acad. Royale Sci., pages 666–704, 1781.
- Rectified linear units improve restricted boltzmann machines. In Proceedings of the 27th international conference on machine learning (ICML-10), pages 807–814, 2010.
- Metaflux: Meta-learning global carbon fluxes from sparse spatiotemporal observations. Scientific Data, 10(1):440, 2023.
- Chaosbench: A multi-channel, physics-based benchmark for subseasonal-to-seasonal climate prediction. arXiv preprint arXiv:2402.00712, 2024.
- Conditional image-to-video generation with latent flow diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 18444–18455, 2023.
- The scenario model intercomparison project (scenariomip) for cmip6. Geoscientific Model Development, 9(9):3461–3482, 2016.
- Latent space data assimilation by using deep learning. Quarterly Journal of the Royal Meteorological Society, 147(740):3759–3777, 2021.
- Can a machine learning–enabled numerical model help extend effective forecast range through consistently trained subgrid-scale models? Artificial Intelligence for the Earth Systems, 2(1):e220050, 2023.
- Joint parameter and parameterization inference with uncertainty quantification through differentiable programming. arXiv preprint arXiv:2403.02215, 2024.
- Deep learning and process understanding for data-driven earth system science. Nature, 566(7743):195–204, 2019.
- Kalmannet: Neural network aided kalman filtering for partially known dynamics. IEEE Transactions on Signal Processing, 70:1532–1547, 2022.
- High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10684–10695, 2022.
- U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18, pages 234–241. Springer, 2015.
- Score-based data assimilation. arXiv preprint arXiv:2306.10574, 2023.
- Photorealistic text-to-image diffusion models with deep language understanding. Advances in Neural Information Processing Systems, 35:36479–36494, 2022.
- Applied stochastic differential equations. Cambridge University Press, 2019.
- Score-based generative modeling through stochastic differential equations. arXiv preprint arXiv:2011.13456, 2020.
- Neural discrete representation learning. Advances in neural information processing systems, 30, 2017.
- Pascal Vincent. A connection between score matching and denoising autoencoders. Neural computation, 23(7):1661–1674, 2011.
- A multimodel data assimilation framework via the ensemble kalman filter. Water Resources Research, 50(5):4197–4219, 2014.
- Fast sampling of diffusion models with exponential integrator. arXiv preprint arXiv:2204.13902, 2022.
- Defeat: Deep hidden feature backdoor attacks by imperceptible perturbation and latent representation constraints. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 15213–15222, 2022.