Spatio-temporal DeepKriging for Interpolation and Probabilistic Forecasting (2306.11472v1)
Abstract: Gaussian processes (GP) and Kriging are widely used in traditional spatio-temporal mod-elling and prediction. These techniques typically presuppose that the data are observed from a stationary GP with parametric covariance structure. However, processes in real-world applications often exhibit non-Gaussianity and nonstationarity. Moreover, likelihood-based inference for GPs is computationally expensive and thus prohibitive for large datasets. In this paper we propose a deep neural network (DNN) based two-stage model for spatio-temporal interpolation and forecasting. Interpolation is performed in the first step, which utilizes a dependent DNN with the embedding layer constructed with spatio-temporal basis functions. For the second stage, we use Long-Short Term Memory (LSTM) and convolutional LSTM to forecast future observations at a given location. We adopt the quantile-based loss function in the DNN to provide probabilistic forecasting. Compared to Kriging, the proposed method does not require specifying covariance functions or making stationarity assumption, and is computationally efficient. Therefore, it is suitable for large-scale prediction of complex spatio-temporal processes. We apply our method to monthly $PM_{2.5}$ data at more than $200,000$ space-time locations from January 1999 to December 2022 for fast imputation of missing values and forecasts with uncertainties.
- The second competition on spatial statistics for large datasets. Journal of Data Science 20(4), 439–460.
- Amari, S.-i. (1993). Backpropagation and stochastic gradient descent method. Neurocomputing 5(4-5), 185–196.
- A novel framework for spatio-temporal prediction of environmental data using deep learning. Scientific reports 10(1), 1–11.
- The statistics of peaks of Gaussian random fields. The Astrophysical Journal 304, 15–61.
- Bartlett, M. S. (2013). The statistical analysis of spatial pattern, Volume 15. Springer Science & Business Media.
- Machine learning for data-driven discovery in solid earth geoscience. Science 363(6433), eaau0323.
- Binkowski, F. S. and S. J. Roselle (2003). Models-3 community multiscale air quality (CMAQ) model aerosol component 1. model description. Journal of Geophysical Research: Atmospheres 108, D6.
- Spatial and spatio-temporal models with r-inla. Spatial and spatio-temporal epidemiology 4, 33–49.
- A survey on multi-output regression. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 5(5), 216–233.
- A simple non-separable, non-stationary spatiotemporal model for ozone. Environmental and ecological statistics 16, 515–529.
- Buhmann, M. D. (2000). Radial basis functions. Acta numerica 9, 1–38.
- Financial time series forecasting model based on CEEMDAN and LSTM. Physica A: Statistical mechanics and its applications 519, 127–139.
- Space-time covariance structures and models. Annual Review of Statistics and Its Application 8, 191–215.
- Deepkriging: Spatially dependent deep neural networks for spatial prediction. Accepted, Statistica Sinica 1, to appear.
- Chimmula, V. K. R. and L. Zhang (2020). Time series forecasting of COVID-19 transmission in Canada using LSTM networks. Chaos, Solitons & Fractals 135, 109864.
- L2 regularization for learning kernels.
- Cressie, N. and H.-C. Huang (1999). Classes of nonseparable, spatio-temporal stationary covariance functions. Journal of the American Statistical Association 94(448), 1330–1339.
- Fixed rank kriging for very large spatial data sets. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 70(1), 209–226.
- Cressie, N. and C. K. Wikle (2015). Statistics for spatio-temporal data. John Wiley & Sons.
- An overview of univariate and multivariate Karhunen Loève Expansions in statistics. Journal of the Indian Society for Probability and Statistics 23, 285–326.
- Space–time analysis using a general product–sum model. Statistics & Probability Letters 52(1), 21–28.
- Time series forecasting using lstm networks: A symbolic approach.
- Solar irradiation prediction with machine learning: Forecasting models selection method depending on weather variability. Energy 165, 620–629.
- A class of nonseparable and nonstationary spatial temporal covariance functions. Environmetrics: The official journal of the International Environmetrics Society 19(5), 487–507.
- A review of explainable and interpretable ai with applications in covid-19 imaging. Medical Physics 49(1), 1–14.
- Gneiting, T. (2002). Nonseparable, stationary covariance functions for space–time data. Journal of the American Statistical Association 97(458), 590–600.
- Probabilistic forecasting. Annual Review of Statistics and Its Application 1, 125–151.
- GpGp: fast Gaussian process computation using Vecchia’s approximation. R. package version 0.4. 0.
- The elements of statistical learning. Springer series in statistics. Springer.
- Neural networks for machine learning lecture 6a overview of mini-batch gradient descent. Cited on 14(8), 2.
- Long short-term memory. Neural computation 9(8), 1735–1780.
- Forecasting high-frequency spatio-temporal wind power with dimensionally reduced echo state networks.
- Huang, H.-C. and N.-J. Hsu (2004). Modeling transport effects on ground-level ozone using a non-stationary space–time model. Environmetrics: The official journal of the International Environmetrics Society 15(3), 251–268.
- Towards neural earth system modelling by integrating artificial intelligence in earth system science. Nature Machine Intelligence 3(8), 667–674.
- Jaeger, H. (2007). Echo state network. scholarpedia 2(9), 2330.
- Ketkar, N. (2017). Stochastic gradient descent. In Deep learning with Python, pp. 113–132. Springer.
- Spatio-temporal graph deep neural network for short-term wind speed forecasting. IEEE Transactions on Sustainable Energy 10(2), 670–681.
- Koenker, R. (2004). Quantile regression for longitudinal data. Journal of Multivariate Analysis 91(1), 74–89.
- Regression quantiles. Econometrica 46(1), 33–50.
- Quantile smoothing splines. Biometrika 81(4), 673–680.
- Methods for generating non-separable spatiotemporal covariance models with potential environmental applications. Advances in Water Resources 27(8), 815–830.
- Improving the accuracy of rainfall rates from optical satellite sensors with machine learning—a random forests-based approach applied to MSG SEVIRI. Remote Sensing of Environment 141, 129–143.
- Lau, M. M. and K. H. Lim (2018). Review of adaptive activation function in deep neural network. In 2018 IEEE-EMBS Conference on Biomedical Engineering and Sciences (IECBES), pp. 686–690. IEEE.
- L1-norm quantile regression. Journal of Computational and Graphical Statistics 17(1), 163–185.
- Smart deep learning based wind speed prediction model using wavelet packet decomposition, convolutional neural network and convolutional long short term memory network. Energy Conversion and Management 166, 120–131.
- Deep neural network based feature representation for weather forecasting. In Proceedings on the International Conference on Artificial Intelligence (ICAI), pp. 1. The Steering Committee of The World Congress in Computer Science.
- Application of deep convolutional neural networks for detecting extreme weather in climate datasets.
- Reservoir computing approaches to recurrent neural network training. Computer science review 3(3), 127–149.
- Ma, C. (2002). Spatio-temporal covariance functions generated by mixtures. Mathematical geology 34, 965–975.
- Valid model-free spatial prediction. Journal of the American Statistical Association 0, 1–28.
- McDermott, P. L. and C. K. Wikle (2017). An ensemble quadratic echo state network for non-linear spatio-temporal forecasting. Stat 6(1), 315–330.
- Recurrent neural networks. Design and Applications 5, 64–67.
- The explanation game: Explaining machine learning models using shapley values. In Machine Learning and Knowledge Extraction: 4th IFIP TC 5, TC 12, WG 8.4, WG 8.9, WG 12.9 International Cross-Domain Conference, CD-MAKE 2020, Dublin, Ireland, August 25–28, 2020, Proceedings 4, pp. 17–38. Springer.
- Quantifying the air quality and health benefits of greening freight movements. Environmental Research 183, 109193.
- Spatial and spatio-temporal geostatistical modeling and kriging. John Wiley & Sons.
- Learning multiple quantiles with neural networks. Journal of Computational and Graphical Statistics 30(4), 1238–1248.
- Deep learning applications and challenges in big data analytics. Journal of Big Data 2(1), 1–21.
- A multiresolution Gaussian process model for the analysis of large spatial datasets. Journal of Computational and Graphical Statistics 24(2), 579–599.
- Emergency admissions for cardiovascular and respiratory diseases and the chemical composition of fine particle air pollution. Environmental health perspectives 117(6), 957–963.
- Increased particulate air pollution and the triggering of myocardial infarction. Circulation 103(23), 2810–2815.
- New classes of covariance and spectral density functions for spatio-temporal modelling. Stochastic Environmental Research and Risk Assessment 22(Suppl 1), 65–79.
- Sigmoid activation function in selecting the best model of artificial neural networks. In Journal of Physics: Conference Series, Volume 1471, pp. 012010. IOP Publishing.
- Time series forecasting of petroleum production using deep lstm recurrent networks. Neurocomputing 323, 203–213.
- High performance multivariate geospatial statistics on manycore systems. IEEE Transactions on Parallel and Distributed Systems 32(11), 2719–2733.
- Schmidt-Hieber, J. (2020). Nonparametric regression using deep neural networks with ReLU activation function. The Annals of Statistics 48(4), 1875–1897.
- Convolutional lstm network: A machine learning approach for precipitation nowcasting. In C. Cortes, N. Lawrence, D. Lee, M. Sugiyama, and R. Garnett (Eds.), Advances in Neural Information Processing Systems, Volume 28. Curran Associates, Inc.
- A dynamic nonstationary spatio-temporal model for short term prediction of precipitation. The Annals of Applied Statistics 6(4), 1452 – 1477.
- Stein, M. L. (2005). Space–time covariance functions. Journal of the American Statistical Association 100(469), 310–321.
- Dynamic models for spatiotemporal data. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 63(4), 673–689.
- Geostatistics for large datasets. In E. Porcu, J.-M. Montero, and M. Schlather (Eds.), Advances and Challenges in Space-time Modelling of Natural Events, Berlin, Heidelberg, pp. 55–77. Springer Berlin Heidelberg.
- Vidakovic, B. (2009). Statistical modeling by wavelets, Volume 503. John Wiley & Sons.
- A survey of l1 regression. International Statistical Review 81(3), 361–387.
- Wahba, G. (1990). Spline models for observational data. SIAM.
- Hierarchical bayesian space-time models. Environmental and ecological statistics 5, 117–154.
- Statistical deep learning for spatial and spatio-temporal data.
- Wu, J. (2017). Introduction to convolutional neural networks. National Key Lab for Novel Software Technology. Nanjing University. China 5(23), 495.
- Improved latent space approach for modelling non-stationary spatial–temporal random fields. Spatial Statistics 23, 160–181.
- Zafar, M. R. and N. M. Khan (2019). Dlime: A deterministic local interpretable model-agnostic explanations approach for computer-aided diagnosis systems.
- Frk: An r package for spatial and spatio-temporal prediction with large datasets. Journal of Statistical Software 98, 1–48.
- Zammit-Mangion, A. and C. K. Wikle (2020). Deep integro-difference equation models for spatio-temporal forecasting. Spatial Statistics 37, 100408.
- Joint deep learning for land cover and land use classification. Remote sensing of environment 221, 173–187.
- Missing data reconstruction in remote sensing image with a unified spatial–temporal–spectral deep convolutional neural network. IEEE Transactions on Geoscience and Remote Sensing 56(8), 4274–4288.