A Statistical Learning View of Simple Kriging (2202.07365v5)
Abstract: In the Big Data era, with the ubiquity of geolocation sensors in particular, massive datasets exhibiting a possibly complex spatial dependence structure are becoming increasingly available. In this context, the standard probabilistic theory of statistical learning does not apply directly and guarantees of the generalization capacity of predictive rules learned from such data are left to establish. We analyze here the simple Kriging task from a statistical learning perspective, i.e. by carrying out a nonparametric finite-sample predictive analysis. Given $d\geq 1$ values taken by a realization of a square integrable random field $X={X_s}{s\in S}$, $S\subset \mathbb{R}2$, with unknown covariance structure, at sites $s_1,\; \ldots,\; s_d$ in $S$, the goal is to predict the unknown values it takes at any other location $s\in S$ with minimum quadratic risk. The prediction rule being derived from a training spatial dataset: a single realization $X'$ of $X$, independent from those to be predicted, observed at $n\geq 1$ locations $\sigma_1,\; \ldots,\; \sigma_n$ in $S$. Despite the connection of this minimization problem with kernel ridge regression, establishing the generalization capacity of empirical risk minimizers is far from straightforward, due to the non independent and identically distributed nature of the training data $X'{\sigma_1},\; \ldots,\; X'{\sigma_n}$ involved in the learning procedure. In this article, non-asymptotic bounds of order $O{\mathbb{P}}(1/\sqrt{n})$ are proved for the excess risk of a plug-in predictive rule mimicking the true minimizer in the case of isotropic stationary Gaussian processes, observed at locations forming a regular grid in the learning stage. These theoretical results are illustrated by various numerical experiments, on simulated data and on real-world datasets.
- Concentration Inequalities for Sums and Martingales. Springer, Cham, 2015. doi: 10.1007/978-3-319-22099-4.
- Concentration Inequalities: A Nonasymptotic Theory of Independence. OUP Oxford, London, U.K., 2013.
- Time Series: Theory and Methods. Springer, New York, NY, 1987.
- Geostatistics: Modeling Spatial Uncertainty. Wiley, New York, 1999. ISBN 0471083151 9780471083153.
- Statistical Learning Based on Markovian Data: Maximal Deviation Inequalities and Learning Rates. The Annals of Mathematics and Artificial Intelligence, 2019.
- Cressie, N. Statistics for Spatial Data, pp. 1–26. John Wiley and Sons, Ltd, New York, NY, 1993. ISBN 9781119115151. doi: 10.1002/9781119115151.ch1.
- A Probabilistic Theory of Pattern Recognition, volume 31. Springer Science & Business Media, New York, NY, 1996.
- Non Parametric Estimation of Smooth Stationary Covariance Functions by Interpolation Methods. Statistical Inference for Stochastic Processes, 11(2):177–205, June 2008. doi: 10.1007/s11203-007-9014-z.
- Spatial Statistics and Modeling. Springer Series in Statistics. Springer New York, New York, NY, 2009. ISBN 9780387922577.
- Golubov, B. I. On Abel—Poisson Type and Riesz Means. Analysis Mathematica, 7(3):161–184, Sep 1981. doi: 10.1007/BF01908520.
- A Distribution-Free Theory of Nonparametric Regression. Springer, New York, NY, 2002.
- Properties of Nonparametric Estimators of Autocovariance for Stationary Random Fields. Probability Theory and Related Fields, 99(3):399–424, Sep 1994. doi: 10.1007/BF01199899.
- On the Nonparametric Estimation of Covariance Functions. The Annals of Statistics, 22(4):2115–2134, 1994.
- Hanneke, S. Learning Whenever Learning is Possible: Universal Learning under General Stochastic Processes. arXiv:1706.01418, 2017.
- Harville, D. A. Matrix Algebra From a Statistician’s Perspective. Technometrics, 40(2):164–164, 1998. doi: 10.1080/00401706.1998.10485214. URL https://doi.org/10.1080/00401706.1998.10485214.
- Matrix Analysis. Cambridge university press, Cambridge, U.K., 2012.
- Gaussian Processes and Kernel Methods: A Review on Connections and Equivalences. arXiv preprint arXiv:1807.02582, 2018.
- Krige, D. G. A Statistical Approach to Some Basic Mine Valuation Problems on the Witwatersrand. Journal of the Southern African Institute of Mining and Metallurgy, 52(6):119–139, 1951.
- Generalization Bounds for Time Series Prediction with Non-stationary Processes. In Proceedings of ALT’14, 2014.
- Lahiri, S. Asymptotic Distribution of the Empirical Spatial Cumulative Distribution Function Predictor and Prediction Bands based on a Subsampling Method. Probability Theory and Related Fields, 114(1):55–84, 1999. doi: 10.1007/s004400050221.
- Prediction of Spatial Cumulative Distribution Functions using Subsampling. Journal of the American Statistical Association, 94(445):86–97, 1999. doi: 10.1080/01621459.1999.10473821.
- Learning Subgaussian Classes: Upper and Minimax Bounds. 2016.
- Loukas, A. How Close Are the Eigenvectors of the Sample and Actual Covariance Matrices? In Proceedings of the 34th International Conference on Machine Learning, volume 70 of Proceedings of Machine Learning Research, pp. 2228–2237, International Convention Centre, Sydney, 06–11 Aug 2017. PMLR. URL https://proceedings.mlr.press/v70/loukas17a.html.
- Risk Minimization by Median-of-means Tournaments. arXiv preprint arXiv:1608.00757, 2016.
- Maximum Likelihood Estimation of Models for Residual Covariance in Spatial Regression. Biometrika, 71(1):135–146, 04 1984. ISSN 0006-3444. doi: 10.1093/biomet/71.1.135.
- Matheron, G. Traité de Géostatistique Appliquée. Tome 1. Number 14 in Mémoires du BRGM. Tecnip, Paris, 1962.
- Optimal Designs for Variogram Estimation. Environmetrics, 10:23–37, 1999.
- Geostat-framework/gstools. zenodo., 2020.
- On the Effect of Covariance Function Estimation on the Accuracy of Kriging Predictors. Bernoulli, 7(3):421–438, 06 2001.
- Compressive Kriging Using Multi-Dimensional Generalized Nested Sampling. In 2018 52nd Asilomar Conference on Signals, Systems, and Computers, pp. 84–88, 2018. doi: 10.1109/ACSSC.2018.8645258.
- Nonparametric Estimation of the Moments of a General Statistic Computed from Spatial Data. Journal of the American Statistical Association, 89(426):496–500, 1994. doi: 10.1080/01621459.1994.10476773.
- Shi, L. Bounds on the (Laplacian) Spectral Radius of Graphs. Linear algebra and its applications, 422(2-3):755–770, 2007.
- Spielman, D. A. Spectral Graph Theory, 2012. URL http://www.cs.yale.edu/homes/spielman/561/2012/index.html. Lecture 3, The Adjacency Matrix and the nth Eigenvalue.
- Affine-Invariant Integrated Rank-Weighted Depth: Definition, Properties and Finite Sample Analysis. arXiv preprint arXiv:2106.11068, 2021.
- Stein, M. L. Interpolation of Spatial Data. Springer Series in Statistics. Springer-Verlag, New York, 1999. ISBN 0-387-98629-4. doi: 10.1007/978-1-4612-1494-6. Some theory for Kriging.
- Support Vector Machines. Springer Science & Business Media, New York, NY, 2008.
- Fast Learning from non-i.i.d. Observations. NIPS, pp. 1768–1776, 2009.
- Learning from Dependent Observations. Journal of Multivariate Analysis, 100(1):175–194, 2009.
- Tikhonov, A. On the stability of inverse problems. Dokl. Akad. Nauk SSSR, 39(5):195–198, 1943.
- Vershynin, R. How Close is the Sample Covariance Matrix to the Actual Covariance Matrix? Journal of Theoretical Probability, 25(3):655–686, 2012.
- Some Inequalities for the Eigenvalues of the Product of Positive Semidefinite Hermitian Matrices. Linear algebra and its applications, 160:113–118, 1992. doi: 10.1016/0024-3795(92)90442-D.
- Tail Bounds for Sum of Gamma Variables and Related Inferences. Communications in Statistics - Theory and Methods, 0(0):1–10, 2020. doi: 10.1080/03610926.2020.1756329.
- Wedin, P.-Å. Perturbation Theory for Pseudo-Inverses. BIT Numerical Mathematics, 13(2):217–232, 1973.
- Inequalities for Selected Eigenvalues of the Product of Matrices. arXiv preprint arXiv:1905.03821, 2019. doi: 10.1090/proc/14529.
- Mean Squared Prediction Error in the Spatial Linear Model with Estimated Covariance Parameters. Annals of the Institute of Statistical Mathematics, 44:27–43, 02 1992. doi: 10.1007/BF00048668.
- Zimmerman, D. L. Computationally Exploitable Structure of Covariance Matrices and Generalized Covariance Matrices in Spatial Models. Journal of Statistical Computation and Simulation, 32(1-2):1–15, 1989. doi: 10.1080/00949658908811149.