Why not a thin plate spline for spatial models? A comparative study using Bayesian inference (2404.12756v1)
Abstract: Spatial modelling often uses Gaussian random fields to capture the stochastic nature of studied phenomena. However, this approach incurs significant computational burdens (O(n3)), primarily due to covariance matrix computations. In this study, we propose to use a low-rank approximation of a thin plate spline as a spatial random effect in Bayesian spatial models. We compare its statistical performance and computational efficiency with the approximated Gaussian random field (by the SPDE method). In this case, the dense matrix of the thin plate spline is approximated using a truncated spectral decomposition, resulting in computational complexity of O(kn2) operations, where k is the number of knots. Bayesian inference is conducted via the Hamiltonian Monte Carlo algorithm of the probabilistic software Stan, which allows us to evaluate performance and diagnostics for the proposed models. A simulation study reveals that both models accurately recover the parameters used to simulate data. However, models using a thin plate spline demonstrate superior execution time to achieve the convergence of chains compared to the models utilizing an approximated Gaussian random field. Furthermore, thin plate spline models exhibited better computational efficiency for simulated data coming from different spatial locations. In a real application, models using a thin plate spline as spatial random effect produced similar results in estimating a relative index of abundance for a benthic marine species when compared to models incorporating an approximated Gaussian random field. Although they were not the more computational efficient models, their simplicity in parametrization, execution time and predictive performance make them a valid alternative for spatial modelling under Bayesian inference.
- An explicit link between gaussian fields and gaussian markov random fields: the stochastic partial differential equation approach. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 73(4):423–498, 2011.
- Gaussian Markov random fields: theory and applications. CRC press, 2005.
- Approximate bayesian inference for latent gaussian models by using integrated nested laplace approximations. Journal of the royal statistical society: Series b (statistical methodology), 71(2):319–392, 2009.
- Bayesian spatial modelling with r-inla. Journal of statistical software, 63:1–25, 2015.
- The spde approach for gaussian and non-gaussian fields: 10 years and still running. Spatial Statistics, 50:100599, 2022.
- Accounting for spatial dependence improves relative abundance estimates in a benthic marine species structured as a metapopulation. Fisheries Research, 240:105960, 2021.
- Interpolation of spatial data–a stochastic or a deterministic problem? European Journal of Applied Mathematics, 24(04):601–629, 2013.
- Holger Wendland. Scattered data approximation, volume 17. Cambridge university press, 2004.
- Geostatistics: modeling spatial uncertainty, volume 497. John Wiley & Sons, 2009.
- Basis-function models in spatial statistics. Annual Review of Statistics and Its Application, 9:373–400, 2022.
- Yuedong Wang. Smoothing splines: methods and applications. CRC Press, 2011.
- Grace Wahba. Smoothing noisy data with spline functions. Numerische mathematik, 24(5):383–393, 1975.
- Smoothing noisy data with spline functions. Numerische Mathematik, 47:99–106, 1985.
- Smoothing noisy data with spline functions: estimating the correct degree of smoothing by the method of generalized cross-validation. Numerische mathematik, 31(4):377–403, 1978.
- Thin plate spline model under skew-normal random errors: estimation and diagnostic analysis for spatial data. Journal of Statistical Computation and Simulation, 93(1):25–45, 2023.
- Simon N Wood. Thin plate regression splines. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 65(1):95–114, 2003.
- Simon N Wood. Generalized additive models: an introduction with R. CRC press, 2017.
- Grace Wahba. Bayesian “confidence intervals” for the cross-validated smoothing spline. Journal of the Royal Statistical Society: Series B (Methodological), 45(1):133–150, 1983.
- Douglas Nychka. Bayesian confidence intervals for smoothing splines. Journal of the American Statistical Association, 83(404):1134–1143, 1988.
- A correspondence between bayesian estimation on stochastic processes and smoothing by splines. The Annals of Mathematical Statistics, 41(2):495–502, 1970.
- Gentry White. Bayesian semiparametric spatial and joint spatio-temporal modeling. PhD thesis, University of Missouri–Columbia, 2006.
- Direct sampling of bayesian thin-plate splines for spatial smoothing. arXiv preprint arXiv:1906.05575, 2019.
- R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, 2021. URL https://www.R-project.org/.
- The no-u-turn sampler: adaptively setting path lengths in hamiltonian monte carlo. J. Mach. Learn. Res., 15(1):1593–1623, 2014.
- Stan: A probabilistic programming language for bayesian inference and optimization. Journal of Educational and Behavioral Statistics, 40(5):530–543, 2015.
- Stan: A probabilistic programming language. Journal of statistical software, 76(1):1–32, 2017.
- Tmb: automatic differentiation and laplace approximation. arXiv preprint arXiv:1509.00660, 2015.
- No-u-turn sampling for fast bayesian inference in admb and tmb: Introducing the adnuts and tmbstan r packages. PloS one, 13(5):e0197954, 2018.
- Michael L Stein. Interpolation of spatial data: some theory for kriging. Springer Science & Business Media, 2012.
- Chengwei Yuan. Models and methods for computationally efficient analysis of large spatial and spatio-temporal data. University of New Hampshire, 2011.
- Spatial and spatio-temporal Bayesian models with R-INLA. John Wiley & Sons, 2015.
- Peter Whittle. On stationary processes in the plane. Biometrika, pages 434–449, 1954.
- Peter Whittle. Stochastic-processes in several dimensions. Bulletin of the International Statistical Institute, 40(2):974–994, 1963.
- Martin D Buhmann. Radial basis functions: theory and implementations, volume 12. Cambridge university press, 2003.
- Grace Wahba. Spline models for observational data, volume 59. Siam, 1990.
- Nonparametric regression and generalized linear models: a roughness penalty approach. Crc Press, 1993.
- Rank-normalization, folding, and localization: An improved rhat for assessing convergence of mcmc. Bayesian Analysis, 2020.
- Bayesian data analysis (vol. 2), 2014.
- Stan Development Team. RStan: the R interface to Stan, 2024. URL https://mc-stan.org/. R package version 2.32.6.
- Quantitative fisheries stock assessment: choice, dynamics and uncertainty. Springer Science & Business Media, 2013.
- John J Ney. Practical use of biological statistics. Inland fisheries management in North America. American Fisheries Society, Bethesda, Maryland, pages 137–158, 1993.
- Standardizing catch and effort data: a review of recent approaches. Fisheries research, 70(2-3):141–159, 2004.
- Stan Development Team et al. Stan modeling language users guide and reference manual, version 2.18. 0, 2018.
- Practical bayesian model evaluation using leave-one-out cross-validation and waic. Statistics and computing, 27:1413–1432, 2017.