Fast Adaptive Fourier Integration for Spectral Densities of Gaussian Processes (2404.19053v1)
Abstract: The specification of a covariance function is of paramount importance when employing Gaussian process models, but the requirement of positive definiteness severely limits those used in practice. Designing flexible stationary covariance functions is, however, straightforward in the spectral domain, where one needs only to supply a positive and symmetric spectral density. In this work, we introduce an adaptive integration framework for efficiently and accurately evaluating covariance functions and their derivatives at irregular locations directly from \textit{any} continuous, integrable spectral density. In order to make this approach computationally tractable, we employ high-order panel quadrature, the nonuniform fast Fourier transform, and a Nyquist-informed panel selection heuristic, and derive novel algebraic truncation error bounds which are used to monitor convergence. As a result, we demonstrate several orders of magnitude speedup compared to naive uniform quadrature approaches, allowing us to evaluate covariance functions from slowly decaying, singular spectral densities at millions of locations to a user-specified tolerance in seconds on a laptop. We then apply our methodology to perform gradient-based maximum likelihood estimation using a previously numerically infeasible long-memory spectral model for wind velocities below the atmospheric boundary layer.
- Uniform approximation of common Gaussian process kernels using equispaced Fourier grids. arXiv preprint arXiv:2305.11065.
- A parallel nonuniform fast Fourier transform library based on an “exponential of semicircle” kernel. SIAM Journal on Scientific Computing, 41(5):C479–C504.
- Scalable computations for nonstationary Gaussian processes. Statistics and Computing, 33(4):84.
- Regular variation. Number 27. Cambridge university press.
- Knitro: an integrated package for nonlinear optimization. In Di Pillo, G. and Roma, M., editors, Large-Scale Nonlinear Optimization, Nonconvex Optimization and Its Applications, pages 35–59. Springer US, Boston, MA.
- Linear-cost covariance functions for Gaussian random fields. Journal of the American Statistical Association, 118(541):147–164.
- Algorithms to numerically evaluate the Hankel transform. Computers & Mathematics with Applications, 26(1):1–12.
- Classes of nonseparable, spatio-temporal stationary covariance functions. Journal of the American Statistical association, 94(448):1330–1339.
- Fast Fourier transforms for nonequispaced data. SIAM Journal on Scientific computing, 14(6):1368–1393.
- Scalable Gaussian process computations using hierarchical matrices. Journal of Computational and Graphical Statistics, 29(2):227–237.
- Flexible nonstationary spatiotemporal modeling of high-frequency monitoring data. Environmetrics, 32(5):e2670.
- Fitting Matérn smoothness parameters using automatic differentiation. Statistics and Computing, 33(2):48.
- A scalable method to exploit screening in Gaussian process models with noise. Journal of Computational and Graphical Statistics, (just-accepted):1–19.
- A fast algorithm for the calculation of the roots of special functions. SIAM Journal on Scientific Computing, 29(4):1420–1438.
- Gonnet, P. (2012). A review of error estimation in adaptive quadrature. ACM Computing Surveys (CSUR), 44(4):1–36.
- The fast sinc transform and image reconstruction from nonuniform samples in k-space. Communications in Applied Mathematics and Computational Science, 1(1):121–131.
- Equispaced Fourier representations for efficient Gaussian process regression from a billion data points. arXiv preprint arXiv:2210.10210.
- Evaluating derivatives: principles and techniques of algorithmic differentiation. SIAM.
- Guinness, J. (2019). Spectral density estimation for random fields via periodic embeddings. Biometrika, 106(2):267–286.
- Guinness, J. (2021). Gaussian process learning via Fisher scoring of Vecchia’s approximation. Statistics and Computing, 31(3):25.
- Fast and accurate computation of Gauss–Legendre and Gauss–Jacobi quadrature nodes and weights. SIAM Journal on Scientific Computing, 35(2):A652–A674.
- Variational Fourier features for Gaussian processes. Journal of Machine Learning Research, 18(151):1–52.
- Half-spectral space–time covariance models. Spatial Statistics, 19:90–100.
- Semiparametric estimation of spectral density with irregular observations. Journal of the American Statistical Association, 102(478):726–735.
- Johansson, F. (2017). Arb: efficient arbitrary-precision midpoint-radius interval arithmetic. IEEE T Comput, 66:1281–1292.
- A general framework for Vecchia approximations of Gaussian processes.
- Interpolative butterfly factorization. SIAM Journal on Scientific Computing, 39(2):A503–A531.
- Radar Wind Profiler (RWP) and Radio Acoustic Sounding System (RASS) Instrument Handbook (DOE/SC-ARM-TR-044). DOE Office of Science Atmospheric Radiation Measurement (ARM) Program, United States.
- Newsom, R. (2012). Doppler lidar (Dl) handbook. Technical Report DOE/SC-ARM/TR-101, DOE Office of Science Atmospheric Radiation Measurement (ARM) Program (United States).
- Olver, F. (2010). NIST handbook of mathematical functions hardback and CD-ROM. Cambridge university press.
- An algorithm for the rapid evaluation of special function transforms. Applied and Computational Harmonic Analysis, 28(2):203–226.
- Spatial modelling using a new class of nonstationary covariance functions. Environmetrics: The official journal of the International Environmetrics Society, 17(5):483–506.
- On some local, global and regularity behaviour of some classes of covariance functions. In Advances and challenges in space-time modelling of natural events, pages 221–238. Springer.
- Random features for large-scale kernel machines. Advances in neural information processing systems, 20.
- Nonparametric estimation of nonstationary spatial covariance structure. Journal of the American Statistical Association, 87(417):108–119.
- Stein, M. (1999). Interpolation of spatial data: some theory for kriging. Springer Science & Business Media.
- Stein, M. (2005). Statistical methods for regular monitoring data. Journal of the Royal Statistical Society Series B: Statistical Methodology, 67(5):667–687.
- Approximating likelihoods for large spatial data sets. Journal of the Royal Statistical Society Series B: Statistical Methodology, 66(2):275–296.
- The debiased whittle likelihood. Biometrika, 106(2):251–266.
- Townsend, A. (2015). A fast analysis-based discrete Hankel transform using asymptotic expansions. SIAM Journal on Numerical Analysis, 53(4):1897–1917.
- Vecchia, A. (1988). Estimation and model identification for continuous spatial processes. Journal of the Royal Statistical Society Series B: Statistical Methodology, 50(2):297–312.
- Whittle, P. (1951). Hypothesis testing in time series analysis.
- Whittle, P. (1963). Stochastic-processes in several dimensions. Bulletin of the International Statistical Institute, 40(2):974–994.
- Gaussian process kernels for pattern discovery and extrapolation. In International conference on machine learning, pages 1067–1075. PMLR.
- Zhang, H. (2004). Inconsistent estimation and asymptotically equal interpolations in model-based geostatistics. Journal of the American Statistical Association, 99(465):250–261.