A fully Bayesian sparse polynomial chaos expansion approach with joint priors on the coefficients and global selection of terms (2204.06043v2)
Abstract: Polynomial chaos expansion (PCE) is a versatile tool widely used in uncertainty quantification and machine learning, but its successful application depends strongly on the accuracy and reliability of the resulting PCE-based response surface. High accuracy typically requires high polynomial degrees, demanding many training points especially in high-dimensional problems through the curse of dimensionality. So-called sparse PCE concepts work with a much smaller selection of basis polynomials compared to conventional PCE approaches and can overcome the curse of dimensionality very efficiently, but have to pay specific attention to their strategies of choosing training points. Furthermore, the approximation error resembles an uncertainty that most existing PCE-based methods do not estimate. In this study, we develop and evaluate a fully Bayesian approach to establish the PCE representation via joint shrinkage priors and Markov chain Monte Carlo. The suggested Bayesian PCE model directly aims to solve the two challenges named above: achieving a sparse PCE representation and estimating uncertainty of the PCE itself. The embedded Bayesian regularizing via the joint shrinkage prior allows using higher polynomial degrees for given training points due to its ability to handle underdetermined situations, where the number of considered PCE coefficients could be much larger than the number of available training points. We also explore multiple variable selection methods to construct sparse PCE expansions based on the established Bayesian representations, while globally selecting the most meaningful orthonormal polynomials given the available training data. We demonstrate the advantages of our Bayesian PCE and the corresponding sparsity-inducing methods on several benchmarks.
- Intuitive Joint Priors for Bayesian Linear Multilevel Models: The R2D2M2 prior. arXiv preprint (2022).
- SAMBA: sparse approximation of moment-based arbitrary polynomial chaos. Journal of Computational Physics 320 (2016), 1–16.
- Akhiezer, N. The classical moment problem. Hafner Publication Company, New York (1965).
- Some basic hypergeometric polynomials that generalize Jacobi polynomials. Memoirs of the American Mathematical Society, AMS, Providence, 1985.
- Bayesian calibration and validation of a large-scale and time-demanding sediment transport model. Water Resources Research 56, 7 (2020).
- Betancourt, M. A conceptual introduction to Hamiltonian Monte Carlo. arXiv preprint (2017).
- Default Bayesian analysis with global-local shrinkage priors. Biometrika 103, 4 (2016), 955–969.
- Sparse polynomial chaos expansions and adaptive stochastic finite elements using a regression approach. C. R. Mécanique 336, 6 (2008), 518–523.
- Bürkner, P.-C. brms: An R package for Bayesian multilevel models using Stan. Journal of Statistical Software 80, 1 (2017), 1–28.
- Bürkner, P.-C. Advanced Bayesian multilevel modeling with the R package brms. The R Journal 10, 1 (2018), 395–411.
- The orthogonal development of non-linear functionals in series of Fourier-Hermite functionals. Annals of Mathematics 48 (1947), 385–392.
- Stan: A probabilistic programming language. Journal of Statistical Software 76, 1 (2017), 1–32.
- Latent space projection predictive inference. arXiv preprint (2021).
- Projection Predictive Inference for Generalized Linear and Additive Multilevel Models. Artificial Intelligence and Statistics (AISTATS) Conference Proceedings (2022).
- Sparse gaussian process model with mixed covariance function for uncertainty quantification. International Journal for Uncertainty Quantification (2021).
- Adaptive smolyak pseudospectral approximations. SIAM Journal on Scientific Computing 35, 6 (2013), A2643–A2670.
- Near-optimal data-independent point locations for radial basis function interpolation. Advances in Computational Mathematics 23, 3 (2005), 317–330.
- On the convergence of generalized polynomial chaos expansions. ESAIM: Mathematical Modelling and Numerical Analysis 46, 2 (2012), 317–339.
- Nonlinear stochastic model predictive control via regularized polynomial chaos expansions. In IEEE Conference on Decision and Control (CDC) (2012), vol. 51, pp. 142–147.
- Favard, J. Sur les polynomes de Tchebicheff. CR Academic Science, Paris 200 (1935).
- Visualization in Bayesian workflow. Journal of the Royal Statistical Society: Series A (Statistics in Society) 182, 2 (2019), 389–402.
- Gautschi, W. Orthogonal polynomials: computation and approximation. Numerical Mathematics and Scientific Computation. Oxford University Press, New York, 2004.
- Bayesian Data Analysis (3rd Edition). Chapman and Hall/CRC, London, 2013.
- Stochastic finite elements: A spectral approach. Springer, New York, 1991.
- Least squares polynomial chaos expansion: A review of sampling strategies. Computer Methods in Applied Mechanics and Engineering 332 (2018), 382–407.
- The No-U-Turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo. Journal of Machine Learning Research 15, 1 (2014), 1593–1623.
- An importance quantification technique in uncertainty analysis for computer models. In Proceedings. First International Symposium on Uncertainty Modeling and Analysis (1990), pp. 398–403.
- An introduction to statistical learning, vol. 112. Springer, 2013.
- Karlin, S. Total Positivity, vol. I. Stanford University Press, 1968.
- Sparse quadrature as an alternative to Monte Carlo for stochastic finite element techniques. Proceedings of Applied Mathematical Mechanics 3 (2003), 493–494.
- Comparison of data-driven uncertainty quantification methods for a carbon dioxide storage benchmark scenario. Computational Geosciences (2019).
- Datasets and executables of data-driven uncertainty quantification benchmark in carbon dioxide storage. Zenodo (2017).
- Intrusive uncertainty quantification for hyperbolic-elliptic systems governing two-phase flow in heterogeneous porous media. Computational Geosciences 21, 4 (2017), 807–832.
- Central-upwind schemes on triangular grids for hyperbolic systems of conservation laws. Numerical Methods for Partial Differential Equations 21, 3 (2005), 536–552.
- Probabilistic collocation method for flow in porous media: Comparisons with other stochastic methods. Water Resources Research 43, 9 (2007), 1–13.
- Galerkin methods for linear and nonlinear elliptic stochastic partial differential equations. Computer Methods in Applied Mechanics and Engineering 194, 12-16 (2005), 1295–1331.
- Uncertainty-aware validation benchmarks for coupling free flow and porous-medium flow. arXiv preprint (2022).
- Moore, E. H. On the reciprocal of the general algebraic matrix. Bulletin of the American Mathematical Society 26 (1920), 394–395.
- Least squares approximation-based polynomial chaos expansion for uncertainty quantification and robust optimization in aeronautics. In AIAA AVIATION FORUM (2020).
- Oladyshkin, S. aPC Matlab toolbox: Data-driven arbitrary polynomial chaos. (https://www.mathworks.com/matlabcentral/fileexchange/72014-apc-matlab-toolbox-data-driven-arbitrary-polynomial-chaos, 2022.
- An integrative approach to robust design and probabilistic risk assessment for CO22{}_{2}start_FLOATSUBSCRIPT 2 end_FLOATSUBSCRIPT storage in geological formations. Computational Geosciences 15, 3 (2011), 565–577.
- Bayesian updating via Bootstrap filtering combined with data-driven polynomial chaos expansions: methodology and application to history matching for carbon dioxide storage in geological formations. Computational Geosciences 17, 4 (2013), 671–687.
- Global sensitivity analysis: a flexible and efficient framework with an example from stochastic hydrogeology. Advances in Water Resources 37 (2012), 10–22.
- Data-driven uncertainty quantification using the arbitrary polynomial chaos expansion. Reliability Engineering & System Safety 106 (2012), 179–190.
- Incomplete statistical information limits the utility of high-order polynomial chaos expansions. Reliability Engineering & System Safety 169 (2018), 137–148.
- Chaos expansion based Bootstrap filter to calibrate CO2 injection models. Energy Procedia 40 (2013), 398–407.
- A sequential sparse polynomial chaos expansion using bayesian regression for geotechnical reliability estimations. International Journal for Numerical and Analytical Methods in Geomechanics 44, 6 (2020), 874–889.
- Uncertainty quantification on the effects of rain-induced erosion on annual energy production and performance of a multi-mw wind turbine. Renewable Energy 165 (2021), 701–715.
- Using reference models in variable selection. arXiv preprint (2020).
- Penrose, R. On best approximate solutions of linear matrix equations. In Mathematical Proceedings of the Cambridge Philosophical Society (1956), vol. 52, Cambridge University Press, pp. 17–19.
- Projective inference in high-dimensional problems: Prediction and feature selection. Electronic Journal of Statistics 14, 1 (2020), 2155–2197. Publisher: Institute of Mathematical Statistics and Bernoulli Society.
- Sparsity information and regularization in the horseshoe and other shrinkage priors. Electronic Journal of Statistics 11, 2 (2017), 5018–5051.
- Uncertainty quantification for systems of conservation laws. Journal of Computational Physics 228, 7 (2009), 2443–2467.
- Shrink Globally, Act Locally: Sparse Bayesian Regularization and Prediction. In Bayesian Statistics 9, J. M. Bernardo, M. J. Bayarri, J. O. Berger, A. P. Dawid, D. Heckerman, A. F. M. Smith, and M. West, Eds. Oxford University Press, 2011, pp. 501–538.
- A probabilistic approach to uncertainty quantification with limited information. Reliability Engineering & System Safety 85, 1 (2004), 183–190.
- Practical Hilbert space approximate Bayesian Gaussian processes for probabilistic programming. Statistics and Computing 33, 1 (Dec. 2022), 17.
- Runge, C. Über empirische funktionen und die interpolation zwischen äquidistanten ordinaten. Zeitschrift für Mathematik und Physik 46, 224-243 (1901), 20.
- Surrogate-based bayesian comparison of computationally expensive models: application to microbially induced calcite precipitation. Computational Geosciences 25, 6 (2021), 1899–1917.
- Bayesian sparse polynomial chaos expansion for global sensitivity analysis. Computer Methods in Applied Mechanics and Engineering 318 (2017), 474–496.
- The problem of moments, mathematical surveys No. 1. American Mathematical Society, New York 1950 (1943).
- Siebert, W. M. On the determinants of moment matrices. Annals of Statistics 17, 2 (1989), 711–721.
- Sobol’, I. M. On the distribution of points in a cube and the approximate evaluation of integrals. Zhurnal Vychislitel’noi Matematiki i Matematicheskoi Fiziki 7, 4 (1967), 784–802.
- Sobol’, I. M. On sensitivity estimation for nonlinear mathematical models. Matematicheskoe modelirovanie 2, 1 (1990), 112–118.
- Sobol’, I. M. Theorems and examples on high dimensional model representation. Reliability Engineering and System Safety 79, 2 (2003), 187–193. Publisher: Elsevier.
- Stan Development Team. Stan Modeling Language: User’s Guide and Reference Manual, 2022.
- Stieltjes, T. J. Quelques recherches sur la théorie des quadratures dites mécaniques. In Annales scientifiques de l’École Normale Supérieure (1884), vol. 1, pp. 409–426.
- Sudret, B. Global sensitivity analysis using polynomial chaos expansions. Reliability engineering & system safety 93, 7 (2008), 964–979.
- Sullivan, T. J. Introduction to uncertainty quantification, vol. 63 of Texts in Applied Mathematics. Springer, Cham, 2015.
- Regression-based sparse polynomial chaos for uncertainty quantification of subsurface flow models. Journal of Computational Physics 399 (2019).
- Solutions of ill-posed problems. Scripta Series in Mathematics. V. H. Winston & Sons, Washington, D.C.: John Wiley & Sons, New York-Toronto, Ont.-London, 1977. Translated from the Russian, Preface by translation editor Fritz John.
- Tipping, M. E. Sparse bayesian learning and the relevance vector machine. Journal of machine learning research 1, Jun (2001), 211–244.
- The predictive Lasso. Statistics and Computing 22, 5 (2012), 1069–1084.
- Shrinkage priors for Bayesian penalized regression. Journal of Mathematical Psychology 89 (2019), 31–50.
- Rank-normalization, folding, and localization: An improved R^^𝑅\widehat{R}over^ start_ARG italic_R end_ARG for assessing convergence of MCMC (with discussion). Bayesian Analysis 16, 2 (2021), 667––718.
- Solution of differential equation models by polynomial approximation. Prentice-Hall, 1978.
- Wendland, H. Scattered data approximation, vol. 17. Cambridge University Press, 2004.
- Wiener, N. The homogeneous chaos. American Journal of Mathematics 60, 4 (1938), 897–936.
- A vectorial kernel orthogonal greedy algorithm. Dolomites Research Notes on Approximation 6 (2013).
- Modeling arbitrary uncertainties using Gram-Schmidt polynomial chaos. In 44th AIAA aerospace sciences meeting and exhibit (2006).
- Wood, S. N. Generalized additive models: an introduction with R. London: Chapman and Hall/CRC, 2017.
- The Wiener–Askey polynomial chaos for stochastic differential equations. SIAM Journal on Scientific Computing 24, 2 (2002), 619–644.
- Modeling uncertainty in flow simulations via generalized polynomial chaos. Journal of Computational Physics 187 (2003), 137–167.
- Yes, but did it work?: Evaluating variational inference. In International Conference on Machine Learning (2018), pp. 5581–5590.
- A sparse grid based bayesian method for contaminant source identification. Advances in Water Resources 37 (2012), 1–9.
- Optimized sparse polynomial chaos expansion with entropy regularization. Advances in Aerodynamics 4, 1 (2022).
- Evaluation of multiple reduced-order models to enhance confidence in global sensitivity analyses. International Journal of Greenhouse Gas Control 49 (2016), 217–226.
- Bayesian regression using a prior on the model fit: The R2-D2 shrinkage prior. Journal of the American Statistical Association (2020).