Accelerating Multilevel Markov Chain Monte Carlo Using Machine Learning Models (2405.11179v1)
Abstract: This work presents an efficient approach for accelerating multilevel Markov Chain Monte Carlo (MCMC) sampling for large-scale problems using low-fidelity machine learning models. While conventional techniques for large-scale Bayesian inference often substitute computationally expensive high-fidelity models with machine learning models, thereby introducing approximation errors, our approach offers a computationally efficient alternative by augmenting high-fidelity models with low-fidelity ones within a hierarchical framework. The multilevel approach utilizes the low-fidelity machine learning model (MLM) for inexpensive evaluation of proposed samples thereby improving the acceptance of samples by the high-fidelity model. The hierarchy in our multilevel algorithm is derived from geometric multigrid hierarchy. We utilize an MLM to acclerate the coarse level sampling. Training machine learning model for the coarsest level significantly reduces the computational cost associated with generating training data and training the model. We present an MCMC algorithm to accelerate the coarsest level sampling using MLM and account for the approximation error introduced. We provide theoretical proofs of detailed balance and demonstrate that our multilevel approach constitutes a consistent MCMC algorithm. Additionally, we derive conditions on the accuracy of the machine learning model to facilitate more efficient hierarchical sampling. Our technique is demonstrated on a standard benchmark inference problem in groundwater flow, where we estimate the probability density of a quantity of interest using a four-level MCMC algorithm. Our proposed algorithm accelerates multilevel sampling by a factor of two while achieving similar accuracy compared to sampling using the standard multilevel algorithm.
- Christian P. Robert and Wu Changye. Markov Chain Monte Carlo Methods, Survey with Some Frequent Misunderstandings. Wiley, New York, 2021.
- Accelerating MCMC algorithms. WIREs Computational Statistics, 10, 2017.
- Mcmc with delayed acceptance using a surrogate model with an application to cardiovascular fluid dynamics. In Proceedings of the International Conference on Statistics: Theory and Applications, Lisbon, Portugal, August 2019.
- Accelerating markov chain monte carlo sampling with diffusion models. Computer Physics Communications, 296, 2024.
- An adaptive surrogate modeling based on deep neural networks for large-scale bayesian inverse problems. Communications on Computational Physics, 28(5), 2020.
- Accelerating MCMC algorithms through bayesian deep networks. In Proceedings of the 34th Conference on Neural Information Processing Systems, Canada, December 2020.
- Approximation of the likelihood function in bayesian technique for the solution of inverse problems. Inverse Problems in Science and Engineering, 16(6), 2008.
- Accelerating the bayesian inference of inverse problems by using data-driven compressive sensing method based on proper orthogonal decomposition. Electronic Research Archive, 29(5), 2021.
- Fast and accurate proper orthogonal decomposition using efficient sampling and iterative techniques for singular value decomposition. ACM Tranactions on Mathematical Software, 48(2), 2022.
- Markov chain monte carlo using an approximation. Journal of Computational and Graphical statistics, 14(4):795–810, 2005.
- Preconditioning Markov chain Monte Carlo simulations using coarse-scale models. SIAM Journal on Scientific Computing, 28(2):776–803, 2006.
- Delayed acceptance particle MCMC for exact inference in stochastic kinetic models. Statistics and Computing, 25:1039–2055, 2015.
- Speeding up MCMC by delayed acceptance and data subsampling. Journal of Computational and Graphical Statistics, 27(1):12–22, 2018.
- A hierarchical multilevel markov chain monte carlo algorithm with applications to uncertainty quantification in subsurface flows. SIAM Journal of Uncertainty Quantification, 3:1075–1108, 2015.
- Multilevel delayed acceptance mcmc. SIAM Journal of Uncertainty Quantification, 11(1):1–30, 2023.
- A multilevel, hierarchical decomposition of finite element white noise with application to multilevel markov chain monte carlo. SIAM Journal of Scientific Computing, 43(5):S293–S316, 2021.
- Accelerating uncertainty quantification of groundwater flow modelling using a deep neural network proxy. Computer Methods in Applied Mechanics and Engineering, 383:113895, 2021.
- Stochastic finite elements: a spectral approach. Courier Corporation, 2003.
- R. Bassett and J. Derida. Maximum a posteriori estimators as a limit of bayes estimators. arXiv:1611.05917v2, 2018.
- Variational inference: A review for statisticians. arXiv:1601.00670v9, 2018.
- E. Hernandez-Lemus. Random fields in physics, biology and data science. Frontiers in Physics, 9:641859, 2021.
- Further analysis of multilevel monte carlo methods for elliptic pdes with random coefficients. Numerische Mathematik, 125(3):569–600, 2013.
- An explicit link between gaussian fields and gaussian markov random fields: the stochastic partial differential equation approach. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 73(4):423–498, 2011.
- A multilevel, hierarchical sampling technique for spatially correlated random fields. SIAM Journal of Scientific Computing, 39(5):S5431–S562, 2017.
- Scalable hierarchical PDE sampler for generating spatially correlated random fields using nonmatching meshes. Numerical Linear Algebra with Applications, 25(3):e2146, 2018.
- Parallel solver for H(div) problems using hybridization and AMG. In Domain Decomposition Methods in Science and Engineering XXIII, pages 69–80. Springer, 2017.
- Jun S. Liu. Monte Carlo Strategies in Scientific Computing. Springer, New York, 2008.
- MFEM: Modular finite element methods library. mfem.org.
- HYPRE: High performance preconditioners. http://www.llnl.gov/CASC/hypre/.
- Algebraic hybridization and static condensation with application to scalable H(div) preconditioning. SIAM Journal on Scientific Computing, 41(3):B425–B447, 2019.
- TensorFlow: Large-scale machine learning on heterogeneous systems, 2015. Software available from tensorflow.org.
- A. Marrel and B. Iooss. Probabilistic surrogate modeling by gaussian process: A review on recent insights in estimation and validation. Reliability Engineering & System Safety, 247:110094, 2024.
- Evaluation of POD based surrogate models of fields resulting from nonlinear fem simulations. Advanced Modeling and Simulation in Engineering Sciences, 8:25, 2021.
- K. Lee and K.T. Carlberg. Model reduction of dynamical systems on nonlinear manifolds using deep convolutional autoencoders. Journal of Computational Physics, 404:108973, 2020.