Using early rejection Markov chain Monte Carlo and Gaussian processes to accelerate ABC methods (2404.08898v1)
Abstract: Approximate Bayesian computation (ABC) is a class of Bayesian inference algorithms that targets for problems with intractable or {unavailable} likelihood function. It uses synthetic data drawn from the simulation model to approximate the posterior distribution. However, ABC is computationally intensive for complex models in which simulating synthetic data is very expensive. In this article, we propose an early rejection Markov chain Monte Carlo (ejMCMC) sampler based on Gaussian processes to accelerate inference speed. We early reject samples in the first stage of the kernel using a discrepancy model, in which the discrepancy between the simulated and observed data is modeled by Gaussian process (GP). Hence, the synthetic data is generated only if the parameter space is worth exploring. We demonstrate from theory, simulation experiments, and real data analysis that the new algorithm significantly improves inference efficiency compared to existing early-rejection MCMC algorithms. In addition, we employ our proposed method within an ABC sequential Monte Carlo (SMC) sampler. In our numerical experiments, we use examples of ordinary differential equations, stochastic differential equations, and delay differential equations to demonstrate the effectiveness of the proposed algorithm. We develop an R package that is available at https://github.com/caofff/ejMCMC.
- Convolutional neural networks as summary statistics for approximate bayesian computation. IEEE/ACM Transactions on Computational Biology and Bioinformatics 19(6), 3353–3365.
- Altman, N. S. (1992). An introduction to kernel and nearest-neighbor nonparametric regression. The American Statistician 46(3), 175–185.
- Approximate Bayesian computation in population genetics. Genetics 162(4), 2025–2035.
- Nicholson’s blowflies differential equations revisited: main results and open problems. Applied Mathematical Modelling 34(6), 1405–1417.
- A comparative review of dimension reduction methods in approximate bayesian computation. Statistical Science 28(2), 189–208.
- Handbook of markov chain monte carlo, Chapter Likelihood-Free MCMC, pp. 313–333. New York: CRC press.
- Improving approximate Bayesian computation via Quasi-Monte Carlo. Journal of Computational and Graphical Statistics 28(1), 205–219.
- Markov chain monte carlo using an approximation. Journal of Computational and Graphical statistics 14(4), 795–810.
- Componentwise approximate Bayesian computation via Gibbs-like steps. Biometrika 108(3), 591–607.
- Cleveland, W. S. and S. J. Devlin (1988). Locally weighted regression: an approach to regression analysis by local fitting. Journal of the American statistical association 83(403), 596–610.
- Sequential Monte Carlo samplers. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 68(3), 411–436.
- An adaptive sequential Monte Carlo method for approximate Bayesian computation. Statistics and Computing 22(5), 1009–1020.
- An introduction to sequential Monte Carlo methods. In Sequential Monte Carlo methods in practice, pp. 3–14. Springer.
- Accelerating pseudo-marginal mcmc using gaussian processes. Computational Statistics & Data Analysis 118, 1–17.
- Epanechnikov, V. A. (1969). Non-parametric estimation of a multivariate probability density. Theory of Probability & Its Applications 14(1), 153–158.
- Everitt, R. G. and P. A. Rowińska (2021). Delayed acceptance ABC-SMC. Journal of Computational and Graphical Statistics 30(1), 55–66.
- Constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation [with discussion]. Journal of the Royal Statistical Society. Series B (Statistical Methodology) 74(3), 419–474.
- Asymptotic properties of approximate bayesian computation. Biometrika 105(3), 593–607.
- Bayesian inference using synthetic likelihood: Asymptotics and adjustments. Journal of the American Statistical Association 0(0), 1–12.
- Bayesian data analysis. Chapman and Hall/CRC.
- Learning and Inference for Computational Systems Biology, Chapter Markov chain Monte Carlo algorithms for SDE parameter estimation, pp. 253–276. MIT Press.
- Bayesian optimization for likelihood-free inference of simulator-based statistical models. Journal of Machine Learning Research 17(125), 1–47.
- Learning summary statistic for approximate bayesian computation via deep neural network. Statistica Sinica 27(4), 1595–1618.
- Approximately sufficient statistics and bayesian computation. Statistical applications in genetics and molecular biology 7(1), Article 26.
- Efficient acquisition rules for model-based approximate bayesian computation. Bayesian Analysis 14(2), 595–622.
- Gaussian process modelling in approximate Bayesian computation to estimate horizontal gene transfer in bacteria. The Annals of Applied Statistics 12(4), 2228 – 2251.
- Batch simulations and uncertainty quantification in gaussian process surrogate approximate bayesian computation. In Conference on Uncertainty in Artificial Intelligence, pp. 779–788. PMLR.
- Approximate bayesian computational methods. Statistics and Computing 22(6), 1167–1180.
- Markov chain Monte Carlo without likelihoods. Proceedings of the National Academy of Sciences 100(26), 15324–15328.
- Maruyama, G. (1955). Continuous markov processes and stochastic equations. Rendiconti del Circolo Matematico di Palermo 4, 48–90.
- May, R. M. (1976). Models for single populations. In R. M. May (Ed.), Theoretical ecology, pp. 4–25. Philadelphia, Pennsylvania, USA: W. B. Saunders Company.
- Nicholson, A. J. (1954). An outline of the dynamics of animal populations. Australian Journal of Zoology 2(1), 9–65.
- Nunes, M. A. and D. J. Balding (2010). On optimal selection of summary statistics for approximate bayesian computation. Statistical applications in genetics and molecular biology 9(1), 34.
- Picchini, U. (2014). Inference for SDE models via approximate Bayesian computation. Journal of Computational and Graphical Statistics 23(4), 1080–1100.
- Bayesian synthetic likelihood. Journal of Computational and Graphical Statistics 27(1), 1–11.
- Population growth of human Y chromosomes: a study of Y chromosome microsatellites. Molecular Biology and Evolution 16(12), 1791–1798.
- Reliable abc model choice via random forests. Bioinformatics 32(6), 859–866.
- Likelihood-free markov chain monte carlo. In S. Brooks, G. L. Jones, and X.-L. Meng (Eds.), Handbook of Markov Chain Monte Carlo. Chapman and Hall/CRC Press. In press.
- Handbook of approximate Bayesian computation. CRC Press.
- Sequential Monte Carlo without likelihoods. Proceedings of the National Academy of Sciences 104(6), 1760–1765.
- Solving differential equations in r: package desolve. Journal of statistical software 33, 1–25.
- Kernel interpolation for scalable online gaussian processes. In International Conference on Artificial Intelligence and Statistics, pp. 3133–3141. PMLR.
- Efficient approximate Bayesian computation coupled with Markov chain Monte Carlo without likelihood. Genetics 182(4), 1207–1218.
- Wilkinson, R. (2014). Accelerating abc methods using gaussian processes. In Artificial Intelligence and Statistics, pp. 1015–1023. PMLR.