Repelling-Attracting Hamiltonian Monte Carlo (2403.04607v1)
Abstract: We propose a variant of Hamiltonian Monte Carlo (HMC), called the Repelling-Attracting Hamiltonian Monte Carlo (RAHMC), for sampling from multimodal distributions. The key idea that underpins RAHMC is a departure from the conservative dynamics of Hamiltonian systems, which form the basis of traditional HMC, and turning instead to the dissipative dynamics of conformal Hamiltonian systems. In particular, RAHMC involves two stages: a mode-repelling stage to encourage the sampler to move away from regions of high probability density; and, a mode-attracting stage, which facilitates the sampler to find and settle near alternative modes. We achieve this by introducing just one additional tuning parameter -- the coefficient of friction. The proposed method adapts to the geometry of the target distribution, e.g., modes and density ridges, and can generate proposals that cross low-probability barriers with little to no computational overhead in comparison to traditional HMC. Notably, RAHMC requires no additional information about the target distribution or memory of previously visited modes. We establish the theoretical basis for RAHMC, and we discuss repelling-attracting extensions to several variants of HMC in literature. Finally, we provide a tuning-free implementation via dual-averaging, and we demonstrate its effectiveness in sampling from, both, multimodal and unimodal distributions in high dimensions.
- M. Agarwal and D. Vats. Globally centered autocovariances in MCMC. Journal of Computational and Graphical Statistics, 31(3):629–638, 2022.
- Pseudo-marginal Hamiltonian Monte Carlo. Journal of Machine Learning Research, 22(141):1–45, 2021.
- An introduction to MCMC for machine learning. Machine Learning, 50:5–43, 2003.
- V. I. Arnold. Mathematical Methods of Classical Mechanics, volume 60. Springer Science & Business Media, 2013.
- The heavy ball with friction method, I. the continuous dynamical system: Global exploration of the local minima of a real-valued function by asymptotic analysis of a dissipative dynamical system. Communications in Contemporary Mathematics, 2(01):1–34, 2000.
- Lectures on Morse Homology. Springer, 2004.
- Geometry and dynamics for Markov chain Monte Carlo. Annual Review of Statistics and Its Application, 5:451–471, 2018.
- On damped second-order gradient systems. Journal of Differential Equations, 259(7):3115–3143, 2015.
- G. Benettin and A. Giorgilli. On the Hamiltonian interpolation of near-to-the identity symplectic mappings with application to symplectic integration algorithms. Journal of Statistical Physics, 74:1117–1143, 1994.
- Optimal tuning of the hybrid Monte Carlo algorithm. Bernoulli, 19(5a):1501 – 1534, 2013.
- M. Betancourt. Adiabatic Monte Carlo. arXiv preprint arXiv:1405.3489, 2014.
- M. Betancourt. A Conceptual Introduction to Hamiltonian Monte Carlo. arXiv e-prints, art. arXiv:1701.02434, Jan. 2017.
- The geometric foundations of Hamiltonian Monte Carlo. Bernoulli, 23(4a):2257–2298, 2017.
- Monte Carlo Simulation in Statistical Physics, volume 8. Springer, 1992.
- Non-canonical Hamiltonian Monte Carlo. arXiv preprint arXiv:2008.08191, 2020.
- Handbook of Markov Chain Monte Carlo. CRC press, 2011.
- Computational and inferential difficulties with mixture posterior distributions. Journal of the American Statistical Association, 95(451):957–970, 2000. ISSN 01621459.
- Stochastic gradient Hamiltonian Monte Carlo. In International conference on machine learning, pages 1683–1691. Pmlr, 2014.
- M. Cuturi. Sinkhorn distances: Lightspeed computation of optimal transport. Advances in Neural Information Processing Systems, 26, 2013.
- Hybrid Monte Carlo. Physics Letters B, 195(2):216–222, Sept. 1987.
- N. Fournier and A. Guillin. On the rate of convergence in Wasserstein distance of the empirical measure. Probability Theory and Related Fields, 162(3-4):707, 2015.
- Conformal symplectic and relativistic optimization. Journal of Statistical Mechanics: Theory and Experiment, 2020(12):124008, 2020.
- Bayesian Data Analysis. Chapman and Hall/CRC, 1995.
- M. Ghosh. Exponential tail bounds for Chi squared random variables. Journal of Statistical Theory and Practice, 15(2):1–6, 2021.
- M. Girolami and B. Calderhead. Riemann manifold Langevin and Hamiltonian Monte Carlo methods. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 73(2):123–214, 2011.
- G. Gobbo and B. J. Leimkuhler. Extended Hamiltonian approach to continuous tempering. Physical Review E, 91(6):061301, 2015.
- M. Graham and A. Storkey. Continuously tempered Hamiltonian Monte Carlo. In Conference on Uncertainty in Artificial Intelligence, 2017.
- Geometric Numerical Integration: Structure-Preserving Algorithms for Ordinary Differential Equations, volume 31. Springer Science & Business Media, 2006.
- W. K. Hastings. Monte Carlo sampling methods using Markov chains and heir applications. Biometrika, 57(1):97–109, 1970.
- M. D. Hoffman and A. Gelman. The No-U-Turn sampler: Adaptively setting path lengths in Hamiltonian Monte Carlo. Journal of Machine Learning Research, 15(47):1593–1623, 2014.
- H. Khalil. Nonlinear Systems. Pearson Education. Prentice Hall, 2002. ISBN 9780130673893.
- Equi-energy sampler with applications in statistical inference and statistical mechanics. The Annals of Statistics, 34(4):1581 – 1619, 2006.
- Time-reversal symmetry in dynamical systems: A survey. Physica D: Nonlinear Phenomena, 112(1-2):1–39, 1998.
- Wormhole Hamiltonian Monte Carlo. Proceedings of the AAAI Conference on Artificial Intelligence, pages 1953–1959, 2014.
- B. Leimkuhler and S. Reich. A Metropolis adjusted Nosé-Hoover thermostat. ESAIM: Mathematical Modelling and Numerical Analysis, 43(4):743–755, 2009.
- Generalizing Hamiltonian Monte Carlo with neural networks. In International Conference on Learning Representations, 2018.
- Z. Liu and Z. Zhang. Quantum-inspired Hamiltonian Monte Carlo for Bayesian sampling. ArXiv, abs/1912.01937, 2019.
- On the geometric ergodicity of Hamiltonian Monte Carlo. 2019.
- Relativistic Monte Carlo. In Artificial Intelligence and Statistics, pages 1236–1245. Pmlr, 2017.
- Thermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning. Advances in Neural Information Processing Systems, 31, 2018.
- Hamiltonian descent methods. arXiv preprint arXiv:1809.05042, 2018.
- Does Hamiltonian Monte Carlo mix faster than a random walk on multimodal densities?, 2018.
- R. McLachlan and M. Perlmutter. Conformal Hamiltonian systems. Journal of Geometry and Physics, 39(4):276–300, 2001.
- Equation of State Calculations by Fast Computing Machines. The Journal of Chemical Physics, 21(6):1087–1092, 1953.
- J. W. Milnor. Morse Theory. Number 51. Princeton University Press, 1963.
- K. Modin and G. Söderlind. Geometric integration of Hamiltonian systems perturbed by Rayleigh damping. BIT Numerical Mathematics, 51:977–1007, 2011.
- An efficient Markov chain Monte Carlo method for distributions with intractable normalising constants. Biometrika, 93(2):451–458, 2006.
- M. Muehlebach and M. I. Jordan. Optimization with momentum: Dynamical, control-theoretic, and symplectic perspectives. The Journal of Machine Learning Research, 22(1):3407–3456, 2021.
- MCMC for doubly-intractable distributions. arXiv preprint arXiv:1206.6848, 2012.
- R. Neal. MCMC Using Hamiltonian Dynamics. In S. Brooks, A. Gelman, G. L. Jones, and X.-L. Meng, editors, Handbook of Markov Chain Monte Carlo, pages 113–162. Chapman & Hall/CRC, 2011.
- R. M. Neal. Bayesian Learning for Neural Networks, volume 118 of Lecture Notes in Statistics. Springer, New York, 1996. ISBN 978-0-387-94724-2.
- R. M. Neal. Slice sampling. Annals of statistics, pages 705–741, 2003.
- Pseudo-extended Markov chain Monte Carlo. Advances in Neural Information Processing Systems, 32, 2019.
- Y. Nesterov. Primal-dual subgradient methods for convex problems. Mathematical Programming, (1):221–259, 2009.
- L. I. Nicolaescu. An Invitation to Morse Theory. Springer, 2011.
- A. Nishimura and D. Dunson. Geometrically tempered Hamiltonian Monte Carlo. arXiv preprint arXiv:1604.00872, 2016.
- Discontinuous Hamiltonian Monte Carlo for discrete parameters and discontinuous likelihoods. Biometrika, 107(2):365–380, 2020.
- V. M. Panaretos and Y. Zemel. Statistical aspects of Wasserstein distances. Annual review of statistics and its application, 6:405–431, 2019.
- J. Park. Sampling from multimodal distributions using tempered Hamiltonian transitions, 2021.
- J. Park and M. Haran. Bayesian inference in the presence of intractable normalizing functions. Journal of the American Statistical Association, 113(523):1372–1390, 2018.
- Optimisation and asymptotic stability. International Journal of Control, 91(11):2404–2410, 2018.
- A framework for adaptive MCMC targeting multimodal distributions. The Annals of Statistics, 48(5):2930 – 2952, 2020.
- S. Reich. Backward error analysis for numerical integrators. SIAM Journal on Numerical Analysis, 36(5):1549–1570, 1999.
- R. Rockafellar. Convex Analysis. Princeton Landmarks in Mathematics and Physics. Princeton University Press, 1997. ISBN 9780691015866.
- S. Sastry. Nonlinear Systems: Analysis, Stability, and Control, volume 10. Springer Science & Business Media, 2013.
- C. Sminchisescu and M. Welling. Generalized darting Monte Carlo. Pattern Recognition, 44(10):2738–2748, 2011.
- Gradient-free Hamiltonian Monte Carlo with efficient kernel exponential families. Advances in Neural Information Processing Systems, 28, 2015.
- A repelling-attracting metropolis algorithm for multimodality. Journal of Computational and Graphical Statistics, 27(3):479–490, 2018.
- The calculation of posterior distributions by data augmentation. Journal of the American Statistical Association, 82(398):528–540, 1987. ISSN 01621459.
- L. Tierney. Markov chains for exploring posterior distributions. the Annals of Statistics, pages 1701–1728, 1994.
- Magnetic Hamiltonian Monte Carlo. In D. Precup and Y. W. teh, editors, Proceedings of the 34th International Conference on Machine Learning, volume 70 of Proceedings of Machine Learning Research, pages 3453–3461. Pmlr, 06–11 Aug 2017.
- D. A. van Dyk and X.-L. Meng. The art of data augmentation. Journal of Computational and Graphical Statistics, 10(1):1–50, 2001.
- Asymptotic analysis of a structure-preserving integrator for damped Hamiltonian systems. Discrete and Continuous Dynamical Systems, 41(7):3319–3341, 2020.
- Continuous relaxations for discrete Hamiltonian Monte Carlo. Advances in Neural Information Processing Systems, 25, 2012.