Combining transition path sampling with data-driven collective variables through a reactivity-biased shooting algorithm (2404.02597v1)
Abstract: Rare event sampling is a central problem in modern computational chemistry research. Among the existing methods, transition path sampling (TPS) can generate unbiased representations of reaction processes. However, its efficiency depends on the ability to generate reactive trial paths, which in turn depends on the quality of the shooting algorithm used. We propose a new algorithm based on the shooting success rate, i.e., the reactivity, measured as a function of a reduced set of collective variables (CVs). These variables are extracted with a machine learning approach directly from TPS simulations, using a multi-task objective function. Applied iteratively, this workflow significantly improves shooting efficiency without any prior knowledge of the process. In addition, the optimized CVs can be used with biased enhanced sampling methodologies to accurately reconstruct the free energy profiles. We tested the method on three different systems: a two-dimensional toy model, conformational transitions of alanine dipeptide, and hydrolysis of acetyl chloride in bulk water. In the last of these, we integrated our workflow with an active learning scheme to learn a reactive machine learning-based potential, which allowed us to study the mechanism and free energy profile with ab initio-like accuracy.
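The idea of biasing shooting-point selection by reactivity can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes a single one-dimensional CV, a simple histogram estimate of the success rate with a pseudo-count, and hypothetical function names (`estimate_reactivity`, `selection_weights`, `pick_shooting_frame`). The abstract does not specify the estimator or the acceptance rule. A full TPS move would also need the selection probabilities of the old and new paths in its acceptance criterion, which is omitted here.

```python
import random


def _bin_index(s, cv_min, cv_max, n_bins):
    """Map a CV value to a histogram bin, clamping to the valid range."""
    width = (cv_max - cv_min) / n_bins
    return min(n_bins - 1, max(0, int((s - cv_min) / width)))


def estimate_reactivity(cv_values, outcomes, n_bins, cv_min, cv_max, prior=1.0):
    """Per-bin shooting success rate from past shots.

    cv_values: CV value at each past shooting point
    outcomes:  True if that shot produced a reactive path
    prior:     pseudo-count (an assumed choice) so unexplored bins stay selectable
    """
    hits = [0.0] * n_bins
    tries = [0.0] * n_bins
    for s, ok in zip(cv_values, outcomes):
        b = _bin_index(s, cv_min, cv_max, n_bins)
        tries[b] += 1.0
        if ok:
            hits[b] += 1.0
    return [(h + 0.5 * prior) / (t + prior) for h, t in zip(hits, tries)]


def selection_weights(path_cv, reactivity, cv_min, cv_max):
    """Normalized probability of choosing each frame of the current path."""
    n_bins = len(reactivity)
    raw = [reactivity[_bin_index(s, cv_min, cv_max, n_bins)] for s in path_cv]
    total = sum(raw)
    return [w / total for w in raw]


def pick_shooting_frame(path_cv, reactivity, cv_min, cv_max, rng=random):
    """Draw a frame index with probability proportional to its reactivity."""
    weights = selection_weights(path_cv, reactivity, cv_min, cv_max)
    return rng.choices(range(len(path_cv)), weights=weights, k=1)[0]


# Illustrative (made-up) shooting history: shots near CV = 0.5 succeed most often.
past_cv = [0.1, 0.1, 0.5, 0.5, 0.5, 0.9]
past_ok = [False, False, True, True, False, False]
reactivity = estimate_reactivity(past_cv, past_ok, n_bins=3, cv_min=0.0, cv_max=1.0)

# CV values along a hypothetical current reactive path.
path_cv = [0.05, 0.3, 0.5, 0.7, 0.95]
weights = selection_weights(path_cv, reactivity, 0.0, 1.0)
frame = pick_shooting_frame(path_cv, reactivity, 0.0, 1.0, rng=random.Random(0))
```

In an iterative workflow of the kind the abstract describes, the shooting history would grow with each TPS round and the CV itself would be retrained, so `reactivity` would be re-estimated before each new batch of shots.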