Automated optimization of force field parameters against ensemble-averaged measurements with Bayesian Inference of Conformational Populations
Abstract: Accurate force fields are essential for reliable molecular simulations. These models are refined against quantum mechanical calculations and experimental measurements, which are subject to random and systematic errors. Bayesian Inference of Conformational Populations (BICePs) is a reweighting algorithm that reconciles simulated ensembles with sparse or noisy observables by sampling the full posterior distribution of conformational populations and experimental uncertainty. In this method, a metric called the BICePs score is used to perform model selection, by calculating the free energy of "turning on" the conformational populations under experimental restraints. This approach, when used with improved likelihood functions to deal with experimental outliers, can be used for force field validation (Raddi et al. 2023). Here, we extend the BICePs approach to perform automated force field refinement while simultaneously sampling the full distribution of uncertainties, using a variational method to minimize the BICePs score. To demonstrate the utility of this method, we refine multiple interaction parameters for a 12-mer HP lattice model using ensemble-averaged distance measurements as restraints. To illustrate the resilience of BICePs in the presence of unknown random and systematic errors, we assess the performance of our algorithm through repeated optimizations and under various extents of experimental error. Our results suggest that variational optimization of the BICePs score is a promising direction for robust and automatic parameterization of molecular potentials.
- S. Boothroyd, P. K. Behara, O. C. Madin, D. F. Hahn, H. Jang, V. Gapsys, J. R. Wagner, J. T. Horton, D. L. Dotson, M. W. Thompson, et al., “Development and benchmarking of open force field 2.0. 0: The sage small molecule force field,” Journal of Chemical Theory and Computation (2023).
- J. S. Smith, B. T. Nebgen, R. Zubatyuk, N. Lubbers, C. Devereux, K. Barros, S. Tretiak, O. Isayev, and A. E. Roitberg, “Approaching coupled cluster accuracy with a general-purpose neural network potential through transfer learning,” Nature communications 10, 2903 (2019).
- J. Köfinger and G. Hummer, ‘‘Empirical Optimization of Molecular Simulation Force Fields by Bayesian Inference,” The European Physical Journal B 94, 245 (2021).
- L.-P. Wang, T. J. Martinez, and V. S. Pande, “Building Force Fields: An Automatic, Systematic, and Reproducible Approach,” The journal of physical chemistry letters 5, 1885–1891 (2014).
- Y. Ge and V. A. Voelz, “Model Selection Using BICePs: a Bayesian Approach for Force Field Validation and Parameterization,” J. Phys. Chem. B 122, 5610–5622 (2018).
- O. C. Madin, S. Boothroyd, R. A. Messerly, J. Fass, J. D. Chodera, and M. R. Shirts, “Bayesian-Inference-Driven Model Parametrization and Model Selection for 2CLJQ Fluid Models,” J. Chem. Inf. Model. 62, 874–889 (2022).
- V. A. Voelz, Y. Ge, and R. M. Raddi, “Reconciling Simulations and Experiments with BICePs: a Review,” Front. Mol. Biosci. 8, 661520 (2021).
- R. M. Raddi, T. Marshall, Y. Ge, and V. Voelz, “Model Selection Using Replica Averaging with Bayesian Inference of Conformational Populations,” (2023).
- V. a. Voelz and G. Zhou, “Bayesian Inference of Conformational State Populations from Computational Models and Sparse Experimental Observables,” J. Comput. Chem. 35, 2215–2224 (2014).
- H. Wan, Y. Ge, A. Razavi, and V. A. Voelz, “Reconciling Simulated Ensembles of Apomyoglobin with Experimental Hydrogen/Deuterium Exchange Data Using Bayesian Inference and Multiensemble Markov State Models.” J. Chem. Theory Comput. 16, 1333–1348 (2020).
- M. F. D. Hurley, J. D. Northrup, Y. Ge, C. E. Schafmeister, and V. A. Voelz, “Metal Cation-Binding Mechanisms of Q-Proline Peptoid Macrocycles in Solution,” J. Chem. Inf. Model. 61, 2818–2828 (2021).
- R. M. Raddi, Y. Ge, and V. A. Voelz, “BICePs V2. 0: Software for Ensemble Reweighting Using Bayesian Inference of Conformational Populations,” Journal of chemical information and modeling 63, 2370–2381 (2023).
- J. W. Pitera and J. D. Chodera, “On the Use of Experimental Observations to Bias Simulated Ensembles,” Journal of chemical theory and computation 8, 3445–3451 (2012).
- A. Cavalli, C. Camilloni, and M. Vendruscolo, “Molecular Dynamics Simulations with Replica-Averaged Structural Restraints Generate Structural Ensembles According to the Maximum Entropy Principle,” The Journal of chemical physics 138, 03B603 (2013).
- A. Cesari, S. Reißer, and G. Bussi, “Using the Maximum Entropy Principle to Combine Simulations and Solution Experiments,” Computation 6, 15 (2018).
- B. Roux and J. Weare, ‘‘On the Statistical Equivalence of Restrained-Ensemble Simulations with the Maximum Entropy Method,” The Journal of chemical physics 138, 02B616 (2013).
- G. Hummer and J. Köfinger, “Bayesian Ensemble Refinement by Replica Simulations and Reweighting,” The Journal of chemical physics 143, 12B634_1 (2015).
- M. Bonomi, C. Camilloni, A. Cavalli, and M. Vendruscolo, “Metainference: A Bayesian Inference Method for Heterogeneous Systems,” Sci. Adv. 2, e1501177 (2016).
- D. Sivia and J. Skilling, Data Analysis: a Bayesian Tutorial (OUP Oxford, 2006).
- K. A. Dill, S. Bromberg, K. Yue, H. S. Chan, K. M. Ftebig, D. P. Yee, and P. D. Thomas, “Principles of protein folding—a perspective from simple exact models,” Protein science 4, 561–602 (1995).
- R. H. Byrd, P. Lu, J. Nocedal, and C. Zhu, “A limited memory algorithm for bound constrained optimization,” SIAM Journal on scientific computing 16, 1190–1208 (1995).
- S. J. Wright, Numerical optimization (2006).
- V. Beiranvand, W. Hare, and Y. Lucet, “Best Practices for Comparing Optimization Algorithms,” Optim. Eng. 18, 815–848 (2017).
- M. R. Shirts and J. D. Chodera, “Statistically Optimal Analysis of Samples from Multiple Equilibrium States,” J. Chem. Phys. 129, 124105–11 (2008).
- E. Rosta, M. Nowotny, W. Yang, and G. Hummer, “Catalytic mechanism of rna backbone cleavage by ribonuclease h from quantum mechanics/molecular mechanics simulations,” Journal of the American Chemical Society 133, 8934–8941 (2011).
- J. Köfinger, B. Różycki, and G. Hummer, “Inferring Structural Ensembles of Flexible and Dynamic Macromolecules Using Bayesian, Maximum Entropy, and Minimal-Ensemble Refinement Methods,” Biomolecular Simulations: Methods and Protocols , 341–352 (2019).
- S. Bottaro, T. Bengtsen, and K. Lindorff-Larsen, “Integrating Molecular Simulation and Experimental Data: a Bayesian/Maximum Entropy Reweighting Approach,” in Structural Bioinformatics (Springer, 2020) pp. 219–240.
- J. Köfinger and G. Hummer, “Encoding prior knowledge in ensemble refinement,” (2023).
- I. Gilardoni, T. Fröhlking, and G. Bussi, “Boosting ensemble refinement with transferable force field corrections: synergistic optimization for molecular simulations,” arXiv preprint arXiv:2311.13532 (2023).
- C. E. Cavender, D. A. Case, J. C.-H. Chen, L. T. Chong, D. A. Keedy, K. Lindorff-Larsen, D. L. Mobley, O. Ollila, C. Oostenbrink, P. Robustelli, V. A. Voelz, M. E. Wall, D. C. Wych, and M. K. Gilson, “Structure-based experimental datasets for benchmarking of protein simulation force fields,” arXiv preprint arXiv:2303.11056 (2023).
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.