MACE-OFF: Transferable Short Range Machine Learning Force Fields for Organic Molecules (2312.15211v4)
Abstract: Classical empirical force fields have dominated biomolecular simulation for over 50 years. Although widely used in drug discovery, crystal structure prediction, and biomolecular dynamics, they generally lack the accuracy and transferability required for first-principles predictive modeling. In this paper, we introduce MACE-OFF, a series of short range transferable force fields for organic molecules created using state-of-the-art machine learning technology and first-principles reference data computed with a high level of quantum mechanical theory. MACE-OFF demonstrates the remarkable capabilities of short range models by accurately predicting a wide variety of gas and condensed phase properties of molecular systems. It produces accurate, easy-to-converge dihedral torsion scans of unseen molecules, as well as reliable descriptions of molecular crystals and liquids, including quantum nuclear effects. We further demonstrate the capabilities of MACE-OFF by determining free energy surfaces in explicit solvent, as well as the folding dynamics of peptides.Finally, we simulate a fully solvated small protein, observing accurate secondary structure and vibrational spectrum. These developments enable first-principles simulations of molecular systems for the broader chemistry community at high accuracy and relatively low computational cost.
- A. V. Shapeev, Moment tensor potentials: A class of systematically improvable interatomic potentials, Multiscale Modeling & Simulation 14, 1153 (2016).
- J. Behler, Four generations of high-dimensional neural network potentials, Chemical Reviews 121, 10037 (2021).
- B. Huang and O. A. Von Lilienfeld, Ab initio machine learning in chemical compound space, Chemical reviews 121, 10001 (2021).
- A. T. Hagler, Force field development phase ii: Relaxation of physics-based criteria… or inclusion of more rigorous physics into the representation of molecular energetics, J Comput Aided Mol Des 33, 205–264 (2019).
- C. Bannwarth, S. Ehlert, and S. Grimme, Gfn2-xtb—an accurate and broadly parametrized self-consistent tight-binding quantum chemical method with multipole electrostatics and density-dependent dispersion contributions, Journal of chemical theory and computation 15, 1652 (2019).
- J. S. Smith, O. Isayev, and A. E. Roitberg, Ani-1: an extensible neural network potential with dft accuracy at force field computational cost, Chemical science 8, 3192 (2017a).
- J. Behler and M. Parrinello, Generalized neural-network representation of high-dimensional potential-energy surfaces, Physical review letters 98, 146401 (2007).
- T. Plé , L. Lagardère, and J.-P. Piquemal, Force-field-enhanced neural network interactions: from local equivariant embedding to atom-in-molecule properties and long-range effects, Chemical Science 10.1039/d3sc02581k (2023).
- M. Thürlemann and S. Riniker, Hybrid classical/machine-learning force fields for the accurate description of molecular condensed-phase systems, Chemical Science 14, 12661 (2023).
- K. Schütt, O. Unke, and M. Gastegger, Equivariant message passing for the prediction of tensorial properties and molecular spectra, in International Conference on Machine Learning (PMLR, 2021) pp. 9377–9388.
- J. Gasteiger, F. Becker, and S. Günnemann, Gemnet: Universal directional graph neural networks for molecules, Advances in Neural Information Processing Systems 34, 6790 (2021).
- M. J. Willatt, F. Musil, and M. Ceriotti, Feature optimization for atomistic machine learning yields a data-driven construction of the periodic table of the elements, Physical Chemistry Chemical Physics 20, 29661 (2018).
- E. Wigner, Group theory: and its application to the quantum mechanics of atomic spectra, Vol. 5 (Elsevier, 2012).
- A. Najibi and L. Goerigk, The nonlocal kernel in van der waals density functionals as an additive correction: An extensive analysis with special emphasis on the b97m-v and ω𝜔\omegaitalic_ωb97m-v approaches, Journal of Chemical Theory and Computation 14, 5725 (2018).
- F. Weigend and R. Ahlrichs, Balanced basis sets of split valence, triple zeta valence and quadruple zeta valence quality for h to rn: Design and assessment of accuracy, Physical Chemistry Chemical Physics 7, 3297 (2005).
- D. Rappoport and F. Furche, Property-optimized gaussian basis sets for molecular response calculations, The Journal of chemical physics 133 (2010).
- S. Grimme, S. Ehrlich, and L. Goerigk, Effect of the damping function in dispersion corrected density functional theory, Journal of computational chemistry 32, 1456 (2011).
- S.-L. J. Lahey, T. N. Thien Phuc, and C. N. Rowley, Benchmarking force field and the ani neural network potentials for the torsional potential energy surface of biaryl drug fragments, Journal of Chemical Information and Modeling 60, 6258 (2020).
- B. A. Kolesov, M. A. Mikhailenko, and E. V. Boldyreva, Dynamics of the intermolecular hydrogen bonds in the polymorphs of paracetamol in relation to crystal packing and conformational transitions: a variable-temperature polarized raman spectroscopy study, Physical Chemistry Chemical Physics 13, 14243 (2011).
- N. Raimbault, V. Athavale, and M. Rossi, Anharmonic effects in the low-frequency vibrational modes of aspirin and paracetamol crystals, Physical Review Materials 3, 10.1103/physrevmaterials.3.053605 (2019c).
- G. A. Dolgonos, J. Hoja, and A. D. Boese, Revised values for the x23 benchmark set of molecular crystals, Physical Chemistry Chemical Physics 21, 24333 (2019).
- G. R. Medders, V. Babin, and F. Paesani, Development of a “First-Principles” Water Potential with Flexible Monomers. III. Liquid Phase Properties, Journal of Chemical Theory and Computation 10, 2906 (2014).
- A. K. Soper, The radial distribution functions of water as derived from radiation total scattering experiments: Is there anything we can say for sure?, ISRN Physical Chemistry 2013, 1–67 (2013).
- S. Zhang, R. Schweitzer-Stenner, and B. Urbanc, Do Molecular Dynamics Force Fields Capture Conformational Dynamics of Alanine in Water?, Journal of Chemical Theory and Computation 16, 510 (2020).
- Y. Zhang and C. Sagui, Secondary structure assignment for conformationally irregular peptides: Comparison between dssp, stride and kaksi, Journal of Molecular Graphics and Modelling 55, 72 (2015).
- K. N. Woods, The glassy state of crambin and the THz time scale protein-solvent fluctuations possibly related to protein function, BMC Biophysics 7, 8 (2014).
- L.-P. Wang and C. Song, Geometry optimization made simple with translation and rotation coordinates, The Journal of chemical physics 144 (2016).
- N. Raimbault, V. Athavale, and M. Rossi, Anharmonic effects in the low-frequency vibrational modes of aspirin and paracetamol crystals, Physical Review Materials 3, 053605 (2019d).
- M. Brehm and B. Kirchner, TRAVIS - A Free Analyzer and Visualizer for Monte Carlo and Molecular Dynamics Trajectories, Journal of Chemical Information and Modeling 51, 2007 (2011).
- M. Thomas, M. Brehm, and B. Kirchner, Voronoi dipole moments for the simulation of bulk phase vibrational spectra, Physical Chemistry Chemical Physics 17, 3207 (2015).