Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
139 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

MACE-OFF: Transferable Short Range Machine Learning Force Fields for Organic Molecules (2312.15211v4)

Published 23 Dec 2023 in physics.chem-ph

Abstract: Classical empirical force fields have dominated biomolecular simulation for over 50 years. Although widely used in drug discovery, crystal structure prediction, and biomolecular dynamics, they generally lack the accuracy and transferability required for first-principles predictive modeling. In this paper, we introduce MACE-OFF, a series of short range transferable force fields for organic molecules created using state-of-the-art machine learning technology and first-principles reference data computed with a high level of quantum mechanical theory. MACE-OFF demonstrates the remarkable capabilities of short range models by accurately predicting a wide variety of gas and condensed phase properties of molecular systems. It produces accurate, easy-to-converge dihedral torsion scans of unseen molecules, as well as reliable descriptions of molecular crystals and liquids, including quantum nuclear effects. We further demonstrate the capabilities of MACE-OFF by determining free energy surfaces in explicit solvent, as well as the folding dynamics of peptides.Finally, we simulate a fully solvated small protein, observing accurate secondary structure and vibrational spectrum. These developments enable first-principles simulations of molecular systems for the broader chemistry community at high accuracy and relatively low computational cost.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (30)
  1. A. V. Shapeev, Moment tensor potentials: A class of systematically improvable interatomic potentials, Multiscale Modeling & Simulation 14, 1153 (2016).
  2. J. Behler, Four generations of high-dimensional neural network potentials, Chemical Reviews 121, 10037 (2021).
  3. B. Huang and O. A. Von Lilienfeld, Ab initio machine learning in chemical compound space, Chemical reviews 121, 10001 (2021).
  4. A. T. Hagler, Force field development phase ii: Relaxation of physics-based criteria… or inclusion of more rigorous physics into the representation of molecular energetics, J Comput Aided Mol Des 33, 205–264 (2019).
  5. C. Bannwarth, S. Ehlert, and S. Grimme, Gfn2-xtb—an accurate and broadly parametrized self-consistent tight-binding quantum chemical method with multipole electrostatics and density-dependent dispersion contributions, Journal of chemical theory and computation 15, 1652 (2019).
  6. J. S. Smith, O. Isayev, and A. E. Roitberg, Ani-1: an extensible neural network potential with dft accuracy at force field computational cost, Chemical science 8, 3192 (2017a).
  7. J. Behler and M. Parrinello, Generalized neural-network representation of high-dimensional potential-energy surfaces, Physical review letters 98, 146401 (2007).
  8. T. Plé , L. Lagardère, and J.-P. Piquemal, Force-field-enhanced neural network interactions: from local equivariant embedding to atom-in-molecule properties and long-range effects, Chemical Science 10.1039/d3sc02581k (2023).
  9. M. Thürlemann and S. Riniker, Hybrid classical/machine-learning force fields for the accurate description of molecular condensed-phase systems, Chemical Science 14, 12661 (2023).
  10. K. Schütt, O. Unke, and M. Gastegger, Equivariant message passing for the prediction of tensorial properties and molecular spectra, in International Conference on Machine Learning (PMLR, 2021) pp. 9377–9388.
  11. J. Gasteiger, F. Becker, and S. Günnemann, Gemnet: Universal directional graph neural networks for molecules, Advances in Neural Information Processing Systems 34, 6790 (2021).
  12. M. J. Willatt, F. Musil, and M. Ceriotti, Feature optimization for atomistic machine learning yields a data-driven construction of the periodic table of the elements, Physical Chemistry Chemical Physics 20, 29661 (2018).
  13. E. Wigner, Group theory: and its application to the quantum mechanics of atomic spectra, Vol. 5 (Elsevier, 2012).
  14. A. Najibi and L. Goerigk, The nonlocal kernel in van der waals density functionals as an additive correction: An extensive analysis with special emphasis on the b97m-v and ω𝜔\omegaitalic_ωb97m-v approaches, Journal of Chemical Theory and Computation 14, 5725 (2018).
  15. F. Weigend and R. Ahlrichs, Balanced basis sets of split valence, triple zeta valence and quadruple zeta valence quality for h to rn: Design and assessment of accuracy, Physical Chemistry Chemical Physics 7, 3297 (2005).
  16. D. Rappoport and F. Furche, Property-optimized gaussian basis sets for molecular response calculations, The Journal of chemical physics 133 (2010).
  17. S. Grimme, S. Ehrlich, and L. Goerigk, Effect of the damping function in dispersion corrected density functional theory, Journal of computational chemistry 32, 1456 (2011).
  18. S.-L. J. Lahey, T. N. Thien Phuc, and C. N. Rowley, Benchmarking force field and the ani neural network potentials for the torsional potential energy surface of biaryl drug fragments, Journal of Chemical Information and Modeling 60, 6258 (2020).
  19. B. A. Kolesov, M. A. Mikhailenko, and E. V. Boldyreva, Dynamics of the intermolecular hydrogen bonds in the polymorphs of paracetamol in relation to crystal packing and conformational transitions: a variable-temperature polarized raman spectroscopy study, Physical Chemistry Chemical Physics 13, 14243 (2011).
  20. N. Raimbault, V. Athavale, and M. Rossi, Anharmonic effects in the low-frequency vibrational modes of aspirin and paracetamol crystals, Physical Review Materials 3, 10.1103/physrevmaterials.3.053605 (2019c).
  21. G. A. Dolgonos, J. Hoja, and A. D. Boese, Revised values for the x23 benchmark set of molecular crystals, Physical Chemistry Chemical Physics 21, 24333 (2019).
  22. G. R. Medders, V. Babin, and F. Paesani, Development of a “First-Principles” Water Potential with Flexible Monomers. III. Liquid Phase Properties, Journal of Chemical Theory and Computation 10, 2906 (2014).
  23. A. K. Soper, The radial distribution functions of water as derived from radiation total scattering experiments: Is there anything we can say for sure?, ISRN Physical Chemistry 2013, 1–67 (2013).
  24. S. Zhang, R. Schweitzer-Stenner, and B. Urbanc, Do Molecular Dynamics Force Fields Capture Conformational Dynamics of Alanine in Water?, Journal of Chemical Theory and Computation 16, 510 (2020).
  25. Y. Zhang and C. Sagui, Secondary structure assignment for conformationally irregular peptides: Comparison between dssp, stride and kaksi, Journal of Molecular Graphics and Modelling 55, 72 (2015).
  26. K. N. Woods, The glassy state of crambin and the THz time scale protein-solvent fluctuations possibly related to protein function, BMC Biophysics 7, 8 (2014).
  27. L.-P. Wang and C. Song, Geometry optimization made simple with translation and rotation coordinates, The Journal of chemical physics 144 (2016).
  28. N. Raimbault, V. Athavale, and M. Rossi, Anharmonic effects in the low-frequency vibrational modes of aspirin and paracetamol crystals, Physical Review Materials 3, 053605 (2019d).
  29. M. Brehm and B. Kirchner, TRAVIS - A Free Analyzer and Visualizer for Monte Carlo and Molecular Dynamics Trajectories, Journal of Chemical Information and Modeling 51, 2007 (2011).
  30. M. Thomas, M. Brehm, and B. Kirchner, Voronoi dipole moments for the simulation of bulk phase vibrational spectra, Physical Chemistry Chemical Physics 17, 3207 (2015).
Citations (29)

Summary

  • The paper demonstrates MACE-OFF23 as a transferable ML force field that achieves near chemical accuracy in predicting energies and forces for organic molecules.
  • It leverages a higher-order equivariant message-passing neural network using spherical harmonic basis inputs to model local, short-range interactions from quantum data.
  • Its robust performance is verified through accurate dihedral scans, molecular crystal property predictions, and simulations of biomolecular dynamics, enhancing drug discovery and materials science.

An Analysis of MACE-OFF23: Transferable Machine Learning Force Fields for Organic Molecules

The paper "MACE-OFF23: Transferable Machine Learning Force Fields for Organic Molecules" presents a new development in computational chemistry, focusing on enhancing the accuracy and transferability of force fields for organic molecular systems using ML techniques. The paper introduces MACE-OFF23, a series of force fields designed to provide higher accuracy in predicting the properties of organic molecules compared to traditional empirical force fields.

Background and Motivation

Empirical force fields have been integral to molecular simulations for over five decades, particularly in drug discovery and biomolecular dynamics. However, the limitations in accuracy and transferability of these models have hindered predictive modeling applications. Machine learning force fields have recently gained attention due to improvements in accuracy and computational efficiency, offering an alternative to density functional theory (DFT) in materials chemistry. MACE-OFF23 leverages state-of-the-art ML methods, underpinned by quantum mechanical data, to address these challenges.

Methodology

MACE-OFF23 is based on the MACE architecture, which is a higher-order equivariant message-passing neural network. The model predicts the potential energy of molecular systems by leveraging input features mapped in a spherical harmonic basis and employs a local, short-range approach to modeling. The authors present three versions of MACE-OFF23—small, medium, and large—each varying in expressivity and computational cost to suit different simulation requirements.

The training dataset, a subset of the SPICE dataset, includes configurations for molecules containing up to 50 atoms, augmented with larger molecules from the QMugs dataset and water cluster data for improved non-bonded interaction modeling. The MACE models were trained to reproduce energies and forces from high-level quantum mechanics reference data.

Results

The paper evaluates MACE-OFF23 against several criteria:

  1. Prediction Accuracy: The model outperforms existing ML models, such as ANI-2x and AIMNet, on various datasets, achieving near chemical accuracy on energy predictions. Particularly notable is the model's strong performance in predicting intermolecular forces, crucial for accurate condensed-phase simulations.
  2. Dihedral Scans: MACE-OFF23 accurately predicts torsional energy profiles, which are challenging for traditional force fields due to their complex potential energy surfaces.
  3. Molecular Crystal Properties: MACE-OFF23 effectively predicts properties such as lattice enthalpies and vibrational spectra, demonstrating improved accuracy over empirical models.
  4. Simulation of Condensed Phases: The medium MACE model predicts liquid densities and heats of vaporization with notable accuracy, correlating well with experimental data.
  5. Biomolecular Dynamics: The small MACE model captures folding dynamics of peptides and the vibrational modes of solvated proteins, which is critical for biological simulations.

The computational performance of MACE-OFF23 is competitive, with scalable performance demonstrated through LAMMPS and OpenMM implementations.

Implications and Future Directions

The development of MACE-OFF23 has significant implications for the field of computational chemistry. The accuracy and efficiency of this ML force field can enhance simulations within drug design, materials science, and beyond, offering a feasible alternative to both classical force fields and quantum mechanical methods. Future advancements could involve integrating long-range interactions and extending the applicability to charged systems, paving the way for broader biomolecular simulations.

Conclusion

MACE-OFF23 represents a sophisticated and transferable solution for simulating organic molecules with high accuracy and low computational cost. This innovation in machine learning force fields stands to invigorate research across varied chemical domains, providing a powerful tool for accurately predicting molecular behaviors in diverse environments.

Youtube Logo Streamline Icon: https://streamlinehq.com