Grappa -- A Machine Learned Molecular Mechanics Force Field (2404.00050v2)
Abstract: Simulating large molecular systems over long timescales requires force fields that are both accurate and efficient. In recent years, E(3) equivariant neural networks have lifted the tension between computational efficiency and accuracy of force fields, but they are still several orders of magnitude more expensive than established molecular mechanics (MM) force fields. Here, we propose Grappa, a machine learning framework to predict MM parameters from the molecular graph, employing a graph attentional neural network and a transformer with symmetry-preserving positional encoding. The resulting Grappa force field outperformstabulated and machine-learned MM force fields in terms of accuracy at the same computational efficiency and can be used in existing Molecular Dynamics (MD) engines like GROMACS and OpenMM. It predicts energies and forces of small molecules, peptides, RNA and - showcasing its extensibility to uncharted regions of chemical space - radicals at state-of-the-art MM accuracy. We demonstrate Grappa's transferability to macromolecules in MD simulations from a small fast folding protein up to a whole virus particle. Our force field sets the stage for biomolecular simulations closer to chemical accuracy, but with the same computational cost as established protein force fields.
- Learning local equivariant representations for large-scale atomistic dynamics, 2022. URL https://arxiv.org/abs/2204.05249.
- Mace: Higher order equivariant message passing neural networks for fast and accurate force fields, 2023.
- Tensornet: Cartesian tensor representations for efficient learning of molecular potentials, 2023.
- Accurate machine learned quantum-mechanical force fields for biomolecular simulations, 2022.
- Equivariant message passing for the prediction of tensorial properties and molecular spectra, 2021. URL https://arxiv.org/abs/2102.03150.
- End-to-end differentiable construction of molecular mechanics force fields. Chemical Science, 13(41):12016–12033, 2022. doi: 10.1039/d2sc02739a. URL https://doi.org/10.1039%2Fd2sc02739a.
- Attention is all you need, 2023. URL https://arxiv.org/abs/1706.03762.
- Gromacs: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX, 1-2:19–25, 2015. ISSN 2352-7110.
- Openmm 4: A reusable, extensible, hardware independent library for high performance molecular simulation. Journal of Chemical Theory and Computation, 9(1):461–469, 2013. doi: 10.1021/ct300857j. URL https://doi.org/10.1021/ct300857j. PMID: 23316124.
- Espaloma-0.3.0: Machine-learned molecular mechanics force field for the simulation of protein-ligand systems and beyond, 2023.
- Graph attention networks, 2018. URL https://arxiv.org/abs/1710.10903.
- Deep residual learning for image recognition, 2015.
- Layer normalization, 2016.
- On the expressivity role of layernorm in transformers’ attention. arXiv preprint arXiv:2305.02582, 2023.
- Graph transformer networks, 2020.
- Fast, efficient generation of high-quality atomic charges. am1-bcc model: Ii. parameterization and validation. Journal of Computational Chemistry, 23(16):1623–1641, 2002. doi: https://doi.org/10.1002/jcc.10128. URL https://onlinelibrary.wiley.com/doi/abs/10.1002/jcc.10128.
- Improved side-chain torsion potentials for the amber ff99sb protein force field. Proteins, 78(8):1950–1958, Jun 2010. doi: 10.1002/prot.22711. URL https://doi.org/10.1002/prot.22711.
- Schnet: A continuous-filter convolutional neural network for modeling quantum interactions, 2017. URL https://arxiv.org/abs/1706.08566.
- Sobolev training for neural networks, 2017.
- Development and benchmarking of open force field 2.0.0 — the sage small molecule force field. ChemRxiv, 2022. doi: 10.1021/acs.jctc.3c00039. URL https://doi.org/10.1021/acs.jctc.3c00039. This content is a preprint and has not been peer-reviewed.
- A fast and high-quality charge model for the next generation general amber force field. The Journal of chemical physics, 153:114502, 09 2020. doi: 10.1063/5.0019056. URL https://doi.org/10.1063/5.0019056.
- ff14sb: Improving the accuracy of protein side chain and backbone parameters from ff99sb. Journal of Chemical Theory and Computation, 11(8):3696–3713, 2015. ISSN 1549-9618. doi: 10.1021/acs.jctc.5b00255. URL https://doi.org/10.1021/acs.jctc.5b00255.
- Refinement of the cornell et al. nucleic acids force field based on reference quantum chemical calculations of glycosidic torsion profiles. Journal of Chemical Theory and Computation, 7(9):2886–2902, 2011. ISSN 1549-9618. doi: 10.1021/ct200162x. URL https://doi.org/10.1021/ct200162x.
- Spice, a dataset of drug-like molecules and peptides for training machine learning potentials. Scientific Data, 10(11), 2023. URL https://doi.org/10.1038/s41597-022-01882-6.
- How fast-folding proteins fold. Science, 334(6055):517–520, 2011. doi: 10.1126/science.1208351. URL https://www.science.org/doi/abs/10.1126/science.1208351.
- Hybrid kinetic monte carlo/molecular dynamics simulations of bond scissions in proteins. Journal of Chemical Theory and Computation, 16(1):553–563, 2020. doi: 10.1021/acs.jctc.9b00786. URL https://doi.org/10.1021/acs.jctc.9b00786. PMID: 31738552.
- Pytorch: An imperative style, high-performance deep learning library, 2019.
- Deep graph library: A graph-centric, highly-performant package for graph neural networks, 2020.
- Fast and accurate deep network learning by exponential linear units (elus), 2016.
- Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15:1929–1958, 06 2014.
- Adam: A method for stochastic optimization, 2017.
- Sgdr: Stochastic gradient descent with warm restarts, 2017.
- New formulation for derivatives of torsion angles and improper torsion angles in molecular mechanics: Elimination of singularities. Journal of Computational Chemistry, 17(9):1132–1141, 1996.
- Psi4 1.4: Open-source software for high-throughput quantum chemistry. J. Chem. Phys., 2020. doi: 10.1063/5.0006002.
- Berk Hess. P-LINCS: A parallel linear constraint solver for molecular simulation. 4(1):116–122. ISSN 1549-9618, 1549-9626. doi: 10.1021/ct700200b. URL https://pubs.acs.org/doi/10.1021/ct700200b.
- Canonical sampling through velocity rescaling. 126(1):014101. ISSN 0021-9606, 1089-7690. doi: 10.1063/1.2408420.
- M. Parrinello and A. Rahman. Polymorphic transitions in single crystals: A new molecular dynamics method. 52(12):7182–7190. ISSN 0021-8979. doi: 10.1063/1.328693. URL https://doi.org/10.1063/1.328693.
- Crystal structure of a ten-amino acid protein. Journal of the American Chemical Society, 130(46):15327–15331, 2008. doi: 10.1021/ja8030533. URL https://doi.org/10.1021/ja8030533. PMID: 18950166.
- Leif Seute (3 papers)
- Eric Hartmann (1 paper)
- Jan Stühmer (13 papers)
- Frauke Gräter (4 papers)