Overcoming systematic softening in universal machine learning interatomic potentials by fine-tuning (2405.07105v1)
Abstract: Machine learning interatomic potentials (MLIPs) have introduced a new paradigm for atomic simulations. Recent advancements have seen the emergence of universal MLIPs (uMLIPs) that are pre-trained on diverse materials datasets, providing opportunities for both ready-to-use universal force fields and robust foundations for downstream machine learning refinements. However, their performance in extrapolating to out-of-distribution complex atomic environments remains unclear. In this study, we highlight a consistent potential energy surface (PES) softening effect in three uMLIPs: M3GNet, CHGNet, and MACE-MP-0, which is characterized by energy and force under-prediction in a series of atomic-modeling benchmarks including surfaces, defects, solid-solution energetics, phonon vibration modes, ion migration barriers, and general high-energy states. We find that the PES softening behavior originates from a systematic underprediction error of the PES curvature, which derives from the biased sampling of near-equilibrium atomic arrangements in uMLIP pre-training datasets. We demonstrate that the PES softening issue can be effectively rectified by fine-tuning with a single additional data point. Our findings suggest that a considerable fraction of uMLIP errors are highly systematic, and can therefore be efficiently corrected. This result rationalizes the data-efficient fine-tuning performance boost commonly observed with foundational MLIPs. We argue for the importance of a comprehensive materials dataset with improved PES sampling for next-generation foundational MLIPs.
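To make the softening argument concrete, the sketch below uses a toy one-dimensional harmonic PES. It is an illustration under stated assumptions, not the authors' procedure: the curvatures `K_REF` and `K_MODEL`, the helper `softening_scale`, and the single-point rescaling are all hypothetical names and choices introduced here. The rescaling is a deliberately simplified stand-in for fine-tuning. The point is only that when a model's error is a systematic curvature underprediction, one reference force can be enough to estimate and remove it.

```python
# Toy 1D illustration of systematic PES softening (all values are assumptions).
K_REF = 10.0            # hypothetical reference curvature, eV/A^2
K_MODEL = 0.8 * K_REF   # hypothetical model curvature, 20% too soft


def ref_force(x):
    """Reference force on a harmonic PES, F = -k x."""
    return -K_REF * x


def model_force(x):
    """Force from the softened model: same form, smaller curvature."""
    return -K_MODEL * x


def softening_scale(f_model, f_ref):
    """Slope s of the least-squares line through the origin, f_model ~ s * f_ref.

    s < 1 means the model under-predicts forces on average (softening).
    """
    num = sum(m * r for m, r in zip(f_model, f_ref))
    den = sum(r * r for r in f_ref)
    return num / den


# Sample displacements around the minimum, from -0.5 to 0.5 in steps of 0.1.
displacements = [-0.5 + 0.1 * i for i in range(11)]
f_ref = [ref_force(x) for x in displacements]
f_model = [model_force(x) for x in displacements]

scale = softening_scale(f_model, f_ref)

# "Single data point" correction: estimate the scale from one off-equilibrium
# configuration and divide it out. This is a stand-in for fine-tuning, not
# an actual gradient-based update of a neural network potential.
x0 = 0.3
single_point_scale = model_force(x0) / ref_force(x0)
f_corrected = [f / single_point_scale for f in f_model]
max_err = max(abs(c - r) for c, r in zip(f_corrected, f_ref))

print(f"softening scale: {scale:.3f}")
print(f"max force error after single-point correction: {max_err:.2e}")
```

In this toy setting the error is a pure rescaling, so one point recovers the reference forces essentially exactly. A real uMLIP has many parameters and non-uniform errors, so the correction there is only approximate.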
Authors:
- Bowen Deng
- Yunyeong Choi
- Peichen Zhong
- Janosh Riebesell
- Shashwat Anand
- Zhuohan Li
- KyuJung Jun
- Kristin A. Persson
- Gerbrand Ceder