Overcoming systematic softening in universal machine learning interatomic potentials by fine-tuning (2405.07105v1)

Published 11 May 2024 in cond-mat.mtrl-sci, cs.AI, and cs.LG

Abstract: Machine learning interatomic potentials (MLIPs) have introduced a new paradigm for atomic simulations. Recent advancements have seen the emergence of universal MLIPs (uMLIPs) that are pre-trained on diverse materials datasets, providing opportunities for both ready-to-use universal force fields and robust foundations for downstream machine learning refinements. However, their performance in extrapolating to out-of-distribution complex atomic environments remains unclear. In this study, we highlight a consistent potential energy surface (PES) softening effect in three uMLIPs: M3GNet, CHGNet, and MACE-MP-0, which is characterized by energy and force under-prediction in a series of atomic-modeling benchmarks including surfaces, defects, solid-solution energetics, phonon vibration modes, ion migration barriers, and general high-energy states. We find that the PES softening behavior originates from a systematic underprediction error of the PES curvature, which derives from the biased sampling of near-equilibrium atomic arrangements in uMLIP pre-training datasets. We demonstrate that the PES softening issue can be effectively rectified by fine-tuning with a single additional data point. Our findings suggest that a considerable fraction of uMLIP errors are highly systematic, and can therefore be efficiently corrected. This result rationalizes the data-efficient fine-tuning performance boost commonly observed with foundational MLIPs. We argue for the importance of a comprehensive materials dataset with improved PES sampling for next-generation foundational MLIPs.

Authors (9)
  1. Bowen Deng
  2. Yunyeong Choi
  3. Peichen Zhong
  4. Janosh Riebesell
  5. Shashwat Anand
  6. Zhuohan Li
  7. KyuJung Jun
  8. Kristin A. Persson
  9. Gerbrand Ceder

Summary

Overcoming Systematic Softening in Machine Learning Interatomic Potentials

Introduction

In the field of materials science, creating accurate simulations at the atomic level is a complex but essential task. Recently, Machine Learning Interatomic Potentials (MLIPs) have gained momentum as a means to bridge the gap between precise quantum mechanical simulations and feasible computational costs. However, despite advancements, a critical challenge remains: how well these models extrapolate to complex atomic environments that were not part of their training data.

This paper evaluates how three specific universal MLIPs (uMLIPs)—M3GNet, CHGNet, and MACE-MP-0—perform on challenging cases, what systematic errors are common, and how simple fine-tuning methods can significantly improve their predictions.

MLIP Framework and Common Challenges

First, let's unpack how these MLIPs work. The total energy E of a system is approximated as a sum of atomic contributions, and the force on each atom is derived as the negative gradient of this energy:

E = \sum_i^n E_i(\{\vec{r}_i\}, \{C_i\}),
f_i = - \frac{\partial E}{\partial \vec{r}_i}

Here, each E_i is a function that predicts the energy contribution of atom i based on its position (\{\vec{r}_i\}) and chemical identity (\{C_i\}).
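As a toy illustration of this energy-force relationship, the sketch below uses a simple Lennard-Jones pair energy in place of the learned atomic contributions and recovers forces by finite differences of f_i = -∂E/∂r_i. The functional form and all numbers are stand-ins, not anything from the paper:

```python
import numpy as np

# Toy pairwise (Lennard-Jones) energy standing in for the learned E_i terms;
# any MLIP plays the same role: E is a sum over atoms, forces are -dE/dr.
def total_energy(positions, eps=1.0, sigma=1.0):
    e = 0.0
    n = len(positions)
    for i in range(n):
        for j in range(i + 1, n):
            r = np.linalg.norm(positions[i] - positions[j])
            e += 4 * eps * ((sigma / r) ** 12 - (sigma / r) ** 6)
    return e

def forces(positions, h=1e-5):
    """Central finite differences of f_i = -dE/dr_i."""
    f = np.zeros_like(positions)
    for i in range(len(positions)):
        for k in range(3):
            p_plus, p_minus = positions.copy(), positions.copy()
            p_plus[i, k] += h
            p_minus[i, k] -= h
            f[i, k] = -(total_energy(p_plus) - total_energy(p_minus)) / (2 * h)
    return f

pos = np.array([[0.0, 0.0, 0.0], [1.2, 0.0, 0.0], [0.0, 1.3, 0.0]])
f = forces(pos)
# Internal forces on an isolated cluster must sum to zero (Newton's third law).
print(np.allclose(f.sum(axis=0), 0.0, atol=1e-6))
```

In a real MLIP the gradient is taken analytically through the network rather than by finite differences, but the definition of the force is the same.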

However, the Potential Energy Surface (PES) shows systematic "softening"—basically, the models underpredict the energies and forces in high-energy configurations. This softening is traceable to biased training datasets, which mostly include near-equilibrium atomic arrangements, limiting how well MLIPs can generalize to unseen environments.

Results Across Different Modeling Tasks

The authors extensively tested the uMLIPs on several tasks:

Surface Energies

For predicting surface energies—a vital factor in applications like catalysis and thin film growth—the uMLIPs consistently underestimated the surface energies compared to Density Functional Theory (DFT) calculations.
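For context, surface energies are typically extracted from slab calculations as gamma = (E_slab - N*E_bulk) / (2A), where the factor of 2 accounts for the two surfaces of a symmetric slab. The sketch below shows this bookkeeping; all numerical values are invented placeholders, not results from the paper:

```python
# Hypothetical numbers for illustration only; units: eV and Å.
E_slab = -33.04   # total energy of a symmetric slab (assumed)
E_bulk = -4.20    # bulk energy per atom (assumed)
n_atoms = 8       # number of atoms in the slab
area = 7.9        # surface cell area in Å^2

EV_PER_A2_TO_J_PER_M2 = 16.0218  # 1 eV/Å^2 = 16.0218 J/m^2

# Factor of 2: a symmetric slab exposes two equivalent surfaces.
gamma = (E_slab - n_atoms * E_bulk) / (2 * area)
print(f"gamma = {gamma * EV_PER_A2_TO_J_PER_M2:.2f} J/m^2")
```

A softened uMLIP underpredicts E_slab - N*E_bulk, and hence gamma, relative to DFT.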

Defect Energies

Similarly, for point defects, which significantly affect a material's properties, the uMLIPs again tended to underpredict defect energies relative to DFT references.
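As a reminder of the quantity involved, a neutral vacancy formation energy is computed as E_f = E(defect cell) - E(perfect cell) + mu of the removed species. The sketch below uses placeholder numbers purely for illustration:

```python
# Neutral vacancy formation energy: E_f = E_defect - E_perfect + mu_removed.
# All numbers are placeholders, not values from the paper.
E_perfect = -400.00   # energy of the pristine supercell (assumed)
E_defect = -394.10    # same supercell with one atom removed (assumed)
mu_removed = -4.20    # chemical potential of the removed species (assumed)

E_f = E_defect - E_perfect + mu_removed
print(f"E_f = {E_f:.2f} eV")
```

Because E_defect describes a locally distorted, higher-energy environment, a softened PES pulls E_f below the DFT value.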

Solid-Solution Energetics

In systems like the Ca_xMg_{2-x}O_2 solid solution, the uMLIPs underpredicted the mixing (interaction) energies, leading to inaccuracies in modeling phase stability and solubility behavior.
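The mixing energy measures a mixed configuration's energy against the composition-weighted end members; written per cation (with x the Ca fraction, equivalent up to normalization to the Ca_xMg_{2-x}O_2 formula), E_mix = E_ss - x*E_CaO - (1-x)*E_MgO. All energies below are assumed values for illustration:

```python
# Mixing energy per cation for a CaO-MgO rocksalt solid solution;
# all energies are placeholders (eV per formula unit), not from the paper.
x = 0.5           # Ca fraction on the cation sublattice
E_ss = -11.95     # energy of the mixed configuration (assumed)
E_CaO = -12.60    # CaO end member (assumed)
E_MgO = -11.40    # MgO end member (assumed)

E_mix = E_ss - x * E_CaO - (1 - x) * E_MgO
print(f"E_mix = {E_mix * 1000:.0f} meV per cation")  # positive -> mixing costs energy
```

A softened PES compresses these small energy differences, distorting predicted solubility limits.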

Phonon Properties

Phonon frequencies, which dictate vibrational modes in materials, were another area of underprediction, especially for the highest-energy vibrational modes, suggesting a softer PES in high-energy regions.
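The link between PES curvature and phonon frequencies can be seen in a minimal model: for a 1D monatomic chain, omega(k) = 2*sqrt(K/m)*|sin(ka/2)|, so underpredicting the force constant K by a factor s shifts every frequency down by sqrt(s). This is an analytic toy model, not the paper's phonon workflow:

```python
import numpy as np

# 1D monatomic chain dispersion: omega(k) = 2*sqrt(K/m)*|sin(k a / 2)|.
# A softened PES means an underpredicted curvature K, so all branches shift down.
def chain_frequencies(K, m=1.0, a=1.0, nk=50):
    k = np.linspace(0.0, np.pi / a, nk)
    return 2.0 * np.sqrt(K / m) * np.abs(np.sin(k * a / 2.0))

w_dft = chain_frequencies(K=1.00)    # "true" force constant
w_soft = chain_frequencies(K=0.81)   # curvature underpredicted by 19%

ratio = w_soft[1:] / w_dft[1:]       # skip k=0 where both branches are zero
print(ratio.min(), ratio.max())      # uniform shift of sqrt(0.81) = 0.9
```

The uniform downward shift mirrors the systematic underprediction of the highest-frequency modes reported in the paper.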

Ion Migration Barriers

Ion migration, critical for understanding ionic conductors and battery materials, showed the same softening trend, with barriers systematically underestimated by all models.
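A migration barrier is simply the maximum energy along the minimum-energy path minus the initial-state energy; if the whole potential energy surface is scaled down, the barrier drops proportionally. The profile below is invented for illustration, not data from the paper:

```python
import numpy as np

# Energies along a discretized migration path (e.g. NEB images); eV, illustrative.
path_dft = np.array([0.00, 0.18, 0.42, 0.55, 0.41, 0.17, 0.00])
# A uniformly softened potential scales the profile down, lowering the barrier.
softening = 0.8
path_soft = softening * path_dft

barrier_dft = path_dft.max() - path_dft[0]
barrier_soft = path_soft.max() - path_soft[0]
print(barrier_dft, barrier_soft)  # 0.55 eV vs ~0.44 eV
```

Underestimated barriers translate directly into overestimated ionic diffusivities, which is why this error matters for battery materials screening.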

Quantifying the Softening

To quantify the degree of PES softening, the authors examined the relationship between predicted forces and the corresponding DFT calculations. They introduced a "softening scale," derived from the slope of a linear fit of uMLIP-predicted forces against DFT reference forces. A scale of less than 1 indicates softening.

Notably, over 90% of 1,000 compounds tested exhibited this softening across all three uMLIPs.
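The softening scale can be sketched as a zero-intercept least-squares slope of predicted versus reference force components. The data below are synthetic (a stand-in slope of 0.85 plus noise), used only to show the fit itself:

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic force components standing in for DFT references (eV/Å).
f_dft = rng.normal(0.0, 1.0, size=5000)
# A softened uMLIP underpredicts magnitudes: slope < 1, plus some noise.
f_mlip = 0.85 * f_dft + rng.normal(0.0, 0.05, size=5000)

# Softening scale = slope of the zero-intercept least-squares fit of
# predicted vs. reference forces; a value below 1 indicates PES softening.
scale = np.dot(f_mlip, f_dft) / np.dot(f_dft, f_dft)
print(f"softening scale = {scale:.3f}")
```

On a scatter plot of predicted against DFT forces, this slope is exactly the tilt of the point cloud away from the diagonal.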

Data-Efficient Fine-Tuning

Here's the good news: a significant portion of these errors can be corrected with minimal additional training data. By fine-tuning the models with even a single new data point in high-energy regions, the softening can be mitigated effectively.

For instance, fine-tuning the CHGNet model on a single high-energy data point rotated the force prediction distribution back towards the diagonal, vastly improving prediction accuracy. Extending fine-tuning to ten new data points resulted in even further error reduction.
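Why can one data point do so much? If the dominant error is a single global scale factor, one labeled configuration is enough to pin it down. The sketch below illustrates that logic with an explicit rescaling; this is an idealized illustration, not the paper's actual gradient-based fine-tuning of network weights:

```python
import numpy as np

rng = np.random.default_rng(1)

# Idealized case: the uMLIP error is exactly one global scale factor.
f_dft_all = rng.normal(0.0, 1.0, size=(100, 3))   # reference forces
f_mlip_all = 0.85 * f_dft_all                     # uniformly softened predictions

# One additional labeled configuration (a single set of forces) suffices
# to estimate the correction factor by least squares.
f_dft_1, f_mlip_1 = f_dft_all[0], f_mlip_all[0]
correction = np.dot(f_dft_1, f_mlip_1) / np.dot(f_mlip_1, f_mlip_1)

f_corrected = correction * f_mlip_all
print(np.max(np.abs(f_corrected - f_dft_all)))  # residual error after correction
```

Real uMLIP errors are not perfectly uniform, which is why fine-tuning on a handful of points, rather than pure rescaling, gives the further error reduction the authors report.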

Implications and Future Directions

The paper underscores two major points:

  1. Systematic Softening: The PES softening is a prevalent issue across current uMLIPs, affecting predictions on various complex tasks.
  2. Fine-Tuning: Addressing these errors is feasible and efficient, with even minimal additional data points significantly improving model performance.

Future efforts should focus on generating more comprehensive datasets that cover a broader range of atomic configurations, especially high-energy states. Moreover, scaling up the model size could help mitigate the softening effect, as larger models like MACE tended to perform better.

In conclusion, while current uMLIPs exhibit PES softening that limits their accuracy in high-energy atomic configurations, this is not an insurmountable obstacle. Data-efficient fine-tuning offers a practical remedy, making these models robust tools for atomistic simulations and materials discovery.