Machine Learning Potentials (MLPs)
- Machine Learning Potentials are data-driven surrogate models that simulate atomic interactions with quantum-mechanical accuracy at statistical-mechanics scales.
- They employ information-theoretic motif sampling to uniformly cover diverse chemical environments, enhancing predictive transferability across compositions.
- The PACE framework uses a linear atomic cluster expansion with advanced regularization to achieve efficient training and precise energy and force predictions.
Machine learning potentials (MLPs) are data-driven surrogate models that enable atomistic simulations with quantum-mechanical accuracy and statistical-mechanics scalability. MLPs have revolutionized structure/property prediction, dynamics, and phase diagram determination in an array of chemically complex and disordered materials, including multicomponent alloys. A central challenge in this domain is achieving predictive fidelity across the full compositional and structural landscape of such alloys, from perfectly ordered stoichiometric compounds to maximally disordered solid solutions. The integration of information-theoretic sampling strategies with advanced body-ordered descriptors—most notably, the Performant Atomic Cluster Expansion (PACE)—provides an effective means of constructing robust, transferable MLPs for alloys that span this range, as exemplified by recent developments in motif-based sampling (MBS) (Sheriff et al., 14 Jun 2025).
1. Information-Theoretic Sampling of Local Chemical Motifs
The critical link between the local environment of an atom and the macroscopic properties of a material motivates a formal definition of "motifs": each atom's first coordination polyhedron, labeled by the species of its nearest neighbors, defines a discrete motif $m$. In a database of local environments, the empirical motif frequency is

$$p(m) = \frac{n_m}{N},$$

where $n_m$ counts the occurrences of motif $m$ among the $N$ environments in the database.
The combinatorial space of possible motifs for a multicomponent system is vast, and uniform coverage is neither guaranteed nor trivial to realize with random structure enumeration, especially for rare or complex motifs associated with certain compositions or short-range orders.
To rectify sampling bias, the motif distribution $p$ is explicitly compared to the uniform reference $u(m) = 1/M$ (with $M$ the total number of possible motifs) using the Jensen–Shannon divergence,

$$D_{\mathrm{JS}}(p \,\|\, u) = \tfrac{1}{2} D_{\mathrm{KL}}(p \,\|\, q) + \tfrac{1}{2} D_{\mathrm{KL}}(u \,\|\, q),$$

where $q = \tfrac{1}{2}(p + u)$ and $D_{\mathrm{KL}}(a \,\|\, b) = \sum_m a(m) \log_2 \left[ a(m)/b(m) \right]$.
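This divergence-from-uniformity measure can be computed directly from a motif histogram. A minimal sketch, using a toy four-motif alphabet (the counts are illustrative, not data from the source):

```python
import numpy as np

def jensen_shannon(p: np.ndarray, u: np.ndarray) -> float:
    """Jensen-Shannon divergence (in bits) between distributions p and u."""
    q = 0.5 * (p + u)
    def kl(a, b):
        mask = a > 0                      # 0 * log(0) contributes nothing
        return float(np.sum(a[mask] * np.log2(a[mask] / b[mask])))
    return 0.5 * kl(p, q) + 0.5 * kl(u, q)

# Toy motif histogram: 4 possible motifs, one never sampled
counts = np.array([70, 20, 10, 0], dtype=float)
p = counts / counts.sum()
u = np.full(4, 0.25)                      # uniform reference
print(round(jensen_shannon(p, u), 3))     # ~0.231 bits from uniformity
```

A perfectly uniform histogram gives zero divergence, so driving this value toward zero is the sampling objective.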
Maximizing the Shannon entropy $H(p) = -\sum_m p(m) \log_2 p(m)$, which is equivalent to minimizing $D_{\mathrm{JS}}$, is operationalized through the motif-based sampling (MBS) algorithm:
- Generate an initial pool of alloy structures across the full composition simplex (e.g., fcc random substitutions).
- Within each structure, perform atom swaps to increase $H(p)$.
- Iterate until $p(m)$ is acceptably close to uniform across the motif space.
This approach ensures that even rare coordination environments—including those present in ordered or highly short-range-ordered regimes—are systematically incorporated into the training set.
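The swap-based entropy maximization above can be sketched on a toy system. Here the "structure" is assumed to be a one-dimensional ring with two species, and a motif is a site's species plus the sorted species of its two neighbors; the real algorithm operates on 3D coordination polyhedra, but the accept-or-revert logic is the same:

```python
import numpy as np
from collections import Counter

rng = np.random.default_rng(0)

def motifs(species, neighbors):
    """Motif of each site: its own species plus sorted neighbor species."""
    return [(species[i], tuple(sorted(species[j] for j in neighbors[i])))
            for i in range(len(species))]

def entropy_bits(species, neighbors):
    counts = Counter(motifs(species, neighbors))
    p = np.array(list(counts.values()), dtype=float)
    p /= p.sum()
    return float(-(p * np.log2(p)).sum())

# Toy "structure": ring of 24 sites, two species, clustered initial occupation
n = 24
neighbors = [((i - 1) % n, (i + 1) % n) for i in range(n)]
species = np.array([0] * 20 + [1] * 4)

h0 = h = entropy_bits(species, neighbors)
for _ in range(500):                              # greedy swap loop
    i, j = rng.choice(n, size=2, replace=False)
    species[i], species[j] = species[j], species[i]
    h_new = entropy_bits(species, neighbors)
    if h_new >= h:
        h = h_new                                 # keep the swap
    else:
        species[i], species[j] = species[j], species[i]  # revert
print(round(h0, 3), "->", round(h, 3))
```

Because swaps conserve composition, the loop diversifies local environments without leaving the target stoichiometry.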
2. Model Architecture: The Linear Atomic Cluster Expansion (PACE)
The PACE framework implements a linear atomic cluster expansion (ACE), which provides a systematically improvable, permutation- and rotation-invariant decomposition of the total energy,

$$E = \sum_i E_i, \qquad E_i = \sum_\nu c_\nu B_\nu^{(i)},$$

where the $B_\nu^{(i)}$ are body-ordered cluster basis functions built from one- through many-body descriptors of atom $i$'s neighbor coordinates $\{\mathbf{r}_{ij}\}$. Each $B_\nu$ incorporates radial basis functions $R_{nl}(r)$, spherical harmonics $Y_{lm}(\hat{\mathbf{r}})$, and Chebyshev polynomials. For two-body terms,

$$A^{(i)}_{nlm} = \sum_j R_{nl}(r_{ij})\, Y_{lm}(\hat{\mathbf{r}}_{ij}).$$
Higher-order (three-body and beyond) invariants are formed via tensor contractions of these two-body atomic base functions $A_{nlm}$. The PACE codebase automates construction and pruning of the resulting large basis set, ensuring computational tractability and expressivity.
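A minimal sketch of the two-body, $l = 0$ slice of such descriptors, assuming a Chebyshev radial basis damped by a cosine cutoff (the basis choices here are illustrative, not the exact PACE basis):

```python
import numpy as np

def radial_basis(r, n_max, r_cut):
    """Chebyshev functions T_n(x) on r mapped to [-1, 1], times a smooth cutoff."""
    x = np.clip(2.0 * r / r_cut - 1.0, -1.0, 1.0)   # map [0, r_cut] -> [-1, 1]
    fc = 0.5 * (np.cos(np.pi * r / r_cut) + 1.0)    # cosine cutoff, fc(r_cut) = 0
    return np.array([np.cos(n * np.arccos(x)) * fc for n in range(n_max)])

def two_body_A(distances, n_max=4, r_cut=5.0):
    """A_n = sum_j R_n(r_ij): a sum over neighbors, hence permutation-invariant."""
    A = np.zeros(n_max)
    for r in distances:
        if r < r_cut:
            A += radial_basis(r, n_max, r_cut)
    return A

A = two_body_A([2.5, 2.5, 3.6])   # three neighbors of a central atom
print(A.shape)                    # (4,)
```

Because the descriptor is a plain sum over neighbors, reordering the neighbor list leaves it unchanged, which is the permutation invariance the expansion requires.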
3. Training Workflow and Regularization
PACE-based MLPs are trained by simultaneously matching reference DFT energies and forces, minimizing a weighted least-squares loss

$$\mathcal{L} = w_E \sum_s \left| E_s - E_s^{\mathrm{DFT}} \right|^2 + w_F \sum_{s,i} \left| \mathbf{F}_{s,i} - \mathbf{F}_{s,i}^{\mathrm{DFT}} \right|^2,$$

with the weights $w_E$ and $w_F$ tuned so that energy and force residuals per atom contribute comparably in magnitude. Tikhonov (ridge) regularization on the expansion coefficients prevents overfitting; a small regularization strength is typically sufficient.
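For a linear model, this weighted energy-plus-force fit with ridge regularization reduces to a stacked linear solve. A toy sketch with synthetic descriptor matrices (the weights and regularization strength here are illustrative, not the values used in the work):

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy linear problem: descriptor rows for energies and for force components
n_basis = 8
Phi_E = rng.normal(size=(20, n_basis))    # 20 reference energies
Phi_F = rng.normal(size=(120, n_basis))   # 120 force components
c_true = rng.normal(size=n_basis)
E_ref = Phi_E @ c_true                    # noiseless synthetic targets
F_ref = Phi_F @ c_true

w_E, w_F, lam = 1.0, 0.1, 1e-6            # loss weights, ridge strength (illustrative)
A = np.vstack([np.sqrt(w_E) * Phi_E, np.sqrt(w_F) * Phi_F])
y = np.concatenate([np.sqrt(w_E) * E_ref, np.sqrt(w_F) * F_ref])

# Normal equations with Tikhonov regularization: (A^T A + lam I) c = A^T y
c = np.linalg.solve(A.T @ A + lam * np.eye(n_basis), A.T @ y)
print(np.max(np.abs(c - c_true)))          # tiny: data are noiseless
```

Scaling each block by the square root of its loss weight makes the single least-squares solve equivalent to minimizing the weighted sum above.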
Key features of the workflow include isotropic lattice expansion to mimic finite-temperature conditions (up to 1000 K), random atomic displacements, and small strains to sample vibrational anharmonicity.
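These perturbations can be sketched as follows; the cell, expansion factor, displacement scale, and strain magnitude are all illustrative:

```python
import numpy as np

rng = np.random.default_rng(2)

def perturb(cell, frac_coords, expand=1.02, disp=0.05, strain=0.01):
    """Apply isotropic expansion, a small random symmetric strain, and
    random Cartesian displacements (all magnitudes illustrative)."""
    s = rng.uniform(-1, 1, (3, 3))
    eps = strain * 0.5 * (s + s.T)                        # symmetric strain tensor
    new_cell = expand * cell @ (np.eye(3) + eps)          # expanded, strained cell
    cart = frac_coords @ new_cell                         # fractional -> Cartesian
    cart += rng.normal(scale=disp, size=cart.shape)       # rattle the atoms
    return new_cell, cart @ np.linalg.inv(new_cell)       # back to fractional

cell = 3.6 * np.eye(3)                                    # fcc-like cubic cell (Angstrom)
frac = np.array([[0, 0, 0], [0.5, 0.5, 0], [0.5, 0, 0.5], [0, 0.5, 0.5]])
new_cell, new_frac = perturb(cell, frac)
print(new_cell.shape, new_frac.shape)
```

Applying many such perturbations to each sampled structure populates the training set with the off-lattice, finite-temperature geometries the MLP must handle in dynamics.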
With MBS, the final dataset achieves a motif "packing density" (fraction of possible motifs present) up to 30% higher than with random sampling, and reduces the Jensen–Shannon divergence from uniformity by up to 0.12 bits (e.g., for CrCoNi with 702 structures).
4. Validation: Accuracy and Transferability across Compositions
Quantitative property prediction benchmarks illustrate the consequences of motif entropy sampling:
- Energy/Force Accuracy (CrCoNi): MBS-trained MLPs achieve low per-atom energy errors with little composition-dependent variation, even at high SRO, compared with 3–5 meV/atom and substantially larger variation for randomly sampled MLPs; force RMSE is consistently small.
- Phase Diagrams: Predicted fcc–bcc transition boundaries for Cr–Ni and Cr–Co binaries match experiment and CALPHAD within 25–30 K; Au–Pt miscibility gap critical points deviate by only 5% from early experimental data.
- Melting Points: For fcc CrCoNi, the predicted melting temperature lies 3% below experiment. For bcc TaTiVW and derivatives, melting predictions are within 2–5% of experimental windows.
- Short-Range Order and Lattice Expansion: Predicted Warren–Cowley short-range-order parameters and lattice parameters for various compositions and temperatures are within experimental uncertainty.
- Thermodynamics: Specific heats ($C_p$) of TaTiVW alloys are predicted within 2–13% of high-temperature NIST and other measurements.
- Fault Energies: Stacking-fault energies in CrCoNi at 500 K map correctly and match first-principles trends as Cr is reduced.
Crucially, MBS-MLPs reduce composition-dependent variation of energy and force errors by two orders of magnitude compared to broader “universal” MLPs (MatterSim, Orb, MACE).
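The Warren–Cowley parameter used in the short-range-order comparisons has the standard form $\alpha_{ij} = 1 - p_{ij}/c_j$, where $p_{ij}$ is the probability that a neighbor of an $i$-type atom is $j$-type and $c_j$ is the overall concentration of $j$. A minimal sketch on a toy ordered ring lattice (the lattice and occupation are illustrative):

```python
import numpy as np

def warren_cowley(species, neighbors, i_type, j_type):
    """alpha_ij = 1 - p_ij / c_j. Negative values: unlike neighbors favored
    (ordering); positive values: like neighbors favored (clustering)."""
    c_j = float(np.mean(species == j_type))
    pairs = [(a, b) for a in range(len(species)) if species[a] == i_type
             for b in neighbors[a]]
    p_ij = float(np.mean([species[b] == j_type for _, b in pairs]))
    return 1.0 - p_ij / c_j

# Toy ring lattice with perfectly alternating A/B occupation (maximal ordering)
n = 12
neighbors = [((i - 1) % n, (i + 1) % n) for i in range(n)]
species = np.array([0, 1] * (n // 2))
print(warren_cowley(species, neighbors, 0, 1))   # -1.0: every neighbor is unlike
```

A fully random solid solution would give $\alpha_{ij} \approx 0$, so the parameter directly quantifies the degree of SRO that the MLP predictions are benchmarked against.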
5. Computational Requirements and Scaling
The cost of motif-based sampling and dataset construction is negligible relative to reference quantum mechanical calculations. For example, a 702-structure MBS dataset can be built in less than 10% of the CPU time needed for a single-composition DFT–MC trajectory, yet delivers substantially improved transferability. PACE MLPs exhibit linear ($\mathcal{O}(N)$) scaling with system size, and inference is efficient enough to support atomistic molecular dynamics of up to millions of atoms.
Dataset sizes for coverage (examples from the work):
- Au–Pt: configurations across stoichiometries and SRO.
- Cu–Au: configurations in analogous sampling.
- Cr–Co–Ni: 66 (benchmark) and 702 (production) structures covering 12 compositions, multiple phases, and stoichiometric compounds.
- Ti–Ta–V–W: configurations targeting equiatomic and derivative compositions.
6. Implications, Extensions, and Future Perspectives
The direct enforcement of motif uniformity achieves order-of-magnitude improvements in compositional transferability, enabling a single MLP to quantitatively predict properties from binary subsystems to high-entropy phases and even liquids without retraining. The motif-based approach is system-agnostic, and direct application to other metallic multicomponent alloys, intermetallics, and even non-metallic systems is feasible, contingent on the definition of an appropriate motif alphabet.
Prospective extensions include integration with generative structure proposals to automate the exploration and inclusion of relevant metastable phases. The methodology lays a foundation for the systematic construction of interatomic potentials that are both data-efficient and physically robust across the full compositional and structural landscape—directly addressing a principal limitation of prior universal or database-trained models.
In summary, the combination of information-theoretic sampling (maximally uniform motif entropy) with robust, body-ordered ACE models (PACE) yields a practical and efficient strategy for constructing transferable, high-fidelity MLPs for alloys. These models simultaneously capture chemical diversity, structural complexity, and thermal perturbations, providing a rigorous foundation for predictive atomistic simulation across phase diagrams, property trends, and order–disorder regimes.