Machine Learned Interatomic Potentials

Updated 20 December 2025
  • Machine Learned Interatomic Potentials (MLIPs) are data-driven surrogate models that predict energies, forces, and material properties with near first-principles accuracy at a fraction of the cost of direct electronic-structure calculations.
  • They decompose total energy into contributions from local atomic environments using methods like kernel regression, neural networks, and moment tensor expansions with built-in symmetry constraints.
  • Recent advances include multi-fidelity training, uncertainty quantification, and model compression techniques that enhance MLIP reliability, scalability, and applicability to complex systems.

Machine Learned Interatomic Potentials (MLIP)

Machine learned interatomic potentials (MLIPs) are data-driven surrogate models for the quantum-mechanical potential energy surface (PES) of materials, molecules, and condensed phases. MLIPs aim to achieve near first-principles accuracy in predicting energies, forces, and derived properties while offering several orders of magnitude speedup over direct electronic-structure calculations. Their mathematical form typically expresses the total energy as a sum or functional of local atomic environments, with parametrization learned from datasets of reference calculations—most commonly density functional theory (DFT), but increasingly also higher-level ab initio data or multi-fidelity combinations. Modern MLIPs leverage physically informed symmetry constraints, advanced regression techniques, uncertainty quantification, and hybrid loss functions to extend their accuracy and reliability across broad classes of chemical and configurational complexity.

1. Mathematical Formulation and Model Classes

All MLIPs share a common structure wherein the potential energy is decomposed into contributions from local atomic environments, $E_\text{tot} = \sum_{i=1}^{N} E_i(\mathcal{G}_i)$, where $\mathcal{G}_i$ denotes the graph or local environment centered on atom $i$. Models differ in the representation of $\mathcal{G}_i$, the functional form for $E_i$, and how symmetries are built in.
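To make this decomposition concrete, here is a minimal, illustrative sketch in Python/JAX: a toy radial descriptor plays the role of $\mathcal{G}_i$ and a small feed-forward network plays $E_i$. The function names, shapes, and descriptor choice are assumptions for illustration, not any particular published architecture.

```python
import jax
import jax.numpy as jnp

def radial_descriptor(positions, i, cutoff=5.0, widths=jnp.linspace(0.5, 4.5, 8)):
    """Toy descriptor of atom i's local environment: a Gaussian-smeared histogram
    of neighbor distances inside a cutoff (a stand-in for ACE/SOAP-style features)."""
    diff = positions - positions[i]
    rij = jnp.sqrt(jnp.sum(diff**2, axis=1) + 1e-12)   # eps keeps gradients finite at r = 0
    # Hard cutoff and self-exclusion for brevity; real MLIPs use smooth cutoff functions.
    neighbor = (jnp.arange(positions.shape[0]) != i) & (rij < cutoff)
    gauss = jnp.exp(-(rij[:, None] - widths[None, :]) ** 2)
    return jnp.sum(jnp.where(neighbor[:, None], gauss, 0.0), axis=0)

def atomic_energy(descriptor, params):
    """E_i(G_i): a tiny one-hidden-layer network mapping a local descriptor to an energy."""
    hidden = jnp.tanh(descriptor @ params["W1"] + params["b1"])
    return (hidden @ params["W2"] + params["b2"]).squeeze()

def total_energy(positions, params):
    """E_tot = sum_i E_i(G_i): total energy as a sum over local atomic contributions."""
    descriptors = jnp.stack([radial_descriptor(positions, i)
                             for i in range(positions.shape[0])])
    return jnp.sum(jax.vmap(atomic_energy, in_axes=(0, None))(descriptors, params))
```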

Representative Model Architectures

Architectures in current use (all referenced later in this article) include linear parametrizations such as the moment tensor potential (MTP) and SNAP, the atomic cluster expansion (ACE) in linear and nonlinear form, kernel-based Gaussian approximation potentials (GAP), descriptor-based neural network potentials (e.g., AENET), and E(3)-equivariant message-passing graph networks such as NequIP, Allegro, and MACE.

Force and stress predictions are obtained by differentiating the total energy, typically via automatic differentiation: $\vec F_i = -\frac{\partial E_\text{tot}}{\partial \vec r_i}$ for the force on atom $i$, and $\Xi_{\alpha\beta} = \frac{1}{V}\,\frac{\partial E_\text{tot}}{\partial \epsilon_{\alpha\beta}}$ for the stress, where $\epsilon_{\alpha\beta}$ is the strain tensor and $V$ the cell volume.
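In an autodiff framework this differentiation is a one-liner. A minimal usage sketch, assuming the toy `total_energy` from the previous listing is in scope (the parameter shapes and the random test configuration are purely illustrative):

```python
import jax
import jax.numpy as jnp

# Toy parameters matching the descriptor/network shapes sketched above.
key = jax.random.PRNGKey(0)
params = {
    "W1": 0.1 * jax.random.normal(key, (8, 16)),
    "b1": jnp.zeros(16),
    "W2": 0.1 * jax.random.normal(key, (16, 1)),
    "b2": jnp.zeros(1),
}
positions = 4.0 * jax.random.uniform(key, (4, 3))   # 4 atoms in a toy box

energy = total_energy(positions, params)                        # E_tot
forces = -jax.grad(total_energy, argnums=0)(positions, params)  # F_i = -dE/dr_i
print(energy, forces.shape)   # scalar energy and a (4, 3) force array
```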

2. Dataset Generation, Loss Functions, and Training Protocols

Dataset Generation

Effective MLIP training requires that the dataset samples the relevant configurational, chemical, and thermodynamic space. Protocols include:

  • Entropy-maximized sampling and leverage-score subsampling for highly diverse atomic environments, reducing the number of expensive reference calculations required for a target accuracy (Baghishov et al., 6 Jun 2025); see the subsampling sketch after this list.
  • Genetic algorithm–driven structural exploration to capture unusual bonding topologies and non-equimolar compositions in complex materials like Si–C (MacIsaac et al., 23 Mar 2024).
  • Multi-fidelity hierarchical data: Simultaneous training on low-cost, lower-fidelity data (e.g., GGA) and high-level, expensive data (meta-GGA, RPA, CCSD(T)), with one-hot encoded fidelity indices and learnable per-fidelity corrections (Kim et al., 12 Sep 2024).
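As an illustration of descriptor-based subset selection, the sketch below ranks candidate environments by statistical leverage scores computed from a descriptor matrix. This is a generic recipe, not the exact protocol of the cited work, and the matrix dimensions are arbitrary.

```python
import numpy as np

def leverage_scores(X):
    """Statistical leverage of each row of the descriptor matrix X (n_candidates x n_features):
    the diagonal of the hat matrix X (X^T X)^-1 X^T, computed stably from a thin SVD."""
    U, _, _ = np.linalg.svd(X, full_matrices=False)
    return np.sum(U ** 2, axis=1)

def select_training_set(X, n_select):
    """Return indices of the n_select candidates with the largest leverage,
    i.e., the environments least well represented by the rest of the pool."""
    return np.argsort(leverage_scores(X))[::-1][:n_select]

# Usage: one descriptor vector per candidate structure/environment;
# only the selected rows are then labeled with expensive reference calculations.
X = np.random.default_rng(0).normal(size=(500, 32))
dft_queue = select_training_set(X, n_select=50)
```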

Loss Function Formulations

Loss objectives are typically weighted sums over energies, forces, and sometimes virials or higher derivatives:

$$\mathcal{L} = w_E \sum \left(E^\text{MLIP} - E^\text{DFT}\right)^2 + w_F \sum \left(\vec F^\text{MLIP} - \vec F^\text{DFT}\right)^2 + w_V \sum \left(\Xi^\text{MLIP} - \Xi^\text{DFT}\right)^2 + \lambda \lVert\theta\rVert^2$$

Force weighting is critical for robust MD and property prediction (Baghishov et al., 6 Jun 2025, Choyal et al., 2023). Advanced loss terms include physics-informed consistency penalties, such as the path-independence and Taylor-expansion constraints discussed in Section 5 (Takamoto et al., 23 Jul 2024).
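A minimal sketch of this composite loss (the dictionary layout and the weight values w_E, w_F, w_V are illustrative assumptions):

```python
import jax
import jax.numpy as jnp

def composite_loss(pred, ref, params=None, w_E=1.0, w_F=10.0, w_V=0.1, l2=1e-6):
    """Weighted sum of squared energy, force, and virial errors plus L2 regularization.
    pred and ref are dicts of arrays: 'energy' (n_structures,), 'forces' (n_atoms, 3),
    'virial' (n_structures, 3, 3)."""
    loss = w_E * jnp.sum((pred["energy"] - ref["energy"]) ** 2)
    loss += w_F * jnp.sum((pred["forces"] - ref["forces"]) ** 2)
    loss += w_V * jnp.sum((pred["virial"] - ref["virial"]) ** 2)
    if params is not None:   # lambda * ||theta||^2 over all model parameters
        loss += l2 * sum(jnp.sum(p ** 2) for p in jax.tree_util.tree_leaves(params))
    return loss
```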

Training employs optimizers such as Adam or L-BFGS, dynamic learning-rate schedules, and sometimes early stopping based on validation performance (Brunken et al., 28 May 2025, Choyal et al., 2023).

3. Recent Methodological Advances

3.1 Uncertainty Quantification and Active Learning

MLIP reliability under extrapolation and in unexplored configurational regions is addressed by:

  • Ensemble- and Bayesian-based uncertainties: Ensemble spread (epistemic) and data-likelihood (aleatoric) components are quantified, guiding acquisition in active learning cycles (Kang et al., 18 Sep 2024, Coscia et al., 19 Aug 2025); a minimal committee-disagreement sketch follows this list.
  • Active learning loops for strongly anharmonic systems: Uncertainty-driven selection (using per-atom force-uncertainty maxima and anharmonicity markers) accelerates the discovery of rare events, prevents spurious minima or missed metastable states, and keeps MD trajectories physically valid (Kang et al., 18 Sep 2024).
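A minimal sketch of ensemble-spread uncertainty for acquisition (generic committee disagreement; the Bayesian treatments in the cited papers are more elaborate, and the threshold-based selection rule here is an illustrative assumption):

```python
import numpy as np

def force_uncertainty(ensemble_forces):
    """Per-atom epistemic uncertainty from committee disagreement: the norm of the
    standard deviation of force predictions across an ensemble of MLIPs.
    ensemble_forces: array of shape (n_models, n_atoms, 3)."""
    return np.linalg.norm(np.std(ensemble_forces, axis=0), axis=-1)   # shape (n_atoms,)

def select_for_labeling(candidate_ensemble_forces, threshold):
    """Flag configurations whose maximum per-atom force uncertainty exceeds a
    threshold; these are sent back to DFT in the next active-learning iteration.
    candidate_ensemble_forces: list of (n_models, n_atoms, 3) arrays."""
    return [idx for idx, f in enumerate(candidate_ensemble_forces)
            if force_uncertainty(f).max() > threshold]
```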

3.2 Multi-Fidelity and Δ-Learning

To overcome the scarcity and expense of high-accuracy labels:

  • Multi-fidelity GNN schemes: Simultaneous training on GGA and meta-GGA/CCSD(T) via shared and fidelity-specific network weights, achieving near-gold-standard accuracy with <20% high-fidelity data supplement (Kim et al., 12 Sep 2024).
  • Δ-Learning: Fitting an MLIP to the difference between a baseline (e.g., DFT-D or tight-binding) and a high-level method (e.g., CCSD(T)), allowing rapid application of chemical accuracy to large or periodic systems, including vdW-dominated structures (Ikeda et al., 19 Aug 2025).
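Schematically, Δ-learning amounts to regressing the residual between two levels of theory. In the sketch below, `baseline_energy`, `fit_mlip`, and the `predict` method are hypothetical placeholders, not the API of any specific package:

```python
def train_delta_model(structures, high_level_energies, baseline_energy, fit_mlip):
    """Delta-learning: fit an MLIP to the residual E_high - E_baseline, so the cheap
    baseline carries the bulk of the physics and the MLIP learns only the correction."""
    residuals = [e_high - baseline_energy(s)
                 for s, e_high in zip(structures, high_level_energies)]
    return fit_mlip(structures, residuals)

def delta_energy(structure, baseline_energy, delta_mlip):
    """Prediction at (approximately) high-level accuracy: baseline plus learned correction."""
    return baseline_energy(structure) + delta_mlip.predict(structure)
```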

3.3 Model Compression and Efficiency

Scaling MLIPs to large, multi-component systems and long MD trajectories necessitates efficiency:

  • Low-rank matrix/tensor decompositions in MTP reduce the parameter count by up to 50% with negligible accuracy loss, with per-atom evaluation cost shrinking in proportion to the parameter count (Vorotnikov et al., 4 Sep 2025); see the factorization sketch after this list.
  • Strictly local E(3)-equivariant architectures (e.g., Allegro) further accelerate evaluation while maintaining high accuracy for defected and large-scale 2D systems (Janisch et al., 12 Dec 2025).
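The compression idea can be illustrated with a plain truncated SVD of a generic parameter matrix; this is a hedged sketch of low-rank factorization in general, not the specific MTP tensor decomposition of the cited work:

```python
import numpy as np

def low_rank_compress(W, rank):
    """Replace an (m x n) parameter matrix with factors A (m x r) and B (r x n) via a
    truncated SVD, cutting storage and matrix-vector cost from m*n to r*(m+n)."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :rank] * s[:rank]    # absorb singular values into the left factor
    B = Vt[:rank, :]
    return A, B

W = np.random.default_rng(1).normal(size=(200, 120))      # stand-in parameter block
A, B = low_rank_compress(W, rank=20)
print(A.size + B.size, "parameters instead of", W.size)   # 6400 vs 24000
```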

4. Benchmarking, Validation, and Performance

Extensive benchmarking reveals strengths and trade-offs among MLIP frameworks:

| Model class | Energy error (meV/atom) | Force RMSE (meV/Å) | CPU cost (ms/atom·step) |
|---|---|---|---|
| MACE, NequIP | 0.8–1.5 | 17–40 | 1–5 |
| Allegro | 1.6–2.0 | 35–45 | 2–3 |
| Nonlinear ACE | 1.5–2.0 | 30–50 | 0.1–0.2 |
| MTP | 5.0 | 80 | 0.1 |

Performance metrics are taken from (Leimeroth et al., 5 May 2025). The appropriate choice depends on system size, available computational resources, and required accuracy.

5. Physical Fidelity, Generalization, and Best Practices

Physics-Informed Regularization and Generalization Theory

  • Energy–force consistency: Incorporating physics-informed auxiliary losses (path-independence, Taylor expansion) regularizes MLIPs in data-sparse regimes, ensuring smooth energy landscapes and robust MD, even without explicit force labels (Takamoto et al., 23 Jul 2024).
  • Training cell size and observable selection: Generalization error decays as the size of training supercells increases, and as higher-order quantities (forces, force constants) are included in the fitting loss (Ortner et al., 2022).
  • Composite loss normalization: Proper weighting (e.g., scaling such that energy error ~ (force error)²) improves generalization to new configurations.

Practical Guidelines

  • Leverage-score and entropy-maximized sampling systematically reduce the number of required expensive ab initio calculations (Baghishov et al., 6 Jun 2025).
  • The energy:force loss ratio should be tuned to the noise in the training labels; larger force weights are preferred when energies are imprecise or less well converged.
  • For multi-component, highly disordered or high-entropy systems, robust linear models (MTP, SNAP) outperform neural or kernel models in low-data regimes, but nonlinear models (AENET, GAP) ultimately achieve higher accuracy and transferability given sufficient data (Choyal et al., 2023).
  • Incorporation of explicit long-range physics (e.g., D3 dispersion or QEq charge equilibration) extends MLIPs to vdW-bonded and charge-heterogeneous systems with minimal modifications (Sauer et al., 8 Apr 2025, Maruf et al., 23 Mar 2025).

6. Current Limitations and Emerging Directions

  • Chemical diversity: While current MLIPs perform well within their interpolation domains, extension across chemical space (new elements, charge states, transfer to interfaces) requires richer descriptor bases, adaptive architectures, and physics-motivated regularization (Vorotnikov et al., 4 Sep 2025, Janisch et al., 12 Dec 2025).
  • Long-range interactions: Incorporation of global charge redistribution (e.g., in NequIP-LR), physically motivated dispersion corrections, and explicit multipole models further expands MLIP access to complex electrostatics (Maruf et al., 23 Mar 2025, Sauer et al., 8 Apr 2025).
  • Uncertainty estimation for active learning: Bayesian frameworks (BLIP) and ensemble knowledge distillation enable automated, uncertainty-driven dataset construction and robust model refinement (Coscia et al., 19 Aug 2025, Kang et al., 18 Sep 2024).
  • Data efficiency at high-fidelity: Multi-fidelity and Δ-learning protocols permit the extraction of chemical accuracy with a fraction of the CCSD(T)-level or meta-GGA data otherwise required (Kim et al., 12 Sep 2024, Ikeda et al., 19 Aug 2025).
  • Open-source adoption and workflow integration: Modular libraries (e.g., mlip (Brunken et al., 28 May 2025)) consolidate model training, evaluation, and integration with major MD engines (ASE, JAX-MD, LAMMPS), promoting rapid deployment and reproducibility.
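Workflow integration typically goes through ASE's calculator interface. A minimal sketch of MLIP-driven MD follows, where the `MLIPCalculator` import, its `model_path` argument, and the model file name are hypothetical placeholders rather than the API of the mlip library or any specific package:

```python
from ase.build import bulk
from ase import units
from ase.md.langevin import Langevin

# Hypothetical import: substitute the ASE calculator class exposed by your MLIP
# framework (e.g., a trained MACE/NequIP/mlip model wrapped as an ASE calculator).
from my_mlip_package import MLIPCalculator

atoms = bulk("Si", "diamond", a=5.43, cubic=True).repeat((3, 3, 3))  # 216-atom Si supercell
atoms.calc = MLIPCalculator(model_path="model.ckpt")                 # hypothetical constructor

# Langevin NVT dynamics at 300 K with a 1 fs timestep, forces supplied by the MLIP.
dyn = Langevin(atoms, timestep=1.0 * units.fs, temperature_K=300.0, friction=0.002)
dyn.run(1000)
print("Final potential energy (eV):", atoms.get_potential_energy())
```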

MLIPs thus provide a general, extensible, and physically sound framework for high-throughput atomistic simulation, accelerated materials design, and fundamental studies of structural and dynamical phenomena across chemistry and materials science. Their ongoing development continues to close the gap between quantum-chemical accuracy and tractable simulation of large-scale, complex systems.

