Physics-Informed GPC for Alloy Design
- Physics-Informed GPC is a Bayesian framework that embeds domain-specific physics knowledge into Gaussian process classification, enhancing constraint satisfaction in alloy discovery.
- The method leverages tailored priors and hyperparameter optimization via marginal likelihood to reduce posterior uncertainty and guide active learning in high-dimensional design spaces.
- Empirical case studies demonstrate improved accuracy metrics and reduced experimental costs in alloy design, validating the integration of physics-based models in GPC.
Physics-Informed Gaussian Process Classification (GPC) refers to a Bayesian framework in which the Gaussian Process (GP) prior mean is endowed with explicit, domain-specific physics-based information, enabling the model to capture and enforce constraints directly relevant to materials and alloy design. The approach addresses constraint-satisfaction in alloy discovery by uniting physical models, probabilistic classification, and active learning to efficiently navigate feasible and optimal regions of high-dimensional design spaces (Hardcastle et al., 17 Feb 2025).
1. Mathematical Foundation
The core of physics-informed GPC is the latent function prior:
where is a physics-informed prior mean, and is typically an RBF plus white noise kernel. For a collection of points , the vector of latent function values has a multivariate normal prior:
Classification is recast as a regression problem using pseudo-targets and Gaussian likelihood:
Posterior inference proceeds as in standard GP regression. Letting and denote the posterior mean and variance at test point 0,
1
2
The predicted classification probability is recovered using the sigmoid transformation:
3
2. Physics-Informed Prior Encoding
The specification of the prior mean 4 is the central mechanism for integrating physics-based constraints:
Case Study 1 (CALPHAD-based Phase Stability):
For a given alloy composition 5, equilibrium phase fractions 6, etc., are computed via Thermo-Calc. These are mapped to prior class probabilities 7, e.g., 8 if 9. For each one-vs-rest binary classifier, the prior mean is 0.
Case Study 2 (Valence Electron Concentration):
The Valence Electron Concentration is evaluated as 1, and discrete prior probabilities (Table 4 in (Hardcastle et al., 17 Feb 2025)) are assigned, e.g., 2, 3, 4, leading to 5.
Case Study 3 (Yield-Strength Constraint):
The Maresca–Curtin model, evaluated at 6C, provides 7, directly used as the prior mean in regression for predicting high-temperature yield strength 8, i.e., 9.
3. Hyperparameter Optimization
Kernel hyperparameters 0 and free parameters in 1 are optimized by maximizing the marginal likelihood:
2
Gradients 3 are computed via standard GPR identities, and optimization is performed using L-BFGS-B over 10–50 random restarts (Hardcastle et al., 17 Feb 2025). This guarantees that model flexibility and prior structure jointly fit observed pseudo-classification data.
4. Active Learning and Constraint Incorporation
Information-efficient exploration is achieved by Shannon entropy-based acquisition:
4
The next candidate for acquisition is
5
For categorical constraints (Case 2):
A one-vs-rest GPC ensemble is constructed, and entropy of the predicted probability vector 6 drives experimental campaign selection, focusing on high-uncertainty (e.g., phase boundary) regions.
For continuous threshold constraints (Case 3):
Given predictive normal distribution 7, the probability of exceeding the threshold 8 is
9
This binary-classification proxy is used in the entropy criterion above.
5. Empirical Results Across Case Studies
Three distinct application studies anchor the framework:
| Case | Constraint Type | Prior Model | Performance Impact |
|---|---|---|---|
| 1 | Phase stability, static | CALPHAD | Median accuracy 0 vs. 1–2 for controls; AUC up 3, tighter recall, F1, Brier loss [Fig 4] |
| 2 | Categorical, active | VEC rule | Physics-informed AL converges in 4 iterations vs 5 for vanilla; 6–7 higher early accuracy; lower 8(accuracy) [Fig 5] |
| 3 | Continuous threshold, active | Maresca–Curtin model | Recall 9 vs 0 for vanilla; Brier loss down by 1; log-loss down by 2 in first 3 iterations [Fig 6,7] |
In all scenarios, the introduction of physics-based 4 enhanced both predictive accuracy and sample efficiency, especially in data-scarce regimes or with expensive experimental endpoints (e.g., XRD, mechanical tests) (Hardcastle et al., 17 Feb 2025).
6. Functional Advantages and Limitations
Physics-informed priors sharply reduce posterior uncertainty where domain knowledge is robust, improving model calibration and recall. This leads to fewer high-cost experiments needed to map feasible regions or constraints. Across recall, accuracy, F1, Brier loss, and log-loss, models with 5 outperform both purely statistical (vanilla GPC, uniform prior) and heuristic (pure CALPHAD) baselines in all tested alloy-design scenarios. Notably, the surrogate regression formulation simplifies implementation but introduces a Gaussian likelihood approximation, differing from full Laplace/EP GPC inference methods.
A reliance on sufficiently accurate physical models (e.g., CALPHAD, VEC) is a limitation: biased priors can mislead the classifier. The post hoc normalization in the one-vs-rest ensemble may impair multiclass probabilistic calibration. Future extensions include replacing the surrogate with true non-Gaussian inference and extending the framework to multi-constraint, multi-objective Bayesian optimization by chaining constraint classifiers with physics-informed GP regressors for objectives.
7. Outlook and Broader Implications
Physics-Informed GPC formalizes a unification of mechanistic modeling, Bayesian classification, and sample-efficient exploration for constraint-driven scientific discovery. By explicitly embedding physics-based approximations into the GP mean, the approach achieves robust extrapolation, improved active learning navigation, and substantial cost reductions for experimental alloy design. A plausible implication is enhanced design efficiency in any setting where feasible regions are expensive to probe and credible physical models exist. Continued research may enable hierarchical or compositional priors, full multiclass calibration, and broad application to multi-objective optimization in complex scientific domains (Hardcastle et al., 17 Feb 2025).