Multi-fidelity Surrogates
- Multi-fidelity surrogates are predictive models that fuse low-cost, low-accuracy data with expensive high-fidelity data to enable efficient design exploration.
- The LS-MFS framework applies linear regression by incorporating a scaling factor and discrepancy function, yielding robust parameter estimates and rapid evaluations.
- These models are widely used for design optimization, uncertainty quantification, and sensitivity analysis, offering a computationally lightweight alternative to more complex Bayesian approaches.
Multi-fidelity surrogates are predictive models that synthesize data from computational or experimental sources with varying accuracy and cost. Their primary purpose is to enable efficient and accurate emulation of expensive high-fidelity models by leveraging abundant low-cost, lower-fidelity data. These frameworks have found wide adoption in design optimization, uncertainty quantification, sensitivity analysis, and reliability assessment in computational science and engineering. The following provides a rigorous, detailed exposition of the least-squares multi-fidelity surrogate (LS-MFS) framework, as introduced and analyzed by Zhang et al. (Zhang et al., 2017).
1. Linear Regression Formulation
The central structure of LS-MFS is the approximation of a high-fidelity model using both a scaled low-fidelity model and a discrepancy function expressed in a user-specified basis. The surrogate takes the form: where is a scale factor bridging the trend between fidelities, and is the modeling discrepancy expanded as
with denoting known basis functions (e.g., polynomials), and their coefficients. Thus, the total surrogate becomes
This proposal regards as a basis element in the space spanned by the regression model, with as its regression coefficient.
2. Design Matrix and Parameter Estimation
Given high-fidelity observations , one constructs the response vector . The design matrix is built as
The stacked parameter vector is . The regression equation reads
where represents residuals.
The parameter solution is given by the normal equations,
provided has full column rank (). This guarantees uniqueness and stability in the parameter estimates. This approach is extendable: for multi-level fidelity schemes, one adds more columns (e.g., , ) with associated scale factors.
3. Extensions and Algorithmic Variants
Several modifications are supported:
- The basis can be polynomials, radial basis functions, splines, wavelets, or arbitrary feature transformations.
- Regularization (ridge, lasso) can be introduced for noisy or collinear settings.
- For non-polynomial or heavy-tailed errors, robust regression (e.g., Huber loss) or Bayesian linear models are appropriate. Ordinary least squares provides maximum-likelihood estimation under Gaussian noise.
- The core methodology generalizes to nonlinear regression models, such as kernel machines or neural networks, by including as an input feature.
- The method is readily extensible to more than two fidelities.
4. Comparison with Gaussian-Process-Based Approaches
Bayesian multi-fidelity surrogates (e.g., Kennedy–O’Hagan models with Gaussian-process priors) provide:
- Predictive means and covariances, enabling uncertainty quantification.
- Flexible handling of spatial correlations and model discrepancies.
However, Gaussian-process methods require nonlinear hyperparameter optimization (typically by maximum likelihood), which is computationally intensive and susceptible to local minima—especially under noisy data or small sample sizes. In contrast, LS-MFS offers:
- A single closed-form estimation step.
- Robustness under small or high-data-noise conditions.
- Low computational overhead and direct extensibility to arbitrary regression models.
The primary limitations of LS-MFS are absence of built-in spatial correlation modeling and the need to reconstruct variances/intervals from classical regression theory rather than inheriting them from the surrogate.
5. Application Areas
LS-MFS is applicable to a range of tasks:
- Prediction variance is computed by regression theory, e.g.,
where is the sample variance of residuals.
- Design of experiments: D-optimal design can be pursued by maximizing .
- Uncertainty propagation: once trained, the surrogate enables rapid sampling for downstream uncertainty quantification.
- Design optimization: the surrogate’s algebraic form is differentiable and can be efficiently embedded in gradient-based optimization loops.
6. Numerical Example and Performance
Zhang et al. present a worked example involving the function
with additive Gaussian noise. The low-fidelity model is a perturbation of . Using minimal (2×2, 3×3) factorial designs and polynomial bases for the discrepancy, the LS-MFS dramatically outperforms a pure high-fidelity polynomial response surface: at LS-MFS achieves relative RMSE compared to for the high-fidelity surface; at the advantage grows to vs. .
7. Significance, Limitations, and Outlook
LS-MFS provides a mathematically transparent, computationally lightweight fusion of low- and high-fidelity models. Its conceptual simplicity—treating as a regression basis and fitting all coefficients in a single linear system—results in unique, interpretable models under standard rank assumptions and facilitates rapid experimentation and design studies.
Key drawbacks are its lack of automatic spatial correlation modeling and the absence of direct Bayesian intervals, limiting its performance for non-polynomially behaving functions or in applications demanding rigorous probabilistic uncertainty quantification. In such cases, advanced Bayesian frameworks may be preferred, despite their greater computational complexity.
A plausible implication is that for rapid design-space exploration or in settings with extremely limited high-fidelity data, LS-MFS may serve as a robust, first-line surrogate, delegating to more sophisticated models only when strict spatial variance modeling or full Bayesian inference is necessary (Zhang et al., 2017).