Papers
Topics
Authors
Recent
Search
2000 character limit reached

Multi-fidelity Surrogates

Updated 2 March 2026
  • Multi-fidelity surrogates are predictive models that fuse low-cost, low-accuracy data with expensive high-fidelity data to enable efficient design exploration.
  • The LS-MFS framework applies linear regression by incorporating a scaling factor and discrepancy function, yielding robust parameter estimates and rapid evaluations.
  • These models are widely used for design optimization, uncertainty quantification, and sensitivity analysis, offering a computationally lightweight alternative to more complex Bayesian approaches.

Multi-fidelity surrogates are predictive models that synthesize data from computational or experimental sources with varying accuracy and cost. Their primary purpose is to enable efficient and accurate emulation of expensive high-fidelity models by leveraging abundant low-cost, lower-fidelity data. These frameworks have found wide adoption in design optimization, uncertainty quantification, sensitivity analysis, and reliability assessment in computational science and engineering. The following provides a rigorous, detailed exposition of the least-squares multi-fidelity surrogate (LS-MFS) framework, as introduced and analyzed by Zhang et al. (Zhang et al., 2017).

1. Linear Regression Formulation

The central structure of LS-MFS is the approximation of a high-fidelity model fH(x)f_H(x) using both a scaled low-fidelity model fL(x)f_L(x) and a discrepancy function expressed in a user-specified basis. The surrogate takes the form: fH(x)ρfL(x)+δ(x)(1)f_H(x) \approx \rho\, f_L(x) + \delta(x) \qquad (1) where ρ\rho is a scale factor bridging the trend between fidelities, and δ(x)\delta(x) is the modeling discrepancy expanded as

δ(x)=i=1pbiXi(x)\delta(x) = \sum_{i=1}^p b_i\,X_i(x)

with X1(x),,Xp(x)X_1(x), \ldots, X_p(x) denoting known basis functions (e.g., polynomials), and bib_i their coefficients. Thus, the total surrogate becomes

fH(x)ρfL(x)+i=1pbiXi(x)(2)f_H(x) \approx \rho\, f_L(x) + \sum_{i=1}^p b_i\,X_i(x) \qquad (2)

This proposal regards fL(x)f_L(x) as a basis element in the space spanned by the regression model, with ρ\rho as its regression coefficient.

2. Design Matrix and Parameter Estimation

Given nn high-fidelity observations {(x(j),yH(j))}j=1n\{(x^{(j)}, y_H^{(j)})\}_{j=1}^n, one constructs the response vector yH=(yH(1),...,yH(n))Ty_H = (y_H^{(1)}, ..., y_H^{(n)})^T. The design matrix DRn×(p+1)D \in \mathbb{R}^{n \times (p+1)} is built as

D=[fL(x(1))X1(x(1))Xp(x(1))  fL(x(n))X1(x(n))Xp(x(n))]D = \begin{bmatrix} f_L(x^{(1)}) & X_1(x^{(1)}) & \cdots & X_p(x^{(1)}) \ \vdots & \vdots & & \vdots \ f_L(x^{(n)}) & X_1(x^{(n)}) & \cdots & X_p(x^{(n)}) \end{bmatrix}

The stacked parameter vector is β=[ρ,b1,,bp]T\beta = [\rho, b_1, \ldots, b_p]^T. The regression equation reads

Dβ=yH+ϵD\,\beta = y_H + \epsilon

where ϵ\epsilon represents residuals.

The parameter solution is given by the normal equations,

β^=(DTD)1DTyH\hat{\beta} = (D^T D)^{-1} D^T y_H

provided DD has full column rank (p+1p+1). This guarantees uniqueness and stability in the parameter estimates. This approach is extendable: for multi-level fidelity schemes, one adds more columns (e.g., fL2(x)f_{L2}(x), fL3(x)f_{L3}(x)) with associated scale factors.

3. Extensions and Algorithmic Variants

Several modifications are supported:

  • The basis {Xi(x)}\{X_i(x)\} can be polynomials, radial basis functions, splines, wavelets, or arbitrary feature transformations.
  • Regularization (ridge, lasso) can be introduced for noisy or collinear settings.
  • For non-polynomial or heavy-tailed errors, robust regression (e.g., Huber loss) or Bayesian linear models are appropriate. Ordinary least squares provides maximum-likelihood estimation under Gaussian noise.
  • The core methodology generalizes to nonlinear regression models, such as kernel machines or neural networks, by including fL(x)f_L(x) as an input feature.
  • The method is readily extensible to more than two fidelities.

4. Comparison with Gaussian-Process-Based Approaches

Bayesian multi-fidelity surrogates (e.g., Kennedy–O’Hagan models with Gaussian-process priors) provide:

  • Predictive means and covariances, enabling uncertainty quantification.
  • Flexible handling of spatial correlations and model discrepancies.

However, Gaussian-process methods require nonlinear hyperparameter optimization (typically by maximum likelihood), which is computationally intensive and susceptible to local minima—especially under noisy data or small sample sizes. In contrast, LS-MFS offers:

  • A single closed-form estimation step.
  • Robustness under small nn or high-data-noise conditions.
  • Low computational overhead and direct extensibility to arbitrary regression models.

The primary limitations of LS-MFS are absence of built-in spatial correlation modeling and the need to reconstruct variances/intervals from classical regression theory rather than inheriting them from the surrogate.

5. Application Areas

LS-MFS is applicable to a range of tasks:

  • Prediction variance is computed by regression theory, e.g.,

Var(y^)=σ2(DTD)1DTD(DTD)1\operatorname{Var}(\hat{y}) = \sigma^2 (D^T D)^{-1} D^T D (D^T D)^{-1}

where σ2\sigma^2 is the sample variance of residuals.

  • Design of experiments: D-optimal design can be pursued by maximizing det(DTD)\det(D^T D).
  • Uncertainty propagation: once trained, the surrogate enables rapid sampling for downstream uncertainty quantification.
  • Design optimization: the surrogate’s algebraic form is differentiable and can be efficiently embedded in gradient-based optimization loops.

6. Numerical Example and Performance

Zhang et al. present a worked example involving the function

fH(x1,x2)=1001exp(2x2)300x3+1900x2+2092x1+60+1001exp(2x2)100x3+500x2+4x1+20f_H(x_1, x_2) = 100\frac{1 - \exp(-2x_2)}{300x_3 + 1900x_2 + 2092x_1 + 60} + 100\frac{1 - \exp(-2x_2)}{100x_3 + 500x_2 + 4x_1 + 20}

with additive Gaussian noise. The low-fidelity model is a perturbation of fHf_H. Using minimal (2×2, 3×3) factorial designs and polynomial bases for the discrepancy, the LS-MFS dramatically outperforms a pure high-fidelity polynomial response surface: at n=4n = 4 LS-MFS achieves relative RMSE 9%\sim 9\% compared to 44%44\% for the high-fidelity surface; at n=9n = 9 the advantage grows to 1.5%1.5\% vs. 26%26\%.

7. Significance, Limitations, and Outlook

LS-MFS provides a mathematically transparent, computationally lightweight fusion of low- and high-fidelity models. Its conceptual simplicity—treating fL(x)f_L(x) as a regression basis and fitting all coefficients in a single linear system—results in unique, interpretable models under standard rank assumptions and facilitates rapid experimentation and design studies.

Key drawbacks are its lack of automatic spatial correlation modeling and the absence of direct Bayesian intervals, limiting its performance for non-polynomially behaving functions or in applications demanding rigorous probabilistic uncertainty quantification. In such cases, advanced Bayesian frameworks may be preferred, despite their greater computational complexity.

A plausible implication is that for rapid design-space exploration or in settings with extremely limited high-fidelity data, LS-MFS may serve as a robust, first-line surrogate, delegating to more sophisticated models only when strict spatial variance modeling or full Bayesian inference is necessary (Zhang et al., 2017).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Multi-fidelity Surrogates.