Multi-Fidelity Residual Neural Processes
- Multi-Fidelity Residual Neural Processes (MFRNP) are a deep learning framework for surrogate modeling that explicitly aggregates lower-fidelity predictions and applies a residual correction.
- They use a two-stage decode–then–residual approach to achieve scalable, high-dimensional emulation and robust out-of-distribution generalization.
- MFRNP outperforms traditional GP-based and neural surrogates in tasks such as PDEs, climate emulation, and robotics, offering real-time calibration and uncertainty quantification.
Multi-Fidelity Residual Neural Processes (MFRNP) are a deep learning framework for surrogate modeling in settings where outputs are available from simulators or experiments at multiple fidelities. The core innovation of MFRNP is the explicit modeling and aggregation of decoded predictions from lower-fidelity neural surrogates, followed by residual correction at the highest fidelity. This two-stage “decode–then–residual” approach enables scalable, accurate surrogate models capable of high-dimensional emulation and robust generalization to out-of-distribution (OOD) scenarios, significantly outperforming existing Gaussian Process (GP)-based and neural surrogates across PDEs, real-world climate emulation, and robotics (Niu et al., 2024, Hunter et al., 11 Nov 2025).
1. Model Architecture and Aggregation Strategy
MFRNP decomposes the multi-fidelity surrogate modeling task into two principal components. For $K$ fidelities, with $k = 1, \dots, K-1$ denoting the lower fidelities and $K$ the highest fidelity:
- Lower-Fidelity NPs: A separate Neural Process (NP) surrogate is learned for each lower fidelity $k$. Each NP uses an encoder $q_{\phi_k}$ operating on a split of context/target sets $(D_k^c, D_k^t)$, producing a latent $z_k$, and a decoder $p_{\theta_k}$ mapping $(z_k, x)$ to output distributions.
- Decoded Aggregation: For a given input $x$, outputs from all lower-fidelity decoders are averaged: $\bar{y}(x) = \frac{1}{K-1} \sum_{k=1}^{K-1} \hat{y}_k(x)$, where $\hat{y}_k(x)$ is the mean of the fidelity-$k$ decoder distribution $p_{\theta_k}(y \mid z_k, x)$.
- Residual NP: Instead of directly modeling the highest fidelity, a residual Neural Process is trained over the highest-fidelity inputs on residual targets $R(x) = y_K(x) - \bar{y}(x)$. Its encoder $q_{\phi_K}(z_K | z_{1:K-1}, \theta_{1:K-1}, D'_K^c)$ explicitly receives all lower-fidelity latents and decoder parameters, and its decoder $p_{\theta_K}$ predicts $R(x)$.
The final high-fidelity prediction is given by aggregation plus residual: $\hat{y}_K(x) = \bar{y}(x) + \hat{R}(x)$, where $\hat{R}(x)$ is the mean of $p_{\theta_K}(R(x) \mid z_K, x)$ (Niu et al., 2024).
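The decode–then–residual assembly above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the decoders are hypothetical stand-ins for trained NP decoders, with the latent conditioning elided.

```python
import numpy as np

# Hypothetical stand-ins for trained lower-fidelity NP decoders: each maps
# inputs x (latent conditioning elided) to a mean prediction on a shared grid.
def decoder_fid1(x):
    return np.sin(x)                         # coarse surrogate

def decoder_fid2(x):
    return np.sin(x) + 0.1 * np.cos(3 * x)   # finer surrogate

def residual_decoder(x, aggregated):
    # Stand-in for the residual NP, which in MFRNP also conditions on the
    # lower-fidelity latents and decoder parameters.
    return 0.05 * np.cos(5 * x)

x = np.linspace(0.0, 1.0, 8)

# Decoded aggregation: average the lower-fidelity decoder outputs.
lower_preds = np.stack([decoder_fid1(x), decoder_fid2(x)])
y_bar = lower_preds.mean(axis=0)

# Final high-fidelity prediction = aggregation + residual correction.
y_hat = y_bar + residual_decoder(x, y_bar)
```

The key structural point is that the residual decoder corrects a concrete aggregated prediction rather than a shared latent.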
2. Residual–Evidence Lower Bound and Joint Training
The training objective is a tailored Evidence Lower Bound (Residual-ELBO), reflecting both direct surrogate modeling at lower fidelities and residual learning at the highest fidelity:
- Fidelity-Specific ELBO, $\mathcal{L}_k$:
$\mathcal{L}_k = \mathbb{E}_{q_{\phi_k}(z_k|D_k^c \cup D_k^t)} \left[ \sum_{x \in D_k^t} \log p_{\theta_k}(y_k(x)|z_k,x) \right] - \mathrm{KL}[q_{\phi_k}(z_k|D_k^c \cup D_k^t) \| q_{\phi_k}(z_k|D_k^c)]$
- Residual ELBO, $\mathcal{L}^R$:
$\mathcal{L}^R = \mathbb{E}_{q_{\phi_K}(z_K|D'_K^c \cup D'_K^t)} \left[ \sum_{x \in D'_K^t} \log p_{\theta_K}(R(x)|z_K,x) \right] - \mathrm{KL}[q_{\phi_K}(z_K|D'_K^c \cup D'_K^t) \| q_{\phi_K}(z_K|D'_K^c)]$
- Total Loss: $\mathcal{L} = \sum_{k=1}^{K-1} \mathcal{L}_k + \mathcal{L}^R$, estimated with Monte Carlo samples of the latents.
This objective necessitates that the highest-fidelity latent and its prediction depend on the decoded (not just latent) lower-fidelity information, enforcing both in-fidelity accuracy and cross-fidelity informational coupling (Niu et al., 2024).
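A minimal numerical sketch of the residual-ELBO terms follows, assuming diagonal Gaussian latent posteriors and a single Monte Carlo sample; the function names and the Gaussian-likelihood decoder are illustrative assumptions, not the paper's code.

```python
import numpy as np

def gaussian_kl(mu_q, var_q, mu_p, var_p):
    # KL[N(mu_q, var_q) || N(mu_p, var_p)] for diagonal Gaussians.
    return 0.5 * np.sum(
        np.log(var_p / var_q) + (var_q + (mu_q - mu_p) ** 2) / var_p - 1.0
    )

def gaussian_log_lik(y, mu, var):
    # Sum of independent Gaussian log-densities.
    return -0.5 * np.sum(np.log(2 * np.pi * var) + (y - mu) ** 2 / var)

def residual_elbo(r_targets, dec_mu, dec_var, mu_ct, var_ct, mu_c, var_c):
    # Reconstruction term: log-likelihood of residual targets R(x) under the
    # residual decoder, for one latent sample from q(z_K | context + target).
    recon = gaussian_log_lik(r_targets, dec_mu, dec_var)
    # Regularizer: KL between the context+target posterior and the
    # context-only posterior over z_K.
    kl = gaussian_kl(mu_ct, var_ct, mu_c, var_c)
    return recon - kl
```

Maximizing this quantity trains the residual NP on $R(x)$ exactly as a standard NP ELBO would train on raw targets.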
3. Inference Pipeline and Extrapolation Capacity
At inference, for an unseen input at the highest fidelity, MFRNP executes the following sequence:
- Lower-Fidelity Staging: For each $k = 1, \dots, K-1$, latent samples are drawn from $q_{\phi_k}(z_k|D_k^c)$, followed by decoding and linear interpolation as needed, yielding predictions $\hat{y}_k(x)$ on the finest input grid.
- Aggregation: $\bar{y}(x) = \frac{1}{K-1} \sum_{k=1}^{K-1} \hat{y}_k(x)$.
- Residual Correction: Draw samples $z_K' \sim q_{\phi_K}(z_K|D'_K^c)$, decode $\hat{R}(x)$ from $p_{\theta_K}(R(x)|z_K', x)$.
- Output: $\hat{y}_K(x) = \bar{y}(x) + \hat{R}(x)$.
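The staging and aggregation steps can be sketched as follows, with synthetic decoded outputs and `np.interp` standing in for the linear interpolation onto the finest grid:

```python
import numpy as np

# Lower-fidelity predictions live on coarser grids; interpolate onto the
# finest (highest-fidelity) grid before aggregation.
x_coarse = np.linspace(0.0, 1.0, 5)
x_fine = np.linspace(0.0, 1.0, 17)

y1_coarse = np.sin(2 * np.pi * x_coarse)           # decoded fidelity-1 output
y1_fine = np.interp(x_fine, x_coarse, y1_coarse)   # linear interpolation

y2_fine = np.sin(2 * np.pi * x_fine)               # fidelity-2 already on fine grid

# Aggregation over the K - 1 = 2 lower fidelities.
y_bar = 0.5 * (y1_fine + y2_fine)
```

The residual correction is then decoded on `x_fine` and added to `y_bar`, as in the Output step above.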
The residual NP’s encoder, by directly conditioning on decoded lower-fidelity outputs, enables MFRNP to extrapolate beyond the domain of available high-fidelity training data, conferring a substantive OOD generalization advantage (Niu et al., 2024).
4. Theoretical Motivation and Decoder-Based Sharing
Unlike prior multi-fidelity neural surrogates, which share only latent representations (commonly realized as a shared variable $z$ decoded by fidelity-specific decoders $p_{\theta_k}$), MFRNP's explicit sharing of decoder outputs resolves key inconsistencies:
- Latent Sharing Limitation: Shared latents decoded by different fidelity-specific decoders $p_{\theta_k}$ may yield mutually inconsistent predictions for the same input $x$.
- Decoder-Aggregation: Incorporating the aggregated decoded prediction $\bar{y}(x)$ into residual learning ensures that all lower-fidelity decoders influence highest-fidelity predictions.
- No Feed-Forward Error Propagation: By aggregating concrete, decoded predictions, MFRNP avoids the chained error propagation of hierarchical latent-only schemes and mitigates calibration errors and instability.
Ablation analyses confirm that replacing MFRNP’s decode–aggregate module with a purely latent hierarchical aggregator significantly increases prediction error (order-of-magnitude deterioration in normalized RMSE), affirming the necessity of decoder-level information sharing (Niu et al., 2024).
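A toy numerical illustration of the latent-sharing limitation (entirely hypothetical decoders, not drawn from the paper): two fidelity-specific decoders applied to one shared latent can disagree arbitrarily, whereas decoder-level aggregation commits to a single concrete prediction for the residual stage to correct.

```python
import numpy as np

z = 0.3   # one shared latent variable
x = 0.5   # query input

# Fidelity-specific decoders of the same shared latent (latent sharing).
def dec_1(z, x):
    return np.sin(x) + z

def dec_2(z, x):
    return np.sin(x) - z

# The two decoders disagree on the same (z, x) pair.
inconsistency = abs(dec_1(z, x) - dec_2(z, x))   # equals 2 * z

# Decoder aggregation instead yields one concrete prediction to correct.
aggregated = 0.5 * (dec_1(z, x) + dec_2(z, x))
```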
5. Extensions: Physics-Informed and Uncertainty-Calibrated Variants
The Multi-Fidelity Residual Physics-Informed Neural Process (MFR-PINP) generalizes MFRNP by integrating domain-specific physics priors and uncertainty calibration:
- Architecture: Two parallel branches: a low-fidelity NP that learns a surrogate of an analytic physics model, and a residual NP that corrects this surrogate toward high-fidelity ground truth or an enhanced physics prior.
- Physics-Informed Feature Integration: Physics-prior predictions are introduced as additional input features; a frozen copy of the low-fidelity decoder stabilizes the residual learning process.
- Uncertainty Quantification: Split conformal prediction is employed during inference, yielding prediction intervals with finite-sample coverage guarantees for each state dimension.
- Output Fusion: The high-fidelity prediction is a Gaussian whose mean adds the residual mean to the low-fidelity surrogate's mean and whose variance combines the variances of the two branches (Hunter et al., 11 Nov 2025).
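A minimal sketch of the split conformal step, assuming absolute-error nonconformity scores on a held-out calibration set (the function name and synthetic data are illustrative):

```python
import numpy as np

def split_conformal_half_width(cal_scores, alpha=0.1):
    # Finite-sample-corrected quantile of the calibration nonconformity
    # scores; intervals built with it cover with probability >= 1 - alpha.
    n = len(cal_scores)
    q_level = min(1.0, np.ceil((n + 1) * (1 - alpha)) / n)
    return np.quantile(cal_scores, q_level, method="higher")

rng = np.random.default_rng(0)
# |y - y_hat| on a held-out calibration set (synthetic here).
cal_scores = np.abs(rng.normal(scale=0.5, size=200))

half_width = split_conformal_half_width(cal_scores, alpha=0.1)

# Prediction interval around a new model mean mu (illustrative).
mu = 1.0
interval = (mu - half_width, mu + half_width)
```

In the per-state-dimension setting described above, this calibration is performed separately for each output dimension.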
A plausible implication is that this class of multi-fidelity decoupling and residual assembly architectures allows direct incorporation of physical knowledge, robust calibration, and interpretable uncertainty bounds in real-time applications such as robotics and control.
6. Empirical Performance and Benchmarks
MFRNP demonstrates superior empirical performance across multiple domains:
| Task & Domain | Metric | Best MFRNP Performance | Baseline Range | Relative Improvement |
|---|---|---|---|---|
| PDE Surrogates (Heat/Poisson/Fluid) | nRMSE (full domain) | 0.004–0.007 | 0.10–0.38 | ∼90% error reduction |
| PDE Surrogates (OOD) | nRMSE | 0.005–0.018 | 0.14–0.75 | Outperforms all baselines |
| Climate (CMIP6/ERA5) | Weighted nRMSE | Lower in 3/4 scenarios | All compared baselines | More stable over time |
| Robotics (State Estimation) [MFR-PINP] | Per-step RMSE / NLL | RMSE 0.154, NLL –1.274 | DKF RMSE 0.958, NLL –0.192 | Best in both accuracy and calibration |
Baselines in these studies include single-fidelity NPs, MF GPs (NARGP), hierarchical and disentangled MF NPs (MFHNP, D-MFD), Deep Multi-fidelity Active Learning (DMF), neural network GPs, and domain-specific frameworks such as DeepESD (Niu et al., 2024, Hunter et al., 11 Nov 2025).
MFRNP's advantage is most pronounced in complex, high-dimensional PDEs and under OOD configurations where high-fidelity data is sparse or localized. In climate emulation, MFRNP maintains consistent prediction accuracy over multi-decadal time ranges and at fine spatial resolutions, with baseline methods exhibiting degraded performance as projections extend (Niu et al., 2024).
In real-time robotic state estimation, MFR-PINP achieves lower test RMSE and better negative log-likelihood than transformer-based Deep Kalman Filters, with all neural approaches meeting strict real-time requirements on hardware-constrained platforms (Hunter et al., 11 Nov 2025).
7. Broader Implications, Limitations, and Future Directions
MFRNP offers a general-purpose, scalable surrogate modeling paradigm with strong performance in high-dimensional, multi-fidelity, and OOD regimes. Its architecture systematically leverages all available fidelity levels via decoded aggregation and residual modeling. MFRNP has already demonstrated advantages in physics-based emulation, sensor fusion, and real-time tracking and is extensible to broader domains including aerodynamics, structural health monitoring, and environmental forecasting, where rapid adaptation, physical consistency, and calibrated uncertainty are essential (Niu et al., 2024, Hunter et al., 11 Nov 2025).
Current limitations center on data efficiency—especially the acquisition cost for high-fidelity ground truth—as well as the need for occasional human oversight in model recalibration for safety-critical scenarios. Future directions include hybrid training with synthetic data to reduce sim-to-real gaps, adaptive conformal updates for tighter uncertainty quantification, and application to multi-agent and control-theoretic domains.
MFRNP's combination of decoded aggregation and residual correction establishes it as a foundational tool for modern multi-fidelity surrogate modeling, with unique capabilities to scale, generalize, and calibrate across diverse, high-stakes scientific and engineering tasks.