Guided Multi-Fidelity Bayesian Optimization
- Guided multi-fidelity Bayesian optimization is an advanced probabilistic framework that fuses high-cost real experiments with low-cost simulations using a multi-fidelity Gaussian process surrogate.
- It employs a dynamic correction model and an adaptive cost-aware acquisition function to balance experimental cost against simulation accuracy in closed-loop controller tuning.
- Empirical results demonstrate significant reductions in real experiments and robust adaptation to evolving simulation biases in robotics and similar applications.
Guided multi-fidelity Bayesian optimization (GMFBO) is an advanced probabilistic optimization framework that strategically integrates information from multiple, hierarchically related information sources—such as real-world experiments (high fidelity) and simulations or digital twins (low fidelity)—into the controller tuning workflow for closed-loop dynamical systems. This methodology aims to maximize data efficiency in scenarios where real-world evaluations are expensive or limited, by adaptively leveraging both corrected and raw simulation outputs under a cost-sensitive acquisition policy and dynamically adjusting to evolving model inaccuracies.
1. Framework Overview
The GMFBO framework is designed for controller tuning in closed-loop systems where two or more sources of information exist: a high-fidelity source (IS1, typically the real system) and lower-fidelity sources (IS2, such as digital twins or reduced-order models), with the possible inclusion of corrected simulation data (IS3). Central to the method is a multi-fidelity Gaussian process (GP) surrogate that fuses all available data via a custom kernel encoding both parameter similarities and fidelity-dependent relationships.
The iterative optimization proceeds by evaluating candidate controller settings using either IS1, IS2, or IS3, then updating the surrogate GP and the correction model at each iteration. Decisions about which information source to query are governed by a composite acquisition function, ensuring a balanced allocation of experimental and simulation effort according to realtime estimates of cross-source correlations and cost.
2. Digital Twin Correction Model
To address model mismatch between digital twins and real plant dynamics, GMFBO introduces a dedicated correction model based on a GP (GP_c). For a controller setting , simulation IS2 provides predictions over a time horizon , while IS1 delivers true system responses . Paired data is used to train GP_c. The correction model outputs an adjusted estimate:
The model further computes a fidelity estimate via the average predictive uncertainty:
A threshold (relative to a normalized reference signal) determines automatic acceptance of corrected simulation data (IS3) into the GP surrogate's dataset. When , corrected predictions are trusted and added; otherwise, they are discarded to prevent pollution of the optimization trajectory with unreliable surrogate information.
3. Adaptive Cost-Aware Acquisition Function
Candidate input selection in GMFBO is performed by maximizing a cost-aware Expected Improvement (caEI) acquisition function: where combines controller parameters and fidelity indicator. The numerator is the standard GP EI acquisition: with and the best observed objective.
The denominator codifies fidelity-aware cost: where measures real-vs-simulation mismatch, is a scaling constant, and denotes clamping to a cost range. This normalization biases sampling towards low-fidelity (simulation-derived) actions when their expected improvement is high and model mismatch is small; as simulation accuracy worsens, more weight is placed on costly real experiments.
4. Dynamic Online Adaptation
A defining feature of GMFBO is its closed-loop adaptation to time-varying system behavior and digital twin fidelity. Every new real system observation prompts:
- Retraining of the correction GP on updated real-vs-digital twin data.
- Adjustment of the cross-source correlation kernel hyperparameters.
The GP surrogate uses a composite kernel
where is a correlation decay factor parameterized by the current digital twin error, and , model parameter-space dependence. The key kernel lengthscale is dynamically set via
increasing real system prioritization as digital twin reliability deteriorates.
Additionally, when a real-system trial is performed, auxiliary corrected simulation data (IS3) are generated in a neighborhood of the tested controller—expanding the evidence available to the GP with low overhead.
5. Empirical Performance and Results
GMFBO was experimentally validated in both simulation studies and hardware implementation on a Maxon HEJ 90 robotic drive:
- In controlled simulation, ground-truth optimization of controller gains (subject to a cost function) was used to compare standard BO (single-EI) and GMFBO. GMFBO required as few as 6 real experiments to converge, versus 22 for naive BO under substantial digital twin bias.
- In robotic hardware, GMFBO achieved improved data efficiency, with the acquisition function and correction model dynamically responding to evolving mismatch (e.g., due to frictional changes in the physical system). The method robustly prioritized simulation data when accurate and reverted to real-world evaluation upon DT degradation.
- Competing multi-fidelity BO methods lacking adaptive correction were inferior, especially under significant or time-varying simulation bias.
6. Mathematical Formulation and Algorithmic Summary
Critical formulation details:
- GP surrogate predictive mean and variance for candidate input :
- Correction GP acceptance rule: accept into the surrogate if ; otherwise, discard.
Overall, the optimization proceeds by:
- At each round, evaluating the caEI acquisition function and selecting that maximizes it.
- Querying IS1 (real), IS2 (simulation), or IS3 (corrected simulation) as indicated.
- After each IS1 query, retraining the correction GP, updating GP surrogate and cross-source kernel parameters, and augmenting the training set with nearby corrected samples for local refinement.
7. Impact and Significance
GMFBO as articulated provides an advanced, cost-adaptive mechanism for closed-loop optimization of controllers by blending corrected digital twin information with real measurements in heterogeneous, dynamic environments. By online adaptation of model trust and cross-source kernels, the method maintains robustness to simulation drift, achieves strong convergence with limited high-cost experimentation, and substantially improves data efficiency in real-world robotics and related applications (Nobar et al., 22 Sep 2025). This principled approach generalizes to broader engineering domains characterized by the coexistence of digital twins, real systems, and varying simulation fidelity.