Adaptive Meta Black-box Optimization (ABOM)
- ABOM is a meta-optimization framework that integrates offline meta-learning, surrogate modeling via an Attentive Neural Process, and Bayesian optimization for rapid adaptation to unseen tasks.
- It constructs a diverse meta-dataset through parallel simulation and uses well-calibrated uncertainty estimates to significantly reduce the number of expensive online evaluations.
- Empirical evaluations in urban traffic management demonstrate ABOM's ability to reduce the number of waiting vehicles and improve throughput with fewer than 100 trials.
The Adaptive Meta Black-box Optimization Model (ABOM) formalizes a class of meta-optimization frameworks that leverage offline meta-learning and modern surrogate modeling to enable rapid, data-efficient adaptation to unseen black-box optimization tasks. The paradigm is motivated by real-world applications such as urban traffic network design, where conventional optimization or heuristic controllers fail to generalize due to heterogeneity and limited sample budgets. ABOM, as introduced in traffic light management (Yun et al., 2024), combines an offline meta-dataset of task–design–performance pairs, a meta-learned Attentive Neural Process (ANP) surrogate, and Bayesian optimization to deliver sample-efficient, uncertainty-aware optimization of system-level designs such as traffic phase configurations and timing.
1. Formulation of the Meta-Black-box Optimization Problem
ABOM considers a family of black-box optimization tasks drawn from an unknown distribution $\rho$. Each task $p \sim \rho$ (e.g., a traffic pattern) induces a black-box objective $f_p(x)$ that evaluates high-level design variables $x$ (e.g., intersection phase combinations, green-time allocations). The core challenge is to efficiently optimize $f_{p^*}$ for a new, unseen pattern $p^* \sim \rho$ using as few expensive function evaluations as possible.
ABOM operates in a meta-learning regime: it is provided with an offline meta-dataset $\mathcal{D} = \{\mathcal{D}_{p_i}\}_{i=1}^{N}$, where each $\mathcal{D}_{p_i} = \{(x_j, y_j)\}$ comprises design–performance pairs collected under historical tasks $p_i$ (traffic scenarios). This meta-dataset is used to inform and "warm-start" the search for a good design in a new task $p^*$, given a limited budget of online simulations.
Key components:
- Variable: global traffic-light design vector $x = (x_1, \dots, x_M)$, with one sub-vector $x_m$ per intersection $m$ and a fixed set of phase/timing choices at each.
- Objective: $f_{p^*}(x)$, noisy, nonconvex, and non-differentiable.
- Data: $\mathcal{D}$ aggregates prior realizations of $(x, f_{p_i}(x))$ for the historical tasks $p_1, \dots, p_N$.
- Goal: Use $\mathcal{D}$ to minimize the number of online evaluations needed to optimize $f_{p^*}$.
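As a toy illustration of this setup, the sketch below models a family of tasks, each inducing a noisy black-box objective over a design vector. `make_task` and its piecewise-linear shape are hypothetical stand-ins for a traffic pattern and its simulator, not the paper's actual objective.

```python
import numpy as np

rng = np.random.default_rng(0)

def make_task(pattern_seed):
    """Hypothetical stand-in for a traffic pattern p: returns a noisy,
    nonconvex, non-differentiable black-box objective f_p over a design x."""
    task_rng = np.random.default_rng(pattern_seed)
    shift = task_rng.uniform(-1.0, 1.0, size=4)  # task-specific optimum location

    def f(x, noise=0.1):
        # Higher is better; non-differentiable via |.| and corrupted by noise.
        return -np.sum(np.abs(x - shift)) + noise * rng.standard_normal()

    return f

# A "new, unseen" task drawn from the same family
f_new = make_task(pattern_seed=42)
y = f_new(np.zeros(4))  # one expensive evaluation
```

The meta-learning question is then how to minimize the number of such `f_new` calls by exploiting data collected from related tasks.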
2. Construction and Role of the Offline Meta-Dataset
Data collection is performed entirely offline in parallelized simulation. For each reference pattern $p_i$, candidate designs $x$ are sampled uniformly from the design space (phase-combination one-hot logits, or unnormalized green-time splits). Their simulated performance yields $\mathcal{D}_{p_i} = \{(x_j, y_j)\}$ without sequential dependency, ensuring the dataset is unbiased and exploits parallel compute.
This process is repeated independently for each pattern $p_i$, $i = 1, \dots, N$, yielding a meta-dataset that covers a diverse spectrum of traffic conditions and associated interventions. The data supports learning a prior over the mapping from designs to outcomes, enabling generalization to unseen traffic scenarios in a principled few-shot adaptation framework.
A held-out validation split (a $5{:}1$ train/val split over the patterns $p_i$) is used for hyperparameter tuning of the meta-surrogate.
3. Attentive Neural Process Surrogate: Architecture and Training
3.1 Latent Neural Process Structure
The ABOM surrogate is an Attentive Neural Process (ANP), modeling the conditional stochastic process $p(y_T \mid x_T, C)$ over targets given a context set:
- For any context set $C = \{(x_i, y_i)\}$ and any target set $T = \{(x_t, y_t)\}$, the model defines a consistent predictive distribution $p(y_T \mid x_T, C)$.
- ANP introduces a global latent vector $z$ with prior $p(z)$ and defines:
Encoder: a representation $r_i = \mathrm{enc}(x_i, y_i)$ for each context pair; updated via self-attention over $\{r_i\}$ and aggregated into a context summary.
Recognition model: $q(z \mid C)$, a Gaussian parameterized from the aggregated context summary.
Decoder: $p(y_t \mid x_t, r^*(x_t), z)$, where $r^*(x_t)$ is a cross-attention over the context representations conditioned on the query $x_t$.
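To make the query-conditioned aggregation concrete, here is a minimal NumPy sketch of the cross-attention step $r^*(x_t)$, using plain dot-product scores over context design locations (a simplification of the learned multi-head attention an ANP would use):

```python
import numpy as np

def softmax(a, axis=-1):
    a = a - a.max(axis=axis, keepdims=True)
    e = np.exp(a)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(x_target, x_context, r_context):
    """Query-conditioned aggregation r*(x_target): attend over context
    representations, using design locations as keys and queries."""
    d = x_target.shape[-1]
    scores = x_target @ x_context.T / np.sqrt(d)  # (n_tgt, n_ctx)
    w = softmax(scores, axis=-1)                  # attention weights
    return w @ r_context                          # (n_tgt, d_r)

# Two context points with distinguishable representations
x_ctx = np.array([[0.0], [1.0]])
r_ctx = np.array([[1.0, 0.0], [0.0, 1.0]])
# A query nearer the second context point attends mostly to it
r_star = cross_attention(np.array([[1.0]]), x_ctx, r_ctx)
```

The weights sum to one per query, so $r^*$ stays a convex combination of context representations, biased toward contexts relevant to the query.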
3.2 Training Objective
The surrogate is meta-trained across all tasks to maximize the evidence lower bound (ELBO):

$$\log p(y_T \mid x_T, C) \;\geq\; \mathbb{E}_{q(z \mid C \cup T)}\Big[\sum_{t \in T} \log p(y_t \mid x_t, r^*(x_t), z)\Big] \;-\; \mathrm{KL}\big(q(z \mid C \cup T)\,\|\,q(z \mid C)\big)$$
This process alternates between random context/target splits of each task's dataset $\mathcal{D}_{p_i}$, encouraging the model to learn transferable structure for rapid adaptation. The ANP models predictive mean and variance, naturally encoding the epistemic uncertainty critical for Bayesian optimization.
3.3 Inference
During online adaptation on a new task $p^*$, the ANP receives the current online trials $C_t = \{(x_s, y_s)\}_{s \le t}$ as context and returns:
Predictive mean: $\mu(x) = \mathbb{E}[y \mid x, C_t]$.
Predictive variance: $\sigma^2(x) = \mathrm{Var}[y \mid x, C_t]$. These are estimated via Monte Carlo sampling of $z \sim q(z \mid C_t)$ or by moment matching.
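A sketch of the Monte Carlo estimate via the law of total variance, where `sample_mu_sigma` is a hypothetical hook standing in for one decoder pass with a fresh latent draw $z \sim q(z \mid C_t)$:

```python
import numpy as np

def mc_predictive_moments(sample_mu_sigma, n_samples=256, seed=0):
    """Monte Carlo estimate of the predictive mean and variance by averaging
    decoder outputs over latent samples z ~ q(z | context)."""
    rng = np.random.default_rng(seed)
    mus, sigmas = [], []
    for _ in range(n_samples):
        mu, sigma = sample_mu_sigma(rng)  # one decoder pass for one z draw
        mus.append(mu)
        sigmas.append(sigma)
    mus, sigmas = np.array(mus), np.array(sigmas)
    mean = mus.mean(axis=0)
    # Law of total variance: E_z[sigma^2] + Var_z[mu]
    var = (sigmas ** 2).mean(axis=0) + mus.var(axis=0)
    return mean, var

# Toy decoder: mu varies with z, observation noise fixed at sigma = 0.5
decoder = lambda rng: (rng.standard_normal(), 0.5)
mean, var = mc_predictive_moments(decoder)
```

Splitting the variance this way keeps the epistemic part (spread of `mu` across latent draws) separate from the aleatoric part (`sigma` itself), which is what makes the estimate useful for acquisition functions.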
4. Bayesian Optimization with the ANP Surrogate
ABOM applies Bayesian optimization (BO) to maximize $f_{p^*}$ using the ANP as a probabilistic surrogate. Standard acquisition functions are employed:
- Upper Confidence Bound (UCB): $\alpha_{\mathrm{UCB}}(x) = \mu(x) + \beta\,\sigma(x)$
- Expected Improvement (EI): $\alpha_{\mathrm{EI}}(x) = (\mu(x) - y^{+})\,\Phi(u) + \sigma(x)\,\phi(u)$, where $u = (\mu(x) - y^{+})/\sigma(x)$ and $y^{+}$ is the best value observed so far
- Probability of Improvement (PI): $\alpha_{\mathrm{PI}}(x) = \Phi\big((\mu(x) - y^{+})/\sigma(x)\big)$
Constraints are enforced:
For phase time allocation: simplex constraints on each intersection's green-time vector $x_m$: $x_{m,k} \geq 0$ and $\sum_k x_{m,k} = 1$.
For phase combinations: argmax over softmax probabilities ensures valid discrete selection.
Acquisition maximization over feasible $x$ is solved with L-BFGS-B on the continuous logits.
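The three acquisition functions and the softmax reparameterization of the simplex constraint can be written directly from the formulas above; this is a generic sketch, not the paper's implementation:

```python
import math
import numpy as np

def norm_pdf(u):
    return math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)

def norm_cdf(u):
    return 0.5 * (1.0 + math.erf(u / math.sqrt(2.0)))

def ucb(mu, sigma, beta=2.0):
    """Upper Confidence Bound: mean plus a beta-weighted uncertainty bonus."""
    return mu + beta * sigma

def ei(mu, sigma, best):
    """Expected Improvement over the best observed value (maximization)."""
    if sigma <= 0.0:
        return max(mu - best, 0.0)
    u = (mu - best) / sigma
    return (mu - best) * norm_cdf(u) + sigma * norm_pdf(u)

def pi(mu, sigma, best):
    """Probability of Improvement over the best observed value."""
    if sigma <= 0.0:
        return float(mu > best)
    return norm_cdf((mu - best) / sigma)

def green_time_split(logits):
    """Softmax maps unconstrained logits to a valid green-time allocation on
    the simplex (non-negative, sums to 1), so the acquisition can be
    maximized unconstrained over the logits, e.g. with L-BFGS-B."""
    e = np.exp(logits - np.max(logits))
    return e / e.sum()

s = green_time_split(np.array([0.0, 1.0, 2.0]))
```

Reparameterizing through the softmax is what turns the constrained search over timings into an unconstrained continuous problem.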
Optimization Loop
At each online trial:
Compute the surrogate mean $\mu(\cdot)$ and variance $\sigma^2(\cdot)$ given the current context $C_t$.
Maximize the acquisition function under the constraints to propose the next design $x_{t+1}$.
Evaluate $y_{t+1} = f_{p^*}(x_{t+1})$ via simulation.
Augment the online context with $(x_{t+1}, y_{t+1})$. After the budget is exhausted, the best design found is returned.
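The loop above can be sketched as follows, with `toy_surrogate` as a hypothetical stand-in for the meta-trained ANP's predictive moments and a fixed candidate grid in place of L-BFGS-B over logits:

```python
import numpy as np

def bo_loop(f, surrogate, candidates, budget, beta=2.0):
    """Minimal BO loop with a UCB acquisition over a fixed candidate set.
    surrogate(X_ctx, y_ctx, candidates) -> (mu, sigma) is an illustrative
    interface for the surrogate's predictive moments."""
    X_ctx, y_ctx = [], []
    for _ in range(budget):
        mu, sigma = surrogate(np.array(X_ctx), np.array(y_ctx), candidates)
        x_next = candidates[np.argmax(mu + beta * sigma)]  # propose design
        y_next = f(x_next)                                 # expensive evaluation
        X_ctx.append(x_next)                               # grow online context
        y_ctx.append(y_next)
    best = int(np.argmax(y_ctx))
    return X_ctx[best], y_ctx[best]

def toy_surrogate(X_ctx, y_ctx, cand):
    """Toy surrogate: nearest-neighbor mean, uncertainty shrinking near data."""
    if len(X_ctx) == 0:
        return np.zeros(len(cand)), np.ones(len(cand))
    dist = np.abs(cand[:, None] - X_ctx[None, :]).min(axis=1)
    mu = np.array([y_ctx[np.argmin(np.abs(X_ctx - c))] for c in cand])
    return mu, np.minimum(dist, 1.0)

cand = np.linspace(0.0, 1.0, 21)
x_best, y_best = bo_loop(lambda x: -(x - 0.3) ** 2, toy_surrogate, cand, budget=10)
```

Even with this crude surrogate, the UCB trade-off steers evaluations toward the optimum at 0.3 within a handful of trials; the meta-trained ANP plays the same role with far better-calibrated moments.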
5. Algorithmic Workflow and Pseudocode
Phase 1: Offline Data Collection
- For $i = 1$ to $N$: sample candidate designs uniformly, evaluate them under $p_i$, and store the resulting pairs in $\mathcal{D}_{p_i}$.
Phase 2: Meta-training ANP
- Until convergence: randomly select a task dataset $\mathcal{D}_{p_i}$, split it into context/target sets, and update the ANP via stochastic gradient ascent on the ELBO.
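The random context/target split at the heart of each ELBO update can be sketched as:

```python
import numpy as np

def context_target_split(X, y, rng, min_ctx=2):
    """Random context/target split of one task's data, as performed for
    each meta-training update of the surrogate."""
    n = len(X)
    idx = rng.permutation(n)                 # shuffle the task's points
    n_ctx = int(rng.integers(min_ctx, n - 1))  # random context size
    ctx, tgt = idx[:n_ctx], idx[n_ctx:]
    return (X[ctx], y[ctx]), (X[tgt], y[tgt])

rng = np.random.default_rng(0)
X = np.arange(10.0).reshape(10, 1)
y = np.sin(X[:, 0])
(ctx_X, ctx_y), (tgt_X, tgt_y) = context_target_split(X, y, rng)
```

Varying the context size across updates is what forces the model to behave sensibly from very few observations, which is exactly the regime of online adaptation.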
Phase 3: Online Adaptation on New Task
Initialize the online context $C_0 = \emptyset$.
For $t = 0, \dots, T-1$:
- Compute $\mu(x)$, $\sigma^2(x)$ given the ANP and $C_t$.
- Maximize acquisition (UCB, EI, or PI) under constraints to obtain $x_{t+1}$.
- Query $y_{t+1} = f_{p^*}(x_{t+1})$, record $(x_{t+1}, y_{t+1})$ in $C_{t+1}$.
- After $T$ trials, return the design with the best observed $y$.
6. Empirical Evaluation and Impact
ABOM was instantiated on both synthetic grid networks of varying size and real networks (Hangzhou, Manhattan). The meta-dataset was constructed as described in Section 2. Baselines included basic BBO (GA, PSO, CMA-ES), meta-BBO (LGA, LES, RGPE, ABLR, FSBO), and RL controllers cast as 1-step MDPs (e.g., DQN, PPO).
Key results:
- On Hangzhou (phase combination), ABOM achieved $292.7$ average waiting vehicles vs. $294.8$ for the best baseline.
- On Manhattan (time allocation), ABOM: $3805.4$ vs. $3890.4$ for the next best.
- ABOM reduced waiting vehicles across all networks and converged in fewer than 100 trials.
- Real deployment: 26 intersections, with improved vehicle throughput over the baseline when run hourly for a week.
- Ablations: robust to a halved meta-dataset; all three acquisition criteria give similar improvements; the ANP is preferred over transformer-based NPs at the available data scale.
7. Interpretation and Significance
ABOM demonstrates that offline meta-learning of surrogate models (ANP) over diverse design–performance datasets, combined with probabilistic BO, enables high sample-efficiency black-box optimization in complex, real-world system design settings. The approach achieves rapid adaptation beyond traditional BBO and RL variants, robustly outperforms a range of meta and naïve baselines on metrics relevant to practice (e.g., waiting vehicles, throughput), and is deployable at urban scale without domain-specific tuning (Yun et al., 2024). The framework is general and could be extended to other black-box systems where acquiring task-generalizable learned surrogates with well-calibrated uncertainty is feasible.
Empirical findings confirm the following:
- Parallel offline data generation (random design sampling) is highly effective for meta-surrogate training and allows for maximal use of simulation infrastructure.
- Well-calibrated epistemic uncertainty from the ANP is critical for BO sample efficiency, irrespective of acquisition function.
- Robustness to meta-dataset size and acquisition choice confirms that meta-BBO with few-shot adaptation is a practical architecture for scalable, adaptive design in high-stakes black-box domains.
Principal Reference:
"An Offline Meta Black-box Optimization Framework for Adaptive Design of Urban Traffic Light Management Systems" (Yun et al., 2024)