Bellman Conformal Inference (BCI)

Updated 8 February 2026

Bellman Conformal Inference (BCI) is a framework that generates calibrated predictive intervals for univariate time series by leveraging dynamic programming to balance interval length and long-term coverage.
It formulates a one-dimensional stochastic control problem to optimally select interval parameters, acting as a robust wrapper around black-box forecasting models.
BCI comes with a non-asymptotic long-run coverage guarantee, and empirical evaluations report intervals up to 20% shorter than those from Adaptive Conformal Inference.

Bellman Conformal Inference (BCI) is a framework for producing calibrated predictive intervals for univariate time series by leveraging dynamic programming to minimize average interval length while maintaining rigorous long-term coverage guarantees. BCI operates as a wrapper around arbitrary black-box multi-step forecasting models, directly addressing the potential miscalibration of nominal prediction intervals provided by such models. At each step, BCI formulates and solves a tractable one-dimensional stochastic control problem to select interval parameters, delivering approximately calibrated intervals under arbitrary distribution shifts and temporal dependencies and yielding tighter prediction intervals compared to previous methods such as Adaptive Conformal Inference (ACI) (Yang et al., 2024).

1. Problem Formulation and Calibration Objective

Consider a univariate time series $(Y_1, Y_2, \dots)$ where $Y_t \in \mathcal Y$ is revealed only at time $t+1$, and let $\mathcal F_{t-1}$ represent all observable information up to time $t$. At each time $t$, a black-box forecaster provides, for each horizon $s \in \{t,\dots, t+T-1\}$, a nominal $(1-\beta)$-level prediction interval

$C_{s|t}(1-\beta) \subseteq \mathcal Y, \qquad \beta\in[0,1],$

with interval length $L_{s|t}(\beta) = |C_{s|t}(1-\beta)|$. While ideally $P(Y_s \in C_{s|t}(1-\beta)) = 1-\beta$, in practice these prediction intervals may be poorly calibrated.

BCI leverages a data-dependent miscoverage index $\alpha_t \in [0,1]$, adapted to $\mathcal F_{t-1}$, to determine the prediction interval $C_t := C_{t|t}(1-\alpha_t)$. Defining the indicator $\mathrm{err}_t = \mathbf{1}\{Y_t \notin C_t\}$, the calibration objective is strict long-run validity for a pre-specified target $\bar\alpha \in (0,1)$:

$\limsup_{K\to\infty} \frac{1}{K}\sum_{t=1}^K \mathrm{err}_t \le \bar\alpha \quad \text{almost surely},$

uniformly for any data-generating process, including adversarial or deterministic sequences. The only assumptions are that $C_{s|t}(1-\beta)$ is monotone in $\beta$ with respect to set inclusion (the interval shrinks as $\beta$ grows) and that $C_{s|t}(1) = \mathcal Y$ is the trivial full-space interval.
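The two assumptions can be made concrete with a small sketch. It assumes a Gaussian forecast with mean `mu` and standard deviation `sigma`; the function names are illustrative and not part of BCI itself.

```python
from statistics import NormalDist

def nominal_interval(mu, sigma, beta):
    """Nominal (1 - beta)-level interval C(1 - beta) of a Gaussian forecast."""
    if not 0.0 <= beta <= 1.0:
        raise ValueError("beta must lie in [0, 1]")
    p = 1.0 - beta / 2.0
    if p >= 1.0:
        # beta = 0 (or numerically indistinguishable): C(1) is the full space.
        return (float("-inf"), float("inf"))
    if beta >= 1.0:
        return (mu, mu)  # degenerate zero-length interval
    half_width = sigma * NormalDist().inv_cdf(p)
    return (mu - half_width, mu + half_width)

def interval_length(mu, sigma, beta):
    """Length L(beta) = |C(1 - beta)|, non-increasing in beta."""
    lo, hi = nominal_interval(mu, sigma, beta)
    return hi - lo

# Nestedness: a larger beta never gives a longer interval.
lengths = [interval_length(0.0, 1.0, b) for b in (0.0, 0.05, 0.1, 0.5, 1.0)]
```

Any family of nested sets with a full-space member would serve equally well; the Gaussian form is chosen only because it is easy to write down.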

2. Stochastic Control Problem and Dynamic Programming

BCI addresses interval selection as a finite-horizon one-dimensional stochastic control problem (SCP) at each time $t$. The objective is to choose $\{\alpha_{s|t}\}_{s=t}^{t+T-1}$ to minimize expected total interval length plus a penalty on excess miscoverage:

$\min_{\alpha_{t|t},\dots,\alpha_{t+T-1|t}} \mathbb{E} \Bigg[\sum_{s=t}^{t+T-1} L_{s|t}(\alpha_{s|t}) + \lambda_t \max\left(\frac{1}{T}\sum_{s=t}^{t+T-1}\mathrm{err}_{s|t} - \bar\alpha,\, 0\right)\Bigg].$

Here, for each $s$, $\mathrm{err}_{s|t} = \mathbf{1}\{\alpha_{s|t} > \beta_{s|t}\}$, with $\beta_{s|t}$ drawn from the analyst's empirical estimate $F_{s|t}$ of the distribution of the future probability integral transform (PIT). The scalar weight $\lambda_t$ controls the tradeoff between short intervals and coverage. No constraints are required beyond $\alpha_{s|t} \in [0,1]$; the safeguard $C_{s|t}(1) = \mathcal Y$ ensures that the trivial full-coverage choice $\alpha_{s|t} = 0$ (maximal interval) is always available.

The SCP is solved via dynamic programming on the state $\rho_{s|t} = \sum_{k=t}^{s-1}\mathrm{err}_{k|t}$ (number of miscoverages up to $s-1$), with terminal cost at $s=t+T$:

$J_{t+T|t}(\rho) = \lambda_t \max\left(\frac{\rho}{T} - \bar\alpha,\, 0\right).$

The Bellman update admits explicit computation:

$J_{s|t}(\rho) = \min_{\alpha\in[0,1]} \mathbb{E}_{\beta\sim F_{s|t}} \left[ L_{s|t}(\alpha) + J_{s+1|t} \left( \rho + \mathbf{1}\{\alpha > \beta\} \right)\right].$

With $D_{s|t}(\rho) = J_{s+1|t}(\rho+1) - J_{s+1|t}(\rho)$, the optimal action at $(s,\rho)$ is

$\widetilde\alpha_{s|t}(\rho) = \arg\min_{\alpha\in[0,1]} \big\{ L_{s|t}(\alpha) + D_{s|t}(\rho) F_{s|t}(\alpha) \big\}.$

The actual action at time $t$ is $\alpha^*_{t|t} = \widetilde\alpha_{t|t}(0)$.
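The backward recursion can be sketched as follows. This is a minimal illustration, not a reference implementation: it assumes that $\alpha$ is restricted to a finite grid, that the lengths $L_{s|t}$ and the PIT distribution $F_{s|t}$ are supplied as arrays evaluated on that grid, and that all lengths on the grid are finite. The name `solve_bci_dp` and the Gaussian, perfectly calibrated example inputs are hypothetical.

```python
import numpy as np
from statistics import NormalDist

def solve_bci_dp(lengths, miss_prob, lam, alpha_bar):
    """Backward induction for the T-step control problem on an alpha grid.

    lengths[k, j]   : L_{s|t}(alpha_j) at step s = t + k
    miss_prob[k, j] : estimated P(beta_{s|t} < alpha_j), i.e. F_{s|t}(alpha_j)
    Returns (J, policy): J[k, rho] is the cost-to-go, policy[k, rho] the
    index of the minimizing grid point.
    """
    T, _ = lengths.shape
    J = np.zeros((T + 1, T + 1))
    policy = np.zeros((T, T + 1), dtype=int)
    J[T] = lam * np.maximum(np.arange(T + 1) / T - alpha_bar, 0.0)  # terminal cost
    for k in range(T - 1, -1, -1):
        for rho in range(k + 1):  # at most k misses can precede step k
            D = J[k + 1, rho + 1] - J[k + 1, rho]  # marginal cost of one more miss
            cost = lengths[k] + J[k + 1, rho] + D * miss_prob[k]
            j = int(np.argmin(cost))
            policy[k, rho] = j
            J[k, rho] = cost[j]
    return J, policy

# Example inputs (assumed): standard Gaussian intervals, uniform PIT.
T = 5
alphas = np.linspace(0.01, 0.99, 99)
length_row = np.array([2 * NormalDist().inv_cdf(1 - a / 2) for a in alphas])
lengths = np.tile(length_row, (T, 1))
miss_prob = np.tile(alphas, (T, 1))

J, policy = solve_bci_dp(lengths, miss_prob, lam=10.0, alpha_bar=0.1)
alpha_now = alphas[policy[0, 0]]  # action actually played at time t
J0, policy0 = solve_bci_dp(lengths, miss_prob, lam=0.0, alpha_bar=0.1)
```

With `lam=0.0` the penalty vanishes, so the recursion simply picks the shortest interval on the grid at every step; a positive `lam` can only raise the optimal cost.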

3. Interval Construction and Online Updates

Once $\alpha_t$ is determined, the prediction interval for $Y_t$ is $[L_t, U_t] = C_{t|t}(1-\alpha_t)$. The "uncalibrated PIT" at time $t$ is

$\beta_t = \sup\{\beta: Y_t \in C_{t|t}(1-\beta)\}.$

Then miscoverage is encoded as $\mathrm{err}_t = \mathbf{1}\{\alpha_t > \beta_t\}$. The update for $\lambda_t$ is performed via an online-gradient step,
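For symmetric Gaussian nominal intervals the supremum has a closed form: $Y_t$ lies in $C_{t|t}(1-\beta)$ exactly when $\beta \le 2(1 - \Phi(|Y_t - \mu|/\sigma))$. The sketch below assumes that Gaussian form; `mu` and `sigma` are hypothetical forecast parameters.

```python
from statistics import NormalDist

def uncalibrated_pit(y, mu, sigma):
    """beta_t = sup{beta : y in C(1 - beta)} for a symmetric Gaussian interval."""
    z = abs(y - mu) / sigma
    return 2.0 * (1.0 - NormalDist().cdf(z))

def miscovered(alpha_t, beta_t):
    """err_t = 1{alpha_t > beta_t}: the chosen interval was too narrow for y."""
    return int(alpha_t > beta_t)

beta_far = uncalibrated_pit(3.0, 0.0, 1.0)   # observation far in the tail
beta_near = uncalibrated_pit(0.5, 0.0, 1.0)  # observation near the center
```

An observation far from the forecast mean gives a small $\beta_t$, so it is missed by any interval with $\alpha_t$ above that value.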

$\lambda_{t+1} = \lambda_t - \gamma [\bar\alpha - \mathrm{err}_t], \qquad \gamma = c\,\lambda_{\max},\, c\in(0,1),$

and whenever $\lambda_t > \lambda_{\max}$, BCI defaults to the full-space interval by truncating $\alpha_t$ to zero. This update ensures that

$\lambda_t \in [-\gamma \bar\alpha,\, \lambda_{\max} + \gamma(1-\bar\alpha)]$

via induction, so that long-run miscoverage is controlled.
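The induction can be checked numerically against worst-case error sequences. The sketch relies on one assumption that is not spelled out above: whenever $\lambda_t \le 0$ the penalty carries no positive weight, so the shortest interval is chosen and a miss is recorded. Function names and default constants are illustrative.

```python
def update_lambda(lam, err, alpha_bar, gamma):
    """One online-gradient step: lambda rises after a miss, falls after a cover."""
    return lam - gamma * (alpha_bar - err)

def worst_case_trace(n_steps, adversary_err, alpha_bar=0.1, lam_max=1.0,
                     c=0.2, lam0=0.0):
    """Trace of lambda_t when an adversary fixes err_t wherever it is free to."""
    gamma = c * lam_max
    lam, trace = lam0, []
    for _ in range(n_steps):
        trace.append(lam)
        if lam > lam_max:
            err = 0  # alpha_t = 0: full-space interval, a miss is impossible
        elif lam <= 0:
            err = 1  # ASSUMPTION: non-positive weight -> shortest interval, a miss
        else:
            err = adversary_err  # adversary decides the outcome
        lam = update_lambda(lam, err, alpha_bar, gamma)
    return trace, gamma

trace_miss, gamma = worst_case_trace(1000, adversary_err=1)
trace_cover, _ = worst_case_trace(1000, adversary_err=0)
lower = -gamma * 0.1
upper = 1.0 + gamma * (1 - 0.1)
```

Both traces stay inside $[-\gamma\bar\alpha,\, \lambda_{\max} + \gamma(1-\bar\alpha)]$: an always-miss adversary is stopped by the full-space fallback, and an always-cover adversary is stopped once $\lambda_t$ drops to zero or below.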

4. Algorithmic Workflow

BCI can be summarized in a stepwise form as follows:

Input: previous $\lambda_{t-1}$ and $\mathrm{err}_{t-1}$; multi-step forecasts $\{L_{s|t}(\cdot), F_{s|t}\}_{s=t}^{t+T-1}$.
Update the security parameter:

$\lambda_t = \lambda_{t-1} - \gamma [\bar\alpha - \mathrm{err}_{t-1}]$

Solve the stochastic control problem via dynamic programming to obtain $\widetilde\alpha_{s|t}(\rho)$ for all $s$, $\rho$.
Set

$\alpha_t = \begin{cases} \widetilde\alpha_{t|t}(0), & \lambda_t \le \lambda_{\max} \\ 0, & \lambda_t > \lambda_{\max} \end{cases}$

and output $C_t = C_{t|t}(1-\alpha_t)$.

Observe $Y_t$, record $\mathrm{err}_t = \mathbf{1}\{Y_t \notin C_t\}$, and repeat.

This workflow requires only future-looking forecasts (empirical PITs and nominal interval lengths), compatible with any off-the-shelf forecasting mechanism.
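The loop structure can be sketched as below. To keep the example short, the dynamic-programming solve is replaced by a hypothetical stand-in `toy_policy`, which is not the BCI policy; the Gaussian forecasts, the initialization `lam_init=0.0`, and the synthetic data are likewise assumptions made only for illustration.

```python
import random
from statistics import NormalDist

def gaussian_interval(mu, sigma, alpha):
    """C_{t|t}(1 - alpha) for a Gaussian forecast; alpha = 0 gives the whole line."""
    p = 1.0 - alpha / 2.0
    if alpha <= 0.0 or p >= 1.0:
        return (float("-inf"), float("inf"))
    if alpha >= 1.0:
        return (mu, mu)
    half = sigma * NormalDist().inv_cdf(p)
    return (mu - half, mu + half)

def toy_policy(lam, lam_max=1.0):
    """Stand-in for the DP solve (NOT the BCI policy): alpha shrinks as lambda grows."""
    return min(1.0, max(0.0, 1.0 - lam / lam_max))

def run_bci_loop(ys, mus, sigmas, choose_alpha, alpha_bar=0.1,
                 lam_max=1.0, c=0.1, lam_init=0.0):
    gamma = c * lam_max
    lam, err_prev = lam_init, None
    errs, intervals = [], []
    for y, mu, sigma in zip(ys, mus, sigmas):
        if err_prev is not None:
            lam = lam - gamma * (alpha_bar - err_prev)       # update lambda_t
        alpha = 0.0 if lam > lam_max else choose_alpha(lam)  # full-space fallback
        lo, hi = gaussian_interval(mu, sigma, alpha)         # output C_t
        err_prev = int(not (lo <= y <= hi))                  # observe Y_t, record err_t
        errs.append(err_prev)
        intervals.append((lo, hi))
    return errs, intervals

# Miscalibrated forecaster: true scale is 2, claimed scale is 1.
rng = random.Random(0)
n = 2000
ys = [rng.gauss(0.0, 2.0) for _ in range(n)]
errs, intervals = run_bci_loop(ys, [0.0] * n, [1.0] * n, toy_policy)
miscoverage = sum(errs) / n
```

Even with the crude stand-in policy, the average miscoverage stays close to `alpha_bar`, because that property comes from the $\lambda_t$ update and the full-space fallback rather than from the quality of the policy; the policy mainly affects interval length.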

5. Coverage Properties and Theoretical Guarantees

BCI establishes a non-asymptotic bound for average miscoverage. For any starting index $m \ge 0$ and batch of $K$ rounds,

$\left|\frac{1}{K}\sum_{t=m+1}^{m+K} \mathrm{err}_t - \bar\alpha\right| \le \frac{\lambda_{\max} + \gamma}{K\gamma} = \frac{c + 1}{c K}.$

Because the bound is two-sided, sending $K\to\infty$ (with $m = 0$) shows that the average miscoverage converges to $\bar\alpha$, and in particular

$\limsup_{K\to\infty} \frac{1}{K}\sum_{t=1}^{K} \mathrm{err}_t \le \bar\alpha$

almost surely, for any data sequence, regardless of stochasticity or stationarity. The approach does not impose assumptions on the forecaster or underlying process.
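One way to see where the bound comes from is to telescope the $\lambda_t$ update from Section 3 and use the range $[-\gamma\bar\alpha,\, \lambda_{\max} + \gamma(1-\bar\alpha)]$ stated there. The steps below are a sketch of that argument, assuming $\lambda_{m+1}$ and $\lambda_{m+K+1}$ both lie in this range:

```latex
\begin{align*}
\lambda_{t+1} - \lambda_t &= \gamma\,(\mathrm{err}_t - \bar\alpha), \\
\sum_{t=m+1}^{m+K} (\mathrm{err}_t - \bar\alpha)
  &= \frac{\lambda_{m+K+1} - \lambda_{m+1}}{\gamma}, \\
\left|\frac{1}{K}\sum_{t=m+1}^{m+K} \mathrm{err}_t - \bar\alpha\right|
  &\le \frac{\big(\lambda_{\max} + \gamma(1-\bar\alpha)\big) - \big(-\gamma\bar\alpha\big)}{K\gamma}
   = \frac{\lambda_{\max} + \gamma}{K\gamma}
   = \frac{c+1}{cK},
\end{align*}
```

where the last equality uses $\gamma = c\,\lambda_{\max}$.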

6. Empirical Evaluation and Comparisons

Empirical assessments utilize datasets including daily logarithmic returns for stocks (e.g., AMD, Amazon, Nvidia), squared return volatility, and Google Trends queries (e.g., "deep learning"). Forecasters include a small transformer for returns, GARCH(1,1) for volatility, and a 5-layer LSTM for Google Trends, each producing nominal intervals by Gaussian quantiles.

The benchmark is Adaptive Conformal Inference (ACI), which recursively updates $\alpha_t$ using

$\alpha_t = \alpha_{t-1} + \gamma [\bar\alpha - \mathrm{err}_{t-1}]$

for a fixed step size $\gamma$. Metrics comprise local 500-point moving averages of miscoverage and interval length,

$\mathrm{LocalMiscov}_t = \frac{1}{500}\sum_{s=t-249}^{t+250} \mathrm{err}_s, \qquad \mathrm{LocalLength}_t = \frac{1}{500}\sum_{s=t-249}^{t+250} |C_s|,$

and the proportion of intervals of infinite length (signaling uninformative coverage).
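The ACI recursion and the local metrics are simple enough to sketch directly. The window indexing used here ($s = t-249, \dots, t+250$, exactly 500 points) is an assumption about the intended convention, and the function names are illustrative.

```python
def aci_update(alpha_prev, err_prev, alpha_bar, gamma):
    """ACI step: lower alpha (widen intervals) after a miss, raise it after a cover.

    alpha may drift outside [0, 1]; values <= 0 are typically read as the
    full-space (infinite-length) interval.
    """
    return alpha_prev + gamma * (alpha_bar - err_prev)

def local_average(values, t, window=500):
    """Average of values[s] over s = t - window/2 + 1, ..., t + window/2."""
    half = window // 2
    lo, hi = t - half + 1, t + half
    if lo < 0 or hi >= len(values):
        raise IndexError("window does not fit inside the series")
    return sum(values[lo:hi + 1]) / window

# Synthetic error sequence with exactly one miss in every ten steps.
errs = [1 if i % 10 == 0 else 0 for i in range(1000)]
local_miscov = local_average(errs, t=500)
```

The same `local_average` applies to a list of interval lengths to obtain the local length metric.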

Findings from the data include:

BCI and ACI both achieve near-target 10% miscoverage.
BCI yields consistently shorter average intervals (e.g., return series: 0.08 vs. 0.09) and avoids uninformative, infinite-length intervals observed with ACI under heavy distribution shifts or loose control.
Even when forecaster intervals are well-calibrated (e.g., GARCH on volatility), BCI matches ACI in coverage and interval length while robustly avoiding infinite intervals.
Largest benefits occur when nominal forecaster intervals are poorly calibrated (e.g., LSTM on Google Trends), with BCI reducing average interval widths by approximately 20% at the same coverage level (Yang et al., 2024).

7. Interpretation, Scope, and Relation to Existing Methods

BCI generalizes conformal inference for time series by incorporating dynamic programming and explicit multi-step prediction. Unlike ACI, which updates coverage controllers myopically, BCI reasons over a finite prediction horizon via stochastic control, optimizing the length-vs-coverage tradeoff. The methodology guarantees long-run frequentist coverage under arbitrary nonstationarity, adversarial distribution shifts, and even in the face of poor model calibration, while producing substantially tighter and more informative intervals. A plausible implication is that BCI represents a robust wrapper for any black-box forecasting pipeline, offering rigorous guarantees without assumptions on model correctness or data structure (Yang et al., 2024).

References

Yang et al. (2024). Bellman Conformal Inference: Calibrating Prediction Intervals for Time Series.
