Papers
Topics
Authors
Recent
Search
2000 character limit reached

Autoregressive Multiplier Online Bootstrap

Updated 9 June 2026
  • The paper introduces an autoregressive online bootstrap technique that generates dependent weights to mimic intrinsic time series correlations.
  • It employs an AR(1) weight sequence with normalization, enabling constant time and memory updates for real-time inference on streaming data.
  • Empirical and theoretical analyses confirm robust uncertainty quantification and valid variance estimation under weak dependence.

The Autoregressive Multiplier Online Bootstrap is an online resampling technique for dependent data streams, particularly time series. It generates bootstrap replicates by associating each observation with algorithmically dependent random weights, derived from an autoregressive process with coefficients that approach unity. This ensures increasing local correlation among weights, enabling the method to respect and mimic intrinsic dependencies in the observed data. Designed for streaming contexts, the procedure delivers online updates in constant time and memory per step. Theoretical guarantees hold for a broad class of stationary, weakly dependent processes, with empirical investigations confirming robust uncertainty quantification and coverage properties even in settings with complex dependence structures (Palm et al., 2023).

1. Definition of the Autoregressive Weight Sequence

The core of the method is a time-inhomogeneous AR(1) process generating the sequence of bootstrap weights {Vt}\{V_t\}: V0=0,V_0 = 0,

Vt=1+ρt(Vt11)+1ρt2  εt,V_t = 1 + \rho_t (V_{t-1} - 1) + \sqrt{1 - \rho_t^2} \;\varepsilon_t,

where

  • ρt=1tβ\rho_t = 1 - t^{-\beta} for β(0,12)\beta \in (0, \tfrac12),
  • {εt}t1\{\varepsilon_t\}_{t \geq 1} are i.i.d. random variables with E[εt]=0\mathbb{E}[\varepsilon_t]=0 and Var(εt)=1\mathrm{Var}(\varepsilon_t)=1, typically εtN(0,1)\varepsilon_t \sim N(0,1).

The process is "nearly non-stationary" due to ρt1\rho_t \to 1, inducing strong and growing dependence between successive V0=0,V_0 = 0,0. Weights are normalized at each time via

V0=0,V_0 = 0,1

which asymptotically converges to unity, making normalization negligible in large samples. This autoregressive construction allows for locally dependent bootstrap weights, preserving key features of the time series' dependence structure.

2. Online Algorithm and Computational Properties

The method is inherently online, maintaining V0=0,V_0 = 0,2 independent bootstrap replicates of the sample mean V0=0,V_0 = 0,3 for an incoming time series V0=0,V_0 = 0,4. For each replicate V0=0,V_0 = 0,5:

  1. Initialization: V0=0,V_0 = 0,6, V0=0,V_0 = 0,7, V0=0,V_0 = 0,8.
  2. At each V0=0,V_0 = 0,9:
    • Observe Vt=1+ρt(Vt11)+1ρt2  εt,V_t = 1 + \rho_t (V_{t-1} - 1) + \sqrt{1 - \rho_t^2} \;\varepsilon_t,0.
    • For Vt=1+ρt(Vt11)+1ρt2  εt,V_t = 1 + \rho_t (V_{t-1} - 1) + \sqrt{1 - \rho_t^2} \;\varepsilon_t,1:

      1. Draw Vt=1+ρt(Vt11)+1ρt2  εt,V_t = 1 + \rho_t (V_{t-1} - 1) + \sqrt{1 - \rho_t^2} \;\varepsilon_t,2.
      2. Compute Vt=1+ρt(Vt11)+1ρt2  εt,V_t = 1 + \rho_t (V_{t-1} - 1) + \sqrt{1 - \rho_t^2} \;\varepsilon_t,3.
      3. Update weight: Vt=1+ρt(Vt11)+1ρt2  εt,V_t = 1 + \rho_t (V_{t-1} - 1) + \sqrt{1 - \rho_t^2} \;\varepsilon_t,4.
      4. Update running mean: Vt=1+ρt(Vt11)+1ρt2  εt,V_t = 1 + \rho_t (V_{t-1} - 1) + \sqrt{1 - \rho_t^2} \;\varepsilon_t,5.
      5. Update bootstrap mean:

      Vt=1+ρt(Vt11)+1ρt2  εt,V_t = 1 + \rho_t (V_{t-1} - 1) + \sqrt{1 - \rho_t^2} \;\varepsilon_t,6

  • After updating all chains, summarize the empirical distribution Vt=1+ρt(Vt11)+1ρt2  εt,V_t = 1 + \rho_t (V_{t-1} - 1) + \sqrt{1 - \rho_t^2} \;\varepsilon_t,7 for inference.

Computationally, each new observation costs Vt=1+ρt(Vt11)+1ρt2  εt,V_t = 1 + \rho_t (V_{t-1} - 1) + \sqrt{1 - \rho_t^2} \;\varepsilon_t,8 time and Vt=1+ρt(Vt11)+1ρt2  εt,V_t = 1 + \rho_t (V_{t-1} - 1) + \sqrt{1 - \rho_t^2} \;\varepsilon_t,9 memory. Past data ρt=1tβ\rho_t = 1 - t^{-\beta}0 need not be stored and no block recomputation is required.

3. Theoretical Guarantees and Statistical Properties

Let ρt=1tβ\rho_t = 1 - t^{-\beta}1 denote the strong-mixing coefficients and ρt=1tβ\rho_t = 1 - t^{-\beta}2 for the stationary sequence ρt=1tβ\rho_t = 1 - t^{-\beta}3. Under the following conditions:

  • (A1): ρt=1tβ\rho_t = 1 - t^{-\beta}4,
  • (A2): ρt=1tβ\rho_t = 1 - t^{-\beta}5 for some ρt=1tβ\rho_t = 1 - t^{-\beta}6,
  • (A3): ρt=1tβ\rho_t = 1 - t^{-\beta}7,

the following results hold for all ρt=1tβ\rho_t = 1 - t^{-\beta}8:

ρt=1tβ\rho_t = 1 - t^{-\beta}9

yields valid asymptotic confidence intervals via bootstrap quantiles.

  • Variance Estimation: For β(0,12)\beta \in (0, \tfrac12)0 and the AR-bootstrap variance estimator

β(0,12)\beta \in (0, \tfrac12)1

it holds that: - The bias decreases as β(0,12)\beta \in (0, \tfrac12)2. - The variance is β(0,12)\beta \in (0, \tfrac12)3. - The mean squared error is optimized for β(0,12)\beta \in (0, \tfrac12)4. - β(0,12)\beta \in (0, \tfrac12)5 in probability.

4. Comparison with Classical Bootstrap Methods

The primary alternatives are block bootstrap and iid-multiplier bootstrap:

  • Block Bootstrap: Builds overlapping blocks of length β(0,12)\beta \in (0, \tfrac12)6, with block resampling or block-level dependent multipliers. Computational cost grows to β(0,12)\beta \in (0, \tfrac12)7 per change since all blocks must be re-formed when β(0,12)\beta \in (0, \tfrac12)8 increases, making it noncompetitive for streaming data.
  • IID Multiplier Bootstrap: Assigns independent weights to each data point, achieving β(0,12)\beta \in (0, \tfrac12)9 per update. Such schemes fail under dependent data, with empirical coverage rates collapsing even in the presence of mild short-range correlation.

The AR-multiplier online bootstrap introduces dependence among weights that grows with {εt}t1\{\varepsilon_t\}_{t \geq 1}0, but remains "locally dependent": multipliers for observations separated by {εt}t1\{\varepsilon_t\}_{t \geq 1}1 are nearly independent. This structure permits online computation in {εt}t1\{\varepsilon_t\}_{t \geq 1}2 per observation (per chain) and offers theoretical validity for stationary mixing sequences.

The bias-variance trade-off is summarized in the following table:

Method MSE Decay Rate Update Cost
Block Bootstrap {εt}t1\{\varepsilon_t\}_{t \geq 1}3 (for {εt}t1\{\varepsilon_t\}_{t \geq 1}4) {εt}t1\{\varepsilon_t\}_{t \geq 1}5
AR-multiplier Online {εt}t1\{\varepsilon_t\}_{t \geq 1}6 (for {εt}t1\{\varepsilon_t\}_{t \geq 1}7) {εt}t1\{\varepsilon_t\}_{t \geq 1}8

For streaming applications, this computational advantage of the AR-bootstrap offsets its slightly inferior statistical efficiency relative to block bootstrap (Palm et al., 2023).

5. Empirical Evaluations on Dependent Data

Empirical results confirm the theoretical claims:

  • Linear MA(q) Model: For {εt}t1\{\varepsilon_t\}_{t \geq 1}9 with E[εt]=0\mathbb{E}[\varepsilon_t]=00, the AR-bootstrap achieves nominal interval coverage for the sample mean. IID multipliers fail for E[εt]=0\mathbb{E}[\varepsilon_t]=01. The MA-block method is valid but computationally intensive.
  • Nonlinear Functionals: For E[εt]=0\mathbb{E}[\varepsilon_t]=02 and the statistic E[εt]=0\mathbb{E}[\varepsilon_t]=03, bootstrap validity for the sample mean extends by the delta method to smooth transformations.
  • MA(2)–GARCH(1,1) Model: For a process E[εt]=0\mathbb{E}[\varepsilon_t]=04 combining GARCH and moving average structure, the AR-bootstrap yields approximately correct 90% coverage and consistent variance estimation.

Across all considered scenarios, the AR-multiplier online bootstrap delivers consistent uncertainty quantification with minor variance increases relative to block multiplier schemes, while retaining E[εt]=0\mathbb{E}[\varepsilon_t]=05 computation per update.

6. Practical Implementation Guidelines

  • Tuning E[εt]=0\mathbb{E}[\varepsilon_t]=06: E[εt]=0\mathbb{E}[\varepsilon_t]=07 optimizes the mean squared error. Adjustment is possible based on a priori knowledge: increasing E[εt]=0\mathbb{E}[\varepsilon_t]=08 reduces variance but induces more bias (faster forgetting); decreasing E[εt]=0\mathbb{E}[\varepsilon_t]=09 increases dependence, reducing bias but potentially raising variance.
  • Number of Chains Var(εt)=1\mathrm{Var}(\varepsilon_t)=10: Empirically, Var(εt)=1\mathrm{Var}(\varepsilon_t)=11–Var(εt)=1\mathrm{Var}(\varepsilon_t)=12 yields stable inference for typical quantile estimation.
  • Initialization: Weight chains are initialized at zero; a "burn-in" of Var(εt)=1\mathrm{Var}(\varepsilon_t)=13 steps can be disregarded in small samples to mitigate transient initialization effects.
  • Multivariate and Nonlinear Statistics: Apply the same AR-multiplier chain across all dimensions of a vector or employ the delta method for smooth functionals of the mean.
  • Computational Load: Each observation requires three multiplications per chain and one standard normal draw; memory cost is Var(εt)=1\mathrm{Var}(\varepsilon_t)=14. There is no need to store the historical data series.

The AR-multiplier online bootstrap thus provides a computationally efficient and theoretically sound alternative to classical block-based bootstrapping for dependent data in real-time environments (Palm et al., 2023).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Autoregressive Multiplier Online Bootstrap.