Replicated Lugsail Batch Means Estimator
- Replicated lugsail batch means estimator is a technique that combines lugsail lag-window adjustments with batch means and replication to improve covariance estimation in time series.
- It modifies standard weighted batch means through an inflation factor and weight to effectively counteract negative bias, even under strong positive autocorrelation.
- Replication across independent runs reduces variance by a factor of 1/R, making the method robust for simulation studies and MCMC output analysis.
The replicated lugsail batch means estimator is a methodology for the estimation of time-average covariance matrices of stationary stochastic processes, including those encountered in Markov chain Monte Carlo (@@@@2@@@@) output analysis and simulation studies. This framework integrates a lugsail transformation into traditional lag windows, weighted batch means (BM) estimators, and replication across independent runs to achieve superior finite-sample bias and variance properties, especially in the presence of strong positive autocorrelation (Vats et al., 2018).
1. Lugsail Lag-Window Construction
The lugsail lag window modifies any symmetric base kernel or lag window with , producing a new family of windows that address negative bias in spectral variance and covariance estimation. Given an inflation factor and a weight (potentially sequence-dependent as with ), the lugsail window is defined as:
Key properties:
- .
- For (where is the bias order for the base kernel) and original BM bias , induces a positive first-order bias to offset negative bias.
- The window can exceed 1 initially, yielding a "lugsail" shape.
2. Weighted Batch Means Estimation with Lugsail Windows
Let be a (possibly -dimensional) stationary time series of length . Choosing a maximal batch size (), the construction proceeds as follows:
- Let be the number of non-overlapping batches for batch size .
- For batch index , the -th batch mean is
- The corresponding second-difference weights:
The lugsail weighted BM estimator is then
A jackknife-style equivalent is:
where is the standard BM estimator for batch size .
3. Replicated Estimation Across Independent Runs
For multiple independent realizations (e.g., independent MCMC chains, each of length ), let denote the lugsail BM estimate from the -th run. The replicated lugsail estimator is:
Empirical variance of the replicated estimator across chains is:
Replication substantially reduces estimator variance by a factor of $1/R$, with negligible increase in bias for large .
4. Bias and Variance Under -Mixing Conditions
Under -mixing (strong mixing) with bounded fourth moments (Assumption A.1 of Vats & Flegal (2019)), if the ordinary BM exhibits first-order bias and one chooses , , the bias and variance for lugsail BM are:
- Bias:
where , with the autocovariance function.
- Variance (for entry ):
For the replicated estimator:
- Bias remains as above.
- Variance is reduced by $1/R$.
5. Recommended Parameter Choices and Practical Guidance
Parameter selection is guided by the underlying process's autocorrelation structure. Vats & Flegal (2019) prescribe rules-of-thumb:
- Moderate correlation (): Zero-lugsail with , (zero first-order bias).
- Moderate to high correlation (): Adaptive lugsail with , .
- High to extreme correlation (): Over-lugsail with , , introducing a small positive bias to counteract large negative higher-order terms.
- Batch size recommendation: .
Replication () is strongly recommended when computational resources allow, as it reduces estimator variance by $1/R$ with no substantial impact on bias.
6. Context, Applications, and Extension
The replicated lugsail batch means estimator is designed for broad applicability in MCMC, steady-state simulation, and time series analysis, particularly in regimes with substantial positive serial dependence. Its ability to convert any existing lag window to a lugsail form without new assumptions, and its compatibility with ordinary batch means estimators, makes it particularly practical for large-scale simulation studies (Vats et al., 2018). The approach is supported by theoretical results on bias and variance, substantially weakening mixing requirements for consistency.
The lugsail methodology has demonstrated improved finite-sample reliability over conventional kernels in vector autoregressive models and Bayesian logistic regression, where conventional methods suffer from severe negative bias.
7. Summary Table: Parameter Choices by Correlation Regime
| Correlation Type | Lugsail Parameters | Bias Effect |
|---|---|---|
| Moderate () | , | Zero first-order bias |
| Moderate–High () | , adaptive | Bias zero as |
| High–Extreme () | , | Small positive first-order bias |
This estimator has become a preferred tool for practitioners requiring accurate time-average covariance estimation from highly correlated simulation or MCMC data, especially when computational efficiency and consistency under weak mixing are essential (Vats et al., 2018).