Doubly Robust Confidence Intervals
- Doubly robust confidence intervals are statistical methods that combine propensity score and outcome regression models to ensure valid inference if at least one model is correctly specified.
- They use augmented inverse probability weighting and nonparametric smoothing to construct uniform confidence bands for function-valued targets like the conditional average treatment effect function.
- The approach avoids the curse of dimensionality by modeling high-dimensional covariates parametrically while focusing nonparametric estimation on low-dimensional effect modifiers.
Doubly robust confidence intervals are a class of statistical inference procedures designed to yield valid coverage for causal or semiparametric parameters such as the average treatment effect or the conditional average treatment effect function (CATEF), even when one of two underlying nuisance models—typically a model for the propensity score or a model for the outcome regression—is misspecified. These approaches achieve robustness against model misspecification for both point estimation and inferential uncertainty, with adaptations to high-dimensional, nonparametric, and function-valued (e.g., uniform band) settings. The development of doubly robust confidence intervals has facilitated valid inference in complex and high-dimensional applications while maintaining interpretability and theoretically justified error control.
1. Fundamental Principles of Doubly Robust Confidence Intervals
The doubly robust (DR) inference paradigm leverages two parallel nuisance models: a model for the propensity score and a model for the conditional mean outcome for . A DR estimator combines these models, typically in an augmented inverse probability weighting (AIPW) form, to estimate a functional target (e.g., the CATEF, ).
The essential property is that the estimator is consistent if at least one of the two models is correctly specified. If a parametric model for the propensity score or for the conditional mean is well specified, DR estimation produces asymptotically normal estimators, with confidence intervals constructed using variance estimates derived from efficient influence functions or empirical resampling. In function-valued settings, uniform confidence bands are constructed to ensure coverage over a range or interval of interest.
The local linear DR estimator for is given by: where is the DR score, is a kernel function, and is a bandwidth parameter. The structure enforces that the influence of estimation error in the nuisance functions enters only at a second-order rate under regular conditions.
2. Construction of Uniform Confidence Bands
For function-valued targets such as the CATEF, uniform bands covering an interval of the covariate of interest are constructed using a nonparametric regression of the DR score on . Pointwise standard errors are estimated at each as: where is a kernel density estimator and estimates the conditional variance of the DR score at .
The uniform band is constructed as: where is a critical value derived from the supremum of a Gaussian process approximating the rescaled estimation error process: with a function of bandwidth, domain length, and kernel specifics.
These uniform bands accommodate the sampling distribution's tail behavior, controlling for multiple comparisons over the continuum of .
3. Avoidance of the Curse of Dimensionality
The proposed DR procedure avoids the curse of dimensionality by decomposing the set of covariates into a (typically low-dimensional) set of interest (for CATEF heterogeneity analysis) and a high-dimensional set (for confounding adjustment). Nuisance functions , are modeled parametrically to avoid high-dimensional smoothing, while the nonparametric estimation is confined to the low-dimensional space of .
Consequently, the nonparametric step inherits favorable convergence rates and mitigates sample size inefficiency, enabling feasible inference even when the number of adjustment covariates is large.
4. Empirical Applications and Monte Carlo Results
The DR uniform confidence band framework has been applied to assess heterogeneous treatment effects, notably in evaluating the effect of maternal smoking on birth weight. In such settings, is birth weight, is an indicator of smoking, are extensive controls, and (e.g., maternal age) is the covariate of interest for effect heterogeneity.
Empirical results for datasets (from Pennsylvania and North Carolina) display that the constructed confidence bands reveal statistically significant negative effects of smoking on birth weight in specific age intervals (e.g., 20–30 years). This supports the practical utility of the DR band for identifying regions of robust effect heterogeneity.
Monte Carlo simulations, conducted for and , compare scenarios of correct and incorrect model specification. When at least one model is correct, the DR estimator is approximately unbiased with small root-mean-squared errors and correct confidence band coverage (approaching 95% and 99%). Estimators relying solely on IPW or regression adjustment are more vulnerable to model misspecification. Compared to pointwise or Gumbel-based uniform bands, the proposed critical value yields bands closer to nominal coverage and avoids excessive conservatism.
5. Regularity Conditions and Limitations
The methodology is justified under several key assumptions:
- Unconfoundedness: ,
- CATEF smoothness: is at least twice continuously differentiable over ,
- Kernel function is symmetric, well-behaved, and differentiable,
- Bandwidth is chosen for undersmoothing (ensuring asymptotic bias is negligible),
- The dimension of is less than 4.
Limitations include vulnerability to bias if both the propensity score and regression models are misspecified, and the coverage of the confidence band relies on asymptotic properties that may be affected by sample size and violation of technical conditions. The theoretical uniform coverage relies on advanced Gaussian process approximations and an undersmoothing bias regime.
6. Mathematical Formulation
The estimator’s core components are:
The uniform confidence band is
where
and
7. Implications and Extensions
The uniform doubly robust inference approach, by combining parametric adjustment for high-dimensional controls with nonparametric smoothing only over the low-dimensional effect modifiers, establishes a practical and theoretically principled route to uniform confidence bands for heterogeneous treatment effects. This paradigm is particularly valuable for observational studies where complex confounding adjustment is necessary, but the main scientific interest centers on interpretable, low-dimensional covariates.
The methodological innovation provides a template that can be extended to alternative functionals or to alternative semiparametric inference problems. Future work may focus on further relaxing modeling assumptions, bandwidth/data-driven tuning, and extensions to broader classes of treatment effect functionals or more challenging data regimes.