Rerandomization for QTE Estimation
- The paper presents a novel integration of rerandomization into QTE estimation by enforcing covariate balance via the Mahalanobis distance.
- It reveals that rerandomization induces a non-Gaussian limiting distribution while yielding efficiency gains, with variance reductions estimated up to 45%.
- Monte Carlo simulations and empirical studies validate shorter confidence intervals and robust coverage, highlighting practical improvements in causal inference.
Rerandomization for quantile treatment effect (QTE) estimation concerns the integration of rerandomization procedures—designed to enforce covariate balance during experimental treatment assignment—into the estimation and inference of quantile-based causal effects. While complete randomization is a classical standard, rerandomization restricts assignment to allocations yielding desirable covariate balance, quantitatively measured by the Mahalanobis distance in covariate means. This methodology transposes the efficiency benefits achieved for mean (average treatment effect, ATE) estimation to the quantile regime, providing inferential methods that address both finite-sample and asymptotic concerns and accommodating the distinctive, non-Gaussian limiting distributional properties that arise for quantile estimators under rerandomization (Han et al., 18 Jan 2026, Wang et al., 2024).
1. Formal Framework and Rerandomization Criterion
The finite-population potential outcomes framework considers units, each with covariates and fixed potential outcomes and . Treatment assignment is generated such that unit receives treatment if ; the observed outcome is .
The QTE estimand at quantile is the difference in empirical -quantiles,
where is the potential outcome CDF under arm .
Rerandomization (ReM) enforces balance by repeatedly sampling treatment assignments until the Mahalanobis distance between treated and control covariate means, , satisfies , where and is the finite-population covariance. The stringency is controlled by the threshold , which determines the acceptance probability (Han et al., 18 Jan 2026, Wang et al., 2024).
2. Quantile Estimation and Rerandomization-Adjusted Procedures
For the observed data, the empirical CDFs of outcomes in each arm are
and the plug-in quantile estimator is
Thus, the QTE estimator is
The general M-estimator framework characterizes the estimator as the solution to
where, for QTE, and
with . This ensures that and solve for the empirical -quantiles in treated and control groups (Wang et al., 2024).
3. Asymptotic Distribution under Rerandomization
Under complete randomization, is asymptotically normal. In contrast, rerandomization induces a non-Gaussian limiting law. Specifically, under mild regularity and conditioning on : where:
- , independent of .
- for .
- is the asymptotic variance under complete randomization (including density scaling factors).
- is the squared multiple correlation between the QTE estimator and the covariate mean difference (Han et al., 18 Jan 2026).
Therefore, the limiting law is a mixture of normal and truncated normal components ("spike and slab" structure), and is non-Gaussian unless the QTE estimator is uncorrelated with the balanced covariates (Han et al., 18 Jan 2026, Wang et al., 2024).
Augmenting the estimating equation with the rerandomization covariate-mean differences eliminates the non-Gaussian component, restoring exact asymptotic normality (Wang et al., 2024).
4. Variance Estimation and Confidence Intervals
Because some counterfactual (joint potential outcome) covariances are unidentifiable, inference is based on conservative estimators. Denoting sample analogs of variance components as , , and , define
where is the -quantile of the distribution . This interval is conservative: Alternatively, augmenting the estimating equations as above permits the standard sandwich variance estimator and usual Wald-type confidence intervals using the post-augmentation asymptotic normality (Han et al., 18 Jan 2026, Wang et al., 2024).
5. Efficiency Gains and Theoretical Comparison
Rerandomization reduces the asymptotic variance of compared to complete randomization. The percent reduction in asymptotic sampling variance (PRIASV) is characterized as
with $v_{K_n, a_n} = \Var(L_{K_n, a_n}) = P(\chi^2_{K_n+2}\leq a_n)/P(\chi^2_{K_n}\leq a_n)<1$ (Han et al., 18 Jan 2026). Higher reflects greater prognostic strength of the covariates relative to the QTE; more stringent balance (smaller ) increases variance reduction. Empirical simulations demonstrate PRIASV values between 10–50%, consistent with theory, and variance reductions up to 45% for –0.5 (Han et al., 18 Jan 2026, Wang et al., 2024).
6. Simulation Evidence and Empirical Performance
Monte Carlo simulations and real-data applications (e.g., IHDP covariates) validate the theory:
- Variance reductions for are proportional to PRIASV.
- Shorter confidence intervals (10–30% reduction in length) are achieved with maintained or enhanced nominal coverage.
- Coverage robustness persists at moderate sample sizes () and allocation imbalance.
- When key baseline covariates are balanced, the realized variance closely follows times the variance under complete randomization, with quantifying the predictive power of rerandomized covariates for the QTE estimator's influence function (Han et al., 18 Jan 2026, Wang et al., 2024).
7. Implementation Guidance and Practical Considerations
Recommended practices include:
- Select rerandomization covariates () with high predictive power for outcome quantiles or the QTE influence function.
- Use the rerandomization (or stratified rerandomization) procedure: iteratively accept treatment assignments only if the sample Mahalanobis distance for is below the prespecified threshold.
- Estimate QTE by fitting quantile regressions or direct sample quantiles in treated and control arms.
- Estimate variances using either the conservative variance estimator (from the non-Gaussian limit) or, after augmenting the estimating equation with covariate-mean imbalances, the usual sandwich estimator.
- For inference, use either the conservative interval in the rerandomization framework or standard Wald intervals post-augmentation (Han et al., 18 Jan 2026, Wang et al., 2024).
In summary, rerandomization for QTE estimation extends the efficiency and robustness gains previously established for average treatment effect settings, introduces unique non-Gaussian inferential phenomena, and is supported by both theoretical and empirical evidence for substantial gains in estimation precision and interval validity in practical applications (Han et al., 18 Jan 2026, Wang et al., 2024).