Papers
Topics
Authors
Recent
2000 character limit reached

Reweighted Conformal Prediction Procedure

Updated 17 December 2025
  • The paper introduces a method that guarantees both marginal and mask-conditional coverage by adjusting prediction intervals after imputation.
  • It employs a preimpute–mask–then–correct pipeline using importance weighting or acceptance-rejection to handle various missing data mechanisms like MCAR, MAR, and MNAR.
  • Empirical evaluations demonstrate up to a 30% reduction in prediction interval width while maintaining reliable coverage across different datasets.

A reweighted conformal prediction procedure is a statistical method designed to provide reliable uncertainty quantification when features (covariates) in the dataset are subject to missingness. The foundational challenge it addresses is that classical conformal prediction (CP) does not guarantee valid coverage in the presence of missing covariates, especially when coverage must be controlled not only marginally, but for each possible missingness pattern (known as mask-conditional coverage, MCV). The reweighted conformal approach provides both marginal and mask-conditional coverage guarantees by correcting prediction intervals post-imputation, via importance reweighting or acceptance-rejection schemes. This enables compatibility with standard distributional imputation pipelines while delivering sharper, adaptively valid prediction sets even under complex missing data mechanisms, such as MCAR, MAR, or MNAR (Fan et al., 16 Dec 2025).

1. Problem Formulation and Background

Consider a supervised regression or classification framework where the data-generating process produces i.i.d. triples (X,M,Y)(X, M, Y), with XXX\in\mathcal{X} the covariate vector of dimension dd, M{0,1}dM\in\{0,1\}^d a mask indicating missingness (mi=1m_i=1 if xix_i is missing, 0 otherwise), and YYY\in\mathcal{Y} the response or label. The observed features are X~=mask(X,M)\tilde X = \mathrm{mask}(X, M), with unobserved entries replaced by NA. In this context, conformal prediction aims to construct a prediction set C^α(X~)\hat C_\alpha(\tilde X) such that, for new data,

P(YC^α(X~))1α(MV),P(YC^α(X~)M=m)1α(MCV)P\big(Y \in \hat C_\alpha(\tilde X)\big) \geq 1-\alpha \quad \text{(MV)}, \quad P\big(Y \in \hat C_\alpha(\tilde X) \mid M = m \big) \geq 1 - \alpha \quad \text{(MCV)}

for all masks mm with nonzero probability mass. Standard split conformal prediction furnishes only marginal validity (MV) under exchangeability; mask-conditional validity (MCV) is generally not guaranteed, especially under heteroskedastic or non-random missing patterns (Fan et al., 16 Dec 2025).

2. The Preimpute–Mask–Then–Correct Framework

To handle missing covariates, the framework consists of the following pipeline:

  1. Preimpute: Each calibration sample (X~i,Yi)(\tilde X^i, Y^i) is subjected to distributional imputation via any probabilistic mechanism φ\varphi, yielding imputed XiQ(X~i,Yi)X^i \sim Q(\cdot|\tilde X^i, Y^i) supported on the observed entries.
  2. Mask: For a test-time missingness pattern (mask) mm, each imputed calibration instance is re-masked so that only the features indicated as observed by mm are retained.
  3. Correct: A correction step is applied to address the distributional shift between the imputed-masked calibration set (QmQ_m) and the true masked-conditional law (PmP_m). This is achieved either by reweighting calibration points via the likelihood ratio ωm=dPm/dQm\omega_m = dP_m/dQ_m, or by acceptance-rejection sampling to simulate draws from PmP_m.

A schematic table of the high-level steps:

Step Description Output
Preimpute Draw imputed XiX^i for each calibration (X~i,Yi)(\tilde X^i, Y^i) {X^i,Yi}i=1n\{\hat X^i, Y^i\}_{i=1}^n
Mask Apply mask mm to each X^i\hat X^i {X^obsi(m),Yi}\{\hat X^i_\mathrm{obs}(m), Y^i\}
Correct Apply weighting or rejection sampling for coverage Adjusted calibration scores

This method leverages the calibration regime of split CP and is agnostic to the specific mechanism of imputation φ\varphi.

3. Weighted and Acceptance–Rejection Correction Methods

3.1 Weighted Conformal Prediction

Under the assumption PmQmP_m \ll Q_m, compute the likelihood ratio (importance weight)

ωm(xobs(m),y):=dPmdQm(xobs(m),y)\omega_m(x_\mathrm{obs}(m), y) := \frac{dP_m}{dQ_m}(x_\mathrm{obs}(m), y)

for each calibration instance and test candidate. The normalized conformal weights for prediction set construction become, for i=1,,ni=1,\dots,n,

Wmi(x,y)=ωmij=1nωmj+ωm(x,y),Wmn+1(x,y)=ωm(x,y)j=1nωmj+ωm(x,y)W_m^i(x,y) = \frac{\omega_m^i}{\sum_{j=1}^n \omega_m^j + \omega_m(x,y)}, \quad W_m^{n+1}(x,y) = \frac{\omega_m(x,y)}{\sum_{j=1}^n \omega_m^j + \omega_m(x,y)}

Prediction sets are computed by evaluating the 1α1-\alpha quantile over the weighted empirical distribution of nonconformity scores, yielding

C^αW(X~n+1)={y:s(X~n+1,y)Q1α(weighted scores)}\hat C^W_\alpha(\tilde X^{n+1}) = \{ y : s(\tilde X^{n+1}, y) \leq Q_{1-\alpha}(\text{weighted scores}) \}

where ss is the nonconformity measure and Q1αQ_{1-\alpha} is the weighted quantile (Fan et al., 16 Dec 2025).

3.2 Acceptance–Rejection Corrected CP

If the likelihood ratio ωm\omega_m is bounded, perform acceptance-rejection by sampling UiUnif(0,1)U^i\sim\text{Unif}(0,1); accept calibration point ii if Ui<ωmi/KU^i < \omega_m^i/K with KmaxiωmiK\ge\max_i \omega_m^i. The accepted subset follows the target PmP_m law post-masking. Split CP is then run as usual on this subset (Fan et al., 16 Dec 2025).

4. Theoretical Validity and Robustness Results

Exact Coverage Guarantees

  • Weighted CP: Under absolute continuity and exchangeability, the weighted procedure provides exact mask-conditional validity: P(YC^αWM=m)1αP(Y \in \hat C^W_\alpha | M = m) \ge 1-\alpha.
  • ARC CP: Provided ωm\omega_m is bounded, acceptance-rejection creates a calibration set i.i.d. under PmP_m; split CP then ensures P(YC^αARM=m)1αP(Y \in \hat C^{AR}_\alpha | M = m)\ge 1-\alpha (Fan et al., 16 Dec 2025).

Effect of Imperfect ωm\omega_m Estimation

If only an estimated ω^m\hat\omega_m (normalized to ω~m\tilde\omega_m) is available, the coverage is controlled up to a total variation penalty: P(YC^αW^M=m)(1α)dTV(P~m,Pm)P(Y \in \hat C^{\widehat W}_\alpha|M=m) \ge (1-\alpha) - d_{TV}(\tilde P_m, P_m) with dTVd_{TV} total variation between the estimated and true mask-conditional laws. The practical implication is that weight estimation should be sufficiently accurate (e.g., test-set corr(ω^,ω)0.3\mathrm{corr}(\hat\omega, \omega) \ge 0.3 ensures empirical miscoverage is acceptably small) (Fan et al., 16 Dec 2025).

Necessity of Correction

Empirical ablation shows that omitting the correction step ("impute–mask–split" only) leads to under-coverage (e.g., worst-case mask-conditional coverage \approx89% when targeting 90%), while reweighting or ARC restores validity.

5. Empirical Evaluation and Performance

Experiments span both synthetic and real tabular datasets, evaluating the procedures under MCAR, MAR, and MNAR mechanisms. Key findings:

  • Synthetic: For d=10d=10 covariates, under 50% MCAR, both weighted CP and ARC CP attain nominal 90% mask-conditional coverage. They reduce average interval width by roughly 30% compared to the MDA-Nested baseline, which is substantially more conservative.
  • Real-world: On datasets such as UCI Concrete (8 features,  40%~40\% missing per test), Bike-sharing, and MEPS19, both procedures preserve desired coverage per mask and shrink prediction intervals by 10%12%10\%-12\% relative to conservative alternatives.
  • ARC vs. Weighted: ARC CP is computationally fast, reduces width further on some tasks, and does not break coverage when used with a slightly inflated KK (acceptance rate 50%\gtrsim50\% recommended).

A summary comparison of selected methods:

Method Mask-Conditional Valid Average Width Reduction vs. Baseline
Weighted CP Yes \sim10–30%
ARC CP Yes Largest
MDA-Nested Conservative None
Naive Split No Narrower but under-covers

6. Implementation and Practical Guidance

Integration into standard supervised pipelines is streamlined:

  • Any off-the-shelf distributional imputer (e.g., MICE, Bayesian Ridge) fills the calibration set a single time.
  • The test instance is not imputed; instead, the imputed calibration set is masked to match the test-time missingness.
  • Correction is applied post hoc: either weighted CP or ARC CP is used to calibrate prediction sets from any base model.
  • For weighted CP, a coarse search grid over yy is recommended (h0.01h\sim0.01 of response range).
  • For ARC CP, KK should be slightly larger than the estimated max ωm\omega_m, ensuring adequate sample size.
  • For ωm\omega_m estimation, balanced classifier approaches (e.g., histogram-GBDT, logistic regression) with enough calibration data (n100n\ge100) yield robust results; extremely poor weight estimation degrades coverage reliability (Fan et al., 16 Dec 2025).

7. Limitations and Prospective Directions

The primary limitations include the requirement for accurate estimation (or boundedness) of the likelihood ratio ωm\omega_m; grossly inaccurate estimators or extreme unboundedness can break coverage or reduce acceptance rates in ARC CP. Weighted CP quantile searches may be computationally expensive for fine grids or high-dimensional yy. Potential future extensions involve adaptive tuning of KK for improved efficiency/width tradeoff in ARC, online masking for streaming data, robustification for imperfect weights, and generalization to classification or structured-output tasks with missing entries (Fan et al., 16 Dec 2025).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Whiteboard

Follow Topic

Get notified by email when new papers are published related to Reweighted Conformal Prediction Procedure.