Survey calibration for causal inference: a simple method to balance covariate distributions
Abstract: This paper proposes a~simple, yet powerful, method for balancing distributions of covariates for causal inference based on observational studies. The method makes it possible to balance an arbitrary number of quantiles (e.g., medians, quartiles, or deciles) together with means if necessary. The proposed approach is based on the theory of calibration estimators (Deville and S\"arndal 1992), in particular, calibration estimators for quantiles, proposed by Harms and Duchesne (2006). The method does not require numerical integration, kernel density estimation or assumptions about the distributions. Valid estimates can be obtained by drawing on existing asymptotic theory. An~illustrative example of the proposed approach is presented for the entropy balancing method and the covariate balancing propensity score method. Results of a~simulation study indicate that the method efficiently estimates average treatment effects on the treated (ATT), the average treatment effect (ATE), the quantile treatment effect on the treated (QTT) and the quantile treatment effect (QTE), especially in the presence of non-linearity and mis-specification of the models. The proposed approach can be further generalized to other designs (e.g. multi-category, continuous) or methods (e.g. synthetic control method). An open source software implementing proposed methods is available.
- Program evaluation and causal inference with high-dimensional data. Econometrica, 85(1):233–298.
- Benjamin, D. J. (2003). Does 401 (k) eligibility increase saving?: Evidence from propensity score subclassification. Journal of Public Economics, 87(5-6):1259–1290.
- Chen, Y. (2020). A distributional synthetic control method for policy evaluation. Journal of Applied Econometrics, 35(5):505–525.
- Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal, 21(1):C1–C68.
- Calibration estimators in survey sampling. Journal of the American statistical Association, 87(418):376–382.
- Improving covariate balancing propensity score: A doubly robust and efficient approach. URL: https://imai. fas. harvard. edu/research/CBPStheory. html.
- Firpo, S. (2007). Efficient Semiparametric Estimation of Quantile Treatment Effects. Econometrica, 75(1):259–276.
- Covariate balancing propensity score for a continuous treatment: Application to the efficacy of political advertisements. The Annals of Applied Statistics, 12(1):156–177.
- CBPS: Covariate Balancing Propensity Score. R package version 0.23.
- Inverse probability tilting for moment condition models with missing data. The Review of Economic Studies, 79(3):1053–1079.
- Greifer, N. (2023). WeightIt: Weighting for Covariate Balance in Observational Studies. R package version 0.14.2.
- Gunsilius, F. F. (2023). Distributional Synthetic Controls. Econometrica, 91(3):1105–1117.
- Hainmueller, J. (2012). Entropy balancing for causal effects: A multivariate reweighting method to produce balanced samples in observational studies. Political analysis, 20(1):25–46.
- On calibration estimation for quantiles. Survey Methodology, 32(1):37–52.
- Hazlett, C. (2020). Kernel Balancing: A Flexible Non-Parametric Weighting Procedure for Estimating Causal Effects. SSRN Electronic Journal, 30(3).
- Independence weights for causal inference with continuous treatments. Journal of the American Statistical Association, (just-accepted):1–25.
- Energy balancing of covariate distributions. arXiv preprint arXiv:2004.13962.
- Covariate balancing propensity score. Journal of the Royal Statistical Society Series B: Statistical Methodology, 76(1):243–263.
- Kim, J. K. (2010). Calibration estimation using exponential tilting in sample surveys. Survey Methodology, 36:145–155.
- Lumley, T. (2004). Analysis of complex survey samples. Journal of Statistical Software, 9(1):1–19. R package verson 2.2.
- Robust estimation of causal effects via a high-dimensional covariate balancing propensity score. Biometrika, 107(3):533–554.
- R Core Team (2023). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria.
- Rosenbaum, P. R. (1987). Model-based direct adjustment. Journal of the American statistical Association, 82(398):387–394.
- The central role of the propensity score in observational studies for causal effects. Biometrika, 70(1):41–55.
- Covariate distribution balance via propensity scores. Journal of Applied Econometrics, 37(6):1093–1120.
- Minimal dispersion approximately balancing weights: asymptotic properties and practical considerations. Biometrika, 107(1):93–105.
- Calibration weighting methods for complex surveys. International Statistical Review, 84(1):79–98.
- Hierarchically Regularized Entropy Balancing. Political Analysis, 31(3):457–464.
- Calibration Techniques Encompassing Survey Sampling, Missing Data Analysis and Causal Inference. International Statistical Review, page insr.12518.
- Zhao, Q. (2019). Covariate balancing propensity score by tailored loss functions. The Annals of Statistics, 47(2).
- Entropy balancing is doubly robust. Journal of Causal Inference, 5(1):20160010.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.