Inverting estimating equations for causal inference on quantiles (2401.00987v2)
Abstract: The causal inference literature frequently focuses on estimating the mean of the potential outcome, whereas quantiles of the potential outcome may carry important additional information. We propose a unified approach, based on the inverse estimating equations, to generalize a class of causal inference solutions from estimating the mean of the potential outcome to its quantiles. We assume that a moment function is available to identify the mean of the threshold-transformed potential outcome, based on which a convenient construction of the estimating equation of quantiles of potential outcome is proposed. In addition, we give a general construction of the efficient influence functions of the mean and quantiles of potential outcomes, and explicate their connection. We motivate estimators for the quantile estimands with the efficient influence function, and develop their asymptotic properties when either parametric models or data-adaptive machine learners are used to estimate the nuisance functions. A broad implication of our results is that one can rework the existing result for mean causal estimands to facilitate causal inference on quantiles. Our general results are illustrated by several analytical and numerical examples.
- Abadie, A., Angrist, J., and Imbens, G. (2002), “Instrumental variables estimates of the effect of subsidized training on the quantiles of trainee earnings,” Econometrica, 70, 91–117.
- Angrist, J. and Imbens, G. (1995), “Identification and estimation of local average treatment effects,” .
- Angrist, J. D., Imbens, G. W., and Rubin, D. B. (1996), “Identification of causal effects using instrumental variables,” Journal of the American statistical Association, 91, 444–455.
- Bang, H. and Robins, J. M. (2005), “Doubly robust estimation in missing data and causal inference models,” Biometrics, 61, 962–973.
- Bodory, H., Huber, M., and Lafférs, L. (2022), “Evaluating (weighted) dynamic treatment effects by double machine learning,” The Econometrics Journal, 25, 628–648.
- Cattaneo, M. D. (2010), “Efficient semiparametric estimation of multi-valued treatment effects under ignorability,” Journal of Econometrics, 155, 138–154.
- Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W., and Robins, J. (2018), “Double/debiased machine learning for treatment and structural parameters: Double/debiased machine learning,” The Econometrics Journal, 21.
- Chernozhukov, V., Escanciano, J. C., Ichimura, H., Newey, W. K., and Robins, J. M. (2022a), “Locally robust semiparametric estimation,” Econometrica, 90, 1501–1535.
- Chernozhukov, V. and Hansen, C. (2005), “An IV model of quantile treatment effects,” Econometrica, 73, 245–261.
- Chernozhukov, V., Newey, W., Singh, R., and Syrgkanis, V. (2022b), “Automatic debiased machine learning for dynamic treatment effects and general nested functionals,” arXiv preprint arXiv:2203.13887.
- Ding, P. and Lu, J. (2017), “Principal stratification analysis using principal scores,” Journal of the Royal Statistical Society Series B: Statistical Methodology, 79, 757–777.
- Firpo, S. (2007), “Efficient semiparametric estimation of quantile treatment effects,” Econometrica, 75, 259–276.
- Frangakis, C. E. and Rubin, D. B. (2002), “Principal stratification in causal inference,” Biometrics, 58, 21–29.
- Frölich, M. and Melly, B. (2008), “Unconditional quantile treatment effects under endogeneity,” .
- — (2013), “Unconditional quantile treatment effects under endogeneity,” Journal of Business & Economic Statistics, 31, 346–357.
- Hahn, J. (1998), “On the role of the propensity score in efficient semiparametric estimation of average treatment effects,” Econometrica, 315–331.
- Hines, O., Dukes, O., Diaz-Ordaz, K., and Vansteelandt, S. (2022), “Demystifying statistical learning based on efficient influence functions,” The American Statistician, 76, 292–304.
- Hogan, J. W. and Lee, J. Y. (2004), “Marginal structural quantile models for longitudinal observational studies with time-varying treatment,” Statistica Sinica, 927–944.
- Hu, B. and Nan, B. (2023), “Conditional Distribution Function Estimation Using Neural Networks for Censored and Uncensored Data,” Journal of Machine Learning Research, 24, 1–26.
- Imai, K., Keele, L., and Tingley, D. (2010), “A general approach to causal mediation analysis.” Psychological Methods, 15, 309.
- Jiang, Z., Yang, S., and Ding, P. (2022), “Multiply robust estimation of causal effects under principal ignorability,” Journal of the Royal Statistical Society Series B: Statistical Methodology, 84, 1423–1445.
- Kennedy, E. H., Ma, Z., McHugh, M. D., and Small, D. S. (2017), “Non-parametric methods for doubly robust estimation of continuous treatment effects,” Journal of the Royal Statistical Society Series B: Statistical Methodology, 79, 1229–1245.
- Newey, W. K. (1994), “The asymptotic variance of semiparametric estimators,” Econometrica: Journal of the Econometric Society, 1349–1382.
- Pakes, A. and Pollard, D. (1989), “Simulation and the asymptotics of optimization estimators,” Econometrica: Journal of the Econometric Society, 1027–1057.
- Rosenbaum, P. R. and Rubin, D. B. (1983), “The central role of the propensity score in observational studies for causal effects,” Biometrika, 70, 41–55.
- Rotnitzky, A., Smucler, E., and Robins, J. M. (2021), “Characterization of parameters with a mixed bias property,” Biometrika, 108, 231–238.
- Rubin, D. B. (2006), “Causal inference through potential outcomes and principal stratification: application to studies with” censoring” due to death,” Statistical Science, 299–309.
- Tchetgen, E. J. T. and Shpitser, I. (2012), “Semiparametric theory for causal mediation analysis: efficiency bounds, multiple robustness, and sensitivity analysis,” Annals of Statistics, 40, 1816.
- Tran, L., Yiannoutsos, C., Wools-Kaloustian, K., Siika, A., Van Der Laan, M., and Petersen, M. (2019), “Double robust efficient estimators of longitudinal treatment effects: comparative performance in simulations and a case study,” The international journal of biostatistics, 15, 20170054.
- van der Laan, M. J. and Rubin, D. (2006), “Targeted maximum likelihood learning,” The International Journal of Biostatistics, 2.
- Zhang, Z., Chen, Z., Troendle, J. F., and Zhang, J. (2012), “Causal inference on quantiles with an obstetric application,” Biometrics, 68, 697–706.
- Zheng, W. and van der Laan, M. J. (2010), “Asymptotic theory for cross-validated targeted maximum likelihood estimation,” U.C. Berkeley Division of Biostatistics Working Paper Series.
- Zhou, X. (2022), “Semiparametric estimation for causal mediation analysis with multiple causally ordered mediators,” Journal of the Royal Statistical Society Series B: Statistical Methodology, 84, 794–821.