The Connection Between R-Learning and Inverse-Variance Weighting for Estimation of Heterogeneous Treatment Effects (2307.09700v2)
Abstract: Many methods for estimating conditional average treatment effects (CATEs) can be expressed as weighted pseudo-outcome regressions (PORs). Previous comparisons of POR techniques have paid careful attention to the choice of pseudo-outcome transformation. However, we argue that the dominant driver of performance is actually the choice of weights. For example, we point out that R-Learning implicitly performs a POR with inverse-variance weights (IVWs). In the CATE setting, IVWs mitigate the instability associated with inverse-propensity weights, and lead to convenient simplifications of bias terms. We demonstrate the superior performance of IVWs in simulations, and derive convergence rates for IVWs that are, to our knowledge, the fastest yet shown without assuming knowledge of the covariate distribution.
- Recursive partitioning for heterogeneous causal effects. Proc. Natl. Acad. Sci. U. S. A., 113(27):7353–7360, July 2016.
- Doubly robust estimation in missing data and causal inference models. Biometrics, 61(4):962–973, December 2005.
- Some new asymptotic theory for least squares series: Pointwise and uniform results. J. Econom., 186(2):345–366, June 2015.
- Bickel, P. J. On adaptive estimation. Ann. Stat., 10(3):647–671, 1982.
- Estimating integrated squared density derivatives: Sharp best order of convergence estimates. Sankhyā: The Indian Journal of Statistics, Series A (1961-2002), 50(3):381–393, 1988.
- Linear regression with censored data. Biometrika, 66(3):429–436, 1979.
- A general statistical framework for subgroup identification and comparative treatment scoring. Biometrics, 73(4):1199–1209, December 2017.
- Locally robust semiparametric estimation. Econometrica, 90(4):1501–1535, 2022a.
- Automatic debiased machine learning of causal and structural effects. Econometrica, 90(3):967–1027, 2022b.
- Nonparametric estimation of heterogeneous treatment effects: From theory to learning algorithms. In Banerjee, A. and Fukumizu, K. (eds.), Proceedings of The 24th International Conference on Artificial Intelligence and Statistics, volume 130 of Proceedings of Machine Learning Research, pp. 1810–1818. PMLR, 2021.
- Targeted learning ensembles for optimal individualized treatment rules with time-to-event outcomes. Biometrika, 105(3):723–738, September 2018.
- Censored regression: Local linear approximations and their applications. J. Am. Stat. Assoc., 89(426):560–570, June 1994.
- Three-way Cross-Fitting and Pseudo-Outcome regression for estimation of conditional effects and other linear functionals. June 2023.
- Orthogonal statistical learning. January 2019.
- Bayesian regression tree models for causal inference: regularization, confounding, and heterogeneous effects. June 2017.
- Hill, J. L. Bayesian nonparametric modeling for causal inference. J. Comput. Graph. Stat., 20(1):217–240, January 2011.
- Propensity score modeling strategies for the causal analysis of observational data. Biostatistics, 3(2):179–193, June 2002.
- Estimating treatment effect heterogeneity in randomized program evaluation. aoas, 7(1):443–470, March 2013.
- Kennedy, E. H. Towards optimal doubly robust estimation of heterogeneous causal effects. May 2022a.
- Kennedy, E. H. Semiparametric doubly robust targeted double machine learning: a review. March 2022b.
- Sharp instruments for classifying compliers and generalizing causal effects. Ann. Stat., 2020.
- Minimax rates for heterogeneous causal effect estimation. March 2022.
- Metalearners for estimating heterogeneous treatment effects using machine learning. Proc. Natl. Acad. Sci. U. S. A., 116(10):4156–4165, March 2019.
- Cross-Fitting and fast remainder rates for semiparametric estimation. January 2018.
- Quasi-oracle estimation of heterogeneous treatment effects. Biometrika, 2020.
- Some methods for heterogeneous treatment effect estimation in high dimensions. Stat. Med., 37(11):1767–1787, May 2018.
- Higher order influence functions and minimax estimation of nonlinear functionals. May 2008.
- Semiparametric efficiency in multivariate regression models with missing data. J. Am. Stat. Assoc., 90(429):122–129, 1995.
- On profile likelihood: Comment. J. Am. Stat. Assoc., 95(450):477–482, 2000.
- Robinson, P. M. Root-N-Consistent semiparametric regression. Econometrica, 56(4):931–954, 1988.
- A general imputation methodology for nonparametric regression with censored data. 2005.
- A doubly robust censoring unbiased transformation. Int. J. Biostat., 3(1), 2007.
- Adjusting for nonignorable Drop-Out using semiparametric nonresponse models. J. Am. Stat. Assoc., 94(448):1096–1120, December 1999.
- Schick, A. On asymptotically efficient estimation in semiparametric models. Ann. Stat., 14(3):1139–1151, 1986.
- Debiased machine learning of conditional average treatment effects and other causal functions. Econom. J., 24(2):264–289, August 2020.
- Estimation and inference on heterogeneous treatment effects in high-dimensional dynamic panels under weak dependence. December 2017.
- lightgbm: Light gradient boosting machine. https://CRAN.R-project.org/package=lightgbm, 2023.
- Causal inference and machine learning in practice with EconML and CausalML: Industrial use cases at microsoft, TripAdvisor, uber. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, KDD ’21, pp. 4072–4073, New York, NY, USA, August 2021. Association for Computing Machinery.
- A simple method for estimating interactions between a treatment and a large number of covariates. J. Am. Stat. Assoc., 109(508):1517–1532, October 2014.
- Tropp, J. A. An introduction to matrix concentration inequalities. Foundations and Trends® in Machine Learning, 8(1-2):1–230, 2015.
- Tsybakov, A. B. Introduction to nonparametric estimation. Springer Series in Statistics, 2009.
- van der Laan, M. J. Statistical inference for variable importance. Int. J. Biostat., 2(1), February 2006.
- A survey on causal inference. ACM Trans. Knowl. Discov. Data, 15(5):1–46, May 2021.
- Selective inference for effect modification via the lasso. J. R. Stat. Soc. Series B Stat. Methodol., 84(2):382–413, April 2022.
- Estimating individualized treatment rules using outcome weighted learning. J. Am. Stat. Assoc., 107(449):1106–1118, September 2012.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.