Optimally weighted average derivative effects (2308.05456v2)
Abstract: Weighted average derivative effects (WADEs) are nonparametric estimands with uses in economics and causal inference. Debiased WADE estimators typically require learning the conditional mean outcome as well as a Riesz representer (RR) that characterises the requisite debiasing corrections. RR estimators for WADEs often rely on kernel estimators, introducing complicated bandwidth-dependent biases. In this work we propose a new class of RRs that are isomorphic to the class of WADEs, and we derive the WADE weight that is optimal, in the sense of having minimum nonparametric efficiency bound. Our optimal WADE estimators require estimating conditional expectations only (e.g. using machine learning), thus overcoming the limitations of kernel estimators. Moreover, we connect our optimal WADE to projection parameters in partially linear models. We ascribe a causal interpretation to WADE and projection parameters in terms of so-called incremental effects. We propose efficient estimators for two WADE estimands in our class, which we evaluate in a numerical experiment and use to determine the effect of Warfarin dose on blood clotting function.
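To make the phrase "estimating conditional expectations only" concrete, the following is a minimal sketch of a cross-fitted partialling-out estimator of the least-squares projection parameter E[Cov(A, Y | X)] / E[Var(A | X)] in a partially linear model. It is an illustration under stated assumptions, not the authors' implementation: the function names `fit_poly` and `plm_projection`, the cubic polynomial learner (a stand-in for any machine learning regressor), the two-fold split, and the simulated data are all hypothetical choices made here.

```python
import numpy as np

def fit_poly(x, y, degree=3):
    """Least-squares polynomial fit, standing in for an arbitrary ML regressor."""
    coefs = np.polynomial.polynomial.polyfit(x, y, degree)
    return lambda x_new: np.polynomial.polynomial.polyval(x_new, coefs)

def plm_projection(y, a, x, n_folds=2, seed=0):
    """Cross-fitted estimate of E[Cov(A,Y|X)] / E[Var(A|X)] with a plug-in SE."""
    rng = np.random.default_rng(seed)
    folds = rng.permutation(len(y)) % n_folds  # balanced random fold labels
    res_a = np.empty(len(y))
    res_y = np.empty(len(y))
    for k in range(n_folds):
        train, test = folds != k, folds == k
        # Residualise exposure and outcome on covariates, out of fold.
        res_a[test] = a[test] - fit_poly(x[train], a[train])(x[test])
        res_y[test] = y[test] - fit_poly(x[train], y[train])(x[test])
    psi = np.sum(res_a * res_y) / np.sum(res_a ** 2)
    # Standard error from the estimated influence function of the ratio.
    infl = res_a * (res_y - psi * res_a) / np.mean(res_a ** 2)
    se = np.std(infl, ddof=1) / np.sqrt(len(y))
    return psi, se

# Simulated data (an assumption for illustration): dE[Y|A,X]/dA = 2 everywhere.
rng = np.random.default_rng(1)
n = 2000
x = rng.uniform(-1, 1, n)
a = x + rng.normal(size=n)
y = 2.0 * a + np.sin(2 * x) + rng.normal(size=n)
psi_hat, se_hat = plm_projection(y, a, x)
print(f"estimate: {psi_hat:.3f}, standard error: {se_hat:.3f}")
```

In this simulation the derivative of the conditional mean with respect to the exposure is constant, so any derivative average whose weights average to one targets the same value, 2. The sketch only involves two regressions (of the exposure and of the outcome on covariates), with no kernel smoothing or bandwidth choice.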