Doubly Robust Proximal Causal Learning for Continuous Treatments
Abstract: Proximal causal learning is a promising framework for identifying causal effects in the presence of unmeasured confounders. Within this framework, the doubly robust (DR) estimator has been derived and shown to be effective in estimation, especially when model assumptions are violated. However, the current form of the DR estimator is restricted to binary treatments, whereas treatments are continuous in many real-world applications. The primary obstacle to handling continuous treatments is the delta function in the original DR estimator, which makes causal effect estimation infeasible and imposes a heavy computational burden on nuisance function estimation. To address these challenges, we propose a kernel-based DR estimator that handles continuous treatments. Owing to its smoothness, we show that its oracle form is a consistent approximation of the influence function. Further, we propose a new approach to efficiently solve for the nuisance functions. We then provide a comprehensive convergence analysis in terms of the mean squared error. We demonstrate the utility of our estimator on synthetic datasets and real-world applications.
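To make the central idea concrete, the following is a minimal sketch of how a kernel weight can stand in for the delta function when the treatment is continuous. It is an illustration under stated assumptions, not the estimator as specified in the paper: the Gaussian kernel, the function names `gaussian_kernel` and `kernel_dr_estimate`, the callables `h_fn` (outcome bridge) and `q_fn` (treatment bridge), and the exact form of the averaged expression are all hypothetical choices made here. The bridge functions are assumed to have been estimated elsewhere.

```python
import numpy as np


def gaussian_kernel(u, bandwidth):
    """Gaussian kernel K_h(u) = K(u / h) / h, a smooth surrogate for a delta at 0."""
    u = np.asarray(u, dtype=float)
    return np.exp(-0.5 * (u / bandwidth) ** 2) / (bandwidth * np.sqrt(2.0 * np.pi))


def kernel_dr_estimate(a, A, Y, W, Z, h_fn, q_fn, bandwidth):
    """Kernel-smoothed doubly robust estimate of the mean outcome at treatment level a.

    Assumed form (hypothetical, for illustration only):
        mean( K_h(A - a) * q(a, Z) * (Y - h(a, W)) + h(a, W) )
    h_fn(a, W) and q_fn(a, Z) are user-supplied, pre-fitted bridge functions.
    """
    A = np.asarray(A, dtype=float)
    Y = np.asarray(Y, dtype=float)
    a_vec = np.full(A.shape, float(a))           # target treatment level, repeated per sample
    weights = gaussian_kernel(A - a, bandwidth)  # replaces the indicator / delta term
    h_a = h_fn(a_vec, W)                         # outcome-bridge prediction at level a
    q_a = q_fn(a_vec, Z)                         # treatment-bridge weight at level a
    return float(np.mean(weights * q_a * (Y - h_a) + h_a))


# Toy usage with synthetic data; the bridge functions below are placeholders,
# not solutions of any bridge equation.
rng = np.random.default_rng(0)
n = 500
W = rng.normal(size=n)
Z = rng.normal(size=n)
A = rng.uniform(0.0, 1.0, size=n)
Y = 2.0 * A + W + rng.normal(scale=0.1, size=n)


def h_toy(a, w):
    return 2.0 * a + w


def q_toy(a, z):
    return np.ones_like(a)


estimate = kernel_dr_estimate(0.5, A, Y, W, Z, h_toy, q_toy, bandwidth=0.1)
```

The bandwidth controls the usual trade-off: a smaller value tracks the delta function more closely but leaves fewer effectively weighted samples near `a`, which increases variance.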