Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
153 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Targeted Machine Learning for Average Causal Effect Estimation Using the Front-Door Functional (2312.10234v1)

Published 15 Dec 2023 in stat.ME and stat.ML

Abstract: Evaluating the average causal effect (ACE) of a treatment on an outcome often involves overcoming the challenges posed by confounding factors in observational studies. A traditional approach uses the back-door criterion, seeking adjustment sets to block confounding paths between treatment and outcome. However, this method struggles with unmeasured confounders. As an alternative, the front-door criterion offers a solution, even in the presence of unmeasured confounders between treatment and outcome. This method relies on identifying mediators that are not directly affected by these confounders and that completely mediate the treatment's effect. Here, we introduce novel estimation strategies for the front-door criterion based on the targeted minimum loss-based estimation theory. Our estimators work across diverse scenarios, handling binary, continuous, and multivariate mediators. They leverage data-adaptive machine learning algorithms, minimizing assumptions and ensuring key statistical properties like asymptotic linearity, double-robustness, efficiency, and valid estimates within the target parameter space. We establish conditions under which the nuisance functional estimations ensure the root n-consistency of ACE estimators. Our numerical experiments show the favorable finite sample performance of the proposed estimators. We demonstrate the applicability of these estimators to analyze the effect of early stage academic performance on future yearly income using data from the Finnish Social Science Data Archive.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (46)
  1. A. Balke and J. Pearl. Counterfactual probabilities: Computational methods, bounds and applications. In Proceedings of UAI-94, pages 46–54, 1994.
  2. The paper of how: Estimating treatment effects using the front-door criterion. Technical report, Working paper, 2019.
  3. D. Benkeser and M. Van Der Laan. The highly adaptive lasso estimator. In 2016 IEEE international conference on data science and advanced analytics (DSAA), pages 689–696. IEEE, 2016.
  4. R. Bhattacharya and R. Nabi. On testability of the front-door model via verma constraints. In Uncertainty in Artificial Intelligence, pages 202–212. PMLR, 2022.
  5. Semiparametric inference for causal effects in graphical models with hidden variables. Journal of Machine Learning Research, 23:1–76, 2022.
  6. Efficient and adaptive estimation for semiparametric models, volume 4. Johns Hopkins University Press Baltimore, 1993.
  7. Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal, 2017.
  8. Robust inference on population indirect causal effects: The generalized front-door criterion. Journal of the Royal Statistical Society, Series B, 2019.
  9. A. Glynn and K. Kashin. Front-door versus back-door adjustment with unmeasured confounding: Bias formulas for front-door and hybrid adjustments. In 71st Annual Conference of the Midwest Political Science Association, volume 3, 2013.
  10. A. N. Glynn and K. Kashin. Front-door versus back-door adjustment with unmeasured confounding: Bias formulas for front-door and hybrid adjustments with application to a job training program. Journal of the American Statistical Association, 113(523):1040–1049, 2018.
  11. J. Hahn. On the role of the propensity score in efficient semiparametric estimation of average treatment effects. Econometrica, pages 315–331, 1998.
  12. T. Hayfield and J. S. Racine. Nonparametric econometrics: The np package. Journal of statistical software, 27:1–32, 2008.
  13. Estimating causal effects from epidemiological data. Journal of Epidemiology & Community Health, 60(7):578–586, 2006.
  14. Efficient estimation of average treatment effects using the estimated propensity score. Econometrica, 71(4):1161–1189, 2003.
  15. Y. Huang and M. Valtorta. Pearl’s calculus of interventions is complete. In Twenty Second Conference On Uncertainty in Artificial Intelligence, 2006.
  16. K. Jorma. Life course 1971-2002 [dataset]. version 2.0, 2018. Finnish Social Science Data Archive [distributor]. http://urn.fi/urn:nbn:fi:fsd:T-FSD2076.
  17. A least-squares approach to direct importance estimation. The Journal of Machine Learning Research, 10:1391–1445, 2009.
  18. E. H. Kennedy. Semiparametric doubly robust targeted double machine learning: a review. arXiv preprint arXiv:2203.06469, 2022.
  19. C. F. Manski. Nonparametric bounds on treatment effects. The American Economic Review, 80(2):319–323, 1990.
  20. J. Neyman. Sur les applications de la thar des probabilities aux experiences agaricales: Essay des principle. excerpts reprinted (1990) in English. Statistical Science, 5:463–472, 1923.
  21. J. Pearl. Causal diagrams for empirical research. Biometrika, 82(4):669–688, 1995a.
  22. J. Pearl. Causal diagrams for empirical research. Biometrika, 82(4):669–709, 1995b. URL citeseer.ist.psu.edu/55450.html.
  23. J. Pearl. Causality: Models, Reasoning, and Inference. Cambridge University Press, 2 edition, 2009. ISBN 978-0521895606.
  24. Single world intervention graphs (SWIGs): A unification of the counterfactual and graphical approaches to causality. 2013.
  25. Nested markov properties for acyclic directed mixed graphs. arXiv preprint arXiv:1701.06686, 2017.
  26. J. M. Robins. A new approach to causal inference in mortality studies with sustained exposure periods – application to control of the healthy worker survivor effect. Mathematical Modeling, 7:1393–1512, 1986.
  27. Estimation of regression coefficients when some regressors are not always observed. Journal of the American Statistical Association, 89(427):846–866, 1994a.
  28. Estimation of regression coefficients when some regressors are not always observed. Journal of the American Statistical Association, 89:846–866, 1994b.
  29. Sensitivity analysis for selection bias and unmeasured confounding in missing data and causal inference models. In Statistical models in epidemiology, the environment, and clinical trials, pages 1–94. Springer, 2000.
  30. The central role of the propensity score in observational studies for causal effects. Biometrika, 70:41–55, 1983.
  31. D. B. Rubin. Estimating causal effects of treatments in randomized and non-randomized studies. Journal of Educational Psychology, 66:688–701, 1974.
  32. Semiparametric sensitivity analysis: Unmeasured confounding in observational studies. arXiv preprint arXiv:2104.08300, 2021.
  33. I. Shpitser and J. Pearl. Identification of joint interventional distributions in recursive semi-Markovian causal models. In Proceedings of the Twenty-First National Conference on Artificial Intelligence (AAAI-06). AAAI Press, Palo Alto, 2006.
  34. Direct importance estimation with model selection and its application to covariate shift adaptation. Advances in neural information processing systems, 20, 2007.
  35. Dimensionality reduction for density ratio estimation in high-dimensional spaces. Neural Networks, 23(1):44–59, 2010.
  36. J. Tian and J. Pearl. A general identification condition for causal effects. In Eighteenth National Conference on Artificial Intelligence, pages 567–573, 2002. ISBN 0-262-51129-0.
  37. A. Tsiatis. Semiparametric theory and missing data. Springer Science & Business Media, 2007.
  38. M. J. van der Laan and D. Rubin. Targeted maximum likelihood learning. The International Journal of Biostatistics, 2(1), 2006.
  39. Super learner. Statistical applications in genetics and molecular biology, 6(1), 2007.
  40. Targeted learning: causal inference for observational and experimental data, volume 4. Springer, 2011.
  41. A. van der Vaart and J. A. Wellner. Empirical processes. In Weak Convergence and Empirical Processes: With Applications to Statistics, pages 127–384. Springer, 2023.
  42. A. W. van der Vaart. Asymptotic Statistics, volume 3. Cambridge University Press, 2000.
  43. T. S. Verma and J. Pearl. Equivalence and synthesis of causal models. Technical Report R-150, Department of Computer Science, University of California, Los Angeles, 1990.
  44. Causal effects of intervening variables in settings with unmeasured confounding. arXiv preprint arXiv:2305.00349, 2023.
  45. Relative density-ratio estimation for robust distribution comparison. Neural computation, 25(5):1324–1370, 2013.
  46. Asymptotic theory for cross-validated targeted maximum likelihood estimation. 2010.
Citations (2)

Summary

We haven't generated a summary for this paper yet.