Testing the identification of causal effects in observational data (2203.15890v4)

Published 29 Mar 2022 in econ.EM and stat.ME

Abstract: This study demonstrates the existence of a testable condition for the identification of the causal effect of a treatment on an outcome in observational data, which relies on two sets of variables: observed covariates to be controlled for and a suspected instrument. Under a causal structure commonly found in empirical applications, the testable conditional independence of the suspected instrument and the outcome given the treatment and the covariates has two implications. First, the instrument is valid, i.e., it does not directly affect the outcome (other than through the treatment) and is unconfounded conditional on the covariates. Second, the treatment is unconfounded conditional on the covariates such that the treatment effect is identified. We suggest tests of this conditional independence based on machine learning methods that account for covariates in a data-driven way and investigate their asymptotic behavior and finite sample performance in a simulation study. We also apply our testing approach to evaluating the impact of fertility on female labor supply when using the sibling sex ratio of the first two children as the supposed instrument, which by and large points to a violation of our testable implication for the moderate set of socio-economic covariates considered.
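
The testable condition in the abstract can be illustrated with a small simulation. The sketch below is not the paper's method: the paper proposes machine-learning-based tests, whereas this example swaps in a plain linear regression with a t-statistic, which is only adequate because the simulated data are linear by construction. The symbols (Y for outcome, D for treatment, X for covariate, Z for suspected instrument, U for an unobserved confounder), the function names, and the data-generating process are all assumptions made for illustration. The idea is that if D is unconfounded given X and Z is a valid instrument, then Z should carry no information about Y once D and X are controlled for; if U drives both D and Y, conditioning on D induces a dependence between Z and Y.

```python
import numpy as np


def simulate(n, confounding, rng):
    """Hypothetical linear data-generating process (an assumption, not from the paper).

    confounding = 0.0: U does not affect D, so D is unconfounded given X.
    confounding = 1.0: U affects both D and Y, so D is confounded.
    """
    x = rng.normal(size=n)                           # observed covariate
    z = rng.binomial(1, 0.5, size=n).astype(float)   # suspected instrument
    u = rng.normal(size=n)                           # unobserved confounder
    d = z + x + confounding * u + rng.normal(size=n) # treatment
    y = d + x + u + rng.normal(size=n)               # outcome; Z has no direct effect
    return y, d, x, z


def z_tstat(y, d, x, z):
    """t-statistic of Z in an OLS regression of Y on (1, D, X, Z)."""
    design = np.column_stack([np.ones_like(y), d, x, z])
    beta, *_ = np.linalg.lstsq(design, y, rcond=None)
    resid = y - design @ beta
    dof = len(y) - design.shape[1]
    sigma2 = resid @ resid / dof
    cov = sigma2 * np.linalg.inv(design.T @ design)
    return beta[3] / np.sqrt(cov[3, 3])


rng = np.random.default_rng(0)
t_unconfounded = z_tstat(*simulate(5000, 0.0, rng))
t_confounded = z_tstat(*simulate(5000, 1.0, rng))
print(f"t-stat of Z, unconfounded treatment: {t_unconfounded:.2f}")
print(f"t-stat of Z, confounded treatment:   {t_confounded:.2f}")
```

In this setup the t-statistic should be close to zero in the unconfounded case and far from zero in the confounded case. A linear test like this has no power against nonlinear dependence, which is one reason a flexible, data-driven approach to the covariates is attractive in practice.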
