Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
184 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Double Machine Learning for Static Panel Models with Fixed Effects (2312.08174v5)

Published 13 Dec 2023 in econ.EM, cs.LG, and stat.ML

Abstract: Recent advances in causal inference have seen the development of methods which make use of the predictive power of machine learning algorithms. In this paper, we develop novel double machine learning (DML) procedures for panel data in which these algorithms are used to approximate high-dimensional and nonlinear nuisance functions of the covariates. Our new procedures are extensions of the well-known correlated random effects, within-group and first-difference estimators from linear to nonlinear panel models, specifically, Robinson (1988)'s partially linear regression model with fixed effects and unspecified nonlinear confounding. Our simulation study assesses the performance of these procedures using different machine learning algorithms. We use our procedures to re-estimate the impact of minimum wage on voting behaviour in the UK. From our results, we recommend the use of first-differencing because it imposes the fewest constraints on the distribution of the fixed effects, and an ensemble learning strategy to ensure optimum estimator accuracy.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (36)
  1. Recursive partitioning for heterogeneous causal effects. Proceedings of the National Academy of Sciences, 113(27):7353–7360.
  2. Generalized random forests. The Annals of Statistics, 47(2):1148 – 1178.
  3. The value added of machine learning to causal inference: Evidence from revisited studies. arXiv preprint arXiv:2101.00878.
  4. Inference in high-dimensional panel models with an application to gun control. Journal of Business & Economic Statistics, 34(4):590–605.
  5. Random search for hyper-parameter optimization. Journal of Machine Learning Research, 13(2).
  6. Microeconometrics: Methods and Applications. Cambridge University Press.
  7. Seeing beyond the trees: Using machine learning to estimate the impact of minimum wages on labor market outcomes. Journal of Labor Economics, 40(S1):S203–S247.
  8. Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal, 21(1):C1–C68.
  9. Automatic debiased machine learning of causal and structural effects. Econometrica, 90(3):967–1027.
  10. Using causal forests to predict treatment heterogeneity: An application to summer jobs. American Economic Review, 107(5):546–550.
  11. Di Francesco, R. (2022). Aggregation trees. CEIS Working Paper.
  12. Di Francesco, R. (2023). Ordered correlation forest. arXiv preprint arXiv:2309.08755.
  13. Minimum wage and tolerance for high incomes. European Economic Review, 155:104445.
  14. The elements of statistical learning: data mining, inference, and prediction, volume 2. Springer.
  15. Effect or treatment heterogeneity? policy evaluation with aggregated and disaggregated treatments. arXiv preprint arXiv:2110.01427.
  16. Estimating continuous treatment effects in panel data using machine learning with an agricultural application. arXiv preprint arXiv:2207.08789.
  17. Knaus, M. C. (2022). Double machine learning-based programme evaluation under unconfoundedness. The Econometrics Journal, 25(3):602–627.
  18. Heterogeneous employment effects of job search programs: A machine learning approach. Journal of Human Resources, 57(2):597–636.
  19. Metalearners for estimating heterogeneous treatment effects using machine learning. Proceedings of the national academy of sciences, 116(10):4156–4165.
  20. Lechner, M. (2019). Modified causal forests for estimating heterogeneous causal effects. arXiv preprint arXiv:1812.09487.
  21. Modified causal forest. arXiv preprint arXiv:2209.03744.
  22. Random forest estimation of the ordered choice model. arXiv preprint arXiv:1907.02436.
  23. Hyperparameter tuning and model evaluation in causal effect estimation. arXiv preprint arXiv:2303.01412.
  24. Mundlak, Y. (1978). On the pooling of time series and cross section data. Econometrica, pages 69–85.
  25. Qusi-oracle estimation of heterogeneous treatment effects. Biometrika, pages 299–319.
  26. Robinson, P. M. (1988). Root-n-consistent semiparametric regression. Econometrica: Journal of the Econometric Society, pages 931–954.
  27. All models are wrong, but which are useful? comparing parametric and nonparametric estimation of causal effects in finite samples. Journal of Causal Inference, 11(1):20230022.
  28. Re-em trees: a data mining approach for longitudinal and clustered data. Machine learning, 86:169–207.
  29. Inference on heterogeneous treatment effects in high-dimensional dynamic panels under weak dependence. Quantitative Economics, 14(2):471–510.
  30. Using machine learning to identify heterogeneous impacts of agri-environment schemes in the eu: a case study. European Review of Agricultural Economics, 49(4):723–759.
  31. Strittmatter, A. (2023). What is the value added by using causal machine learning methods in a welfare experiment evaluation? Labour Economics, 84:102412.
  32. University of Essex, Institute for Social and Economic Research (2018). British Household Panel Survey: Waves 1-18, 1991-2009. [data collection]. 8th Edition. UK Data Service. SN: 5151, DOI: http://doi.org/10.5255/UKDA-SN-5151-2.
  33. Ensembles of learning machines. In Neural Nets: 13th Italian Workshop on Neural Nets, WIRN VIETRI 2002 Vietri sul Mare, Italy, May 30–June 1, 2002 Revised Papers 13, pages 3–20. Springer.
  34. Estimation and inference of heterogeneous treatment effects using random forests. Journal of the American Statistical Association, 113(523):1228–1242.
  35. Adaptive concentration of regression trees, with application to random forests. arXiv preprint arXiv:1503.06388.
  36. Wooldridge, J. M. (2010). Econometric analysis of cross section and panel data. MIT press.
Citations (1)

Summary

We haven't generated a summary for this paper yet.