Arellano-Bond LASSO Estimator for Dynamic Linear Panel Models (2402.00584v4)
Abstract: The Arellano-Bond estimator is a fundamental method for dynamic panel data models, widely used in practice. However, the estimator is severely biased when the data's time series dimension $T$ is long due to the large degree of overidentification. We show that weak dependence along the panel's time series dimension naturally implies approximate sparsity of the most informative moment conditions, motivating the following approach to remove the bias: First, apply LASSO to the cross-section data at each time period to construct most informative (and cross-fitted) instruments, using lagged values of suitable covariates. This step relies on approximate sparsity to select the most informative instruments. Second, apply a linear instrumental variable estimator after first differencing the dynamic structural equation using the constructed instruments. Under weak time series dependence, we show the new estimator is consistent and asymptotically normal under much weaker conditions on $T$'s growth than the Arellano-Bond estimator. Our theory covers models with high dimensional covariates, including multiple lags of the dependent variable, common in modern applications. We illustrate our approach by applying it to weekly county-level panel data from the United States to study opening K-12 schools and other mitigation policies' short and long-term effects on COVID-19's spread.
- The time series and cross-section asymptotics of dynamic panel data estimators. Econometrica, 71(4):1121–1159.
- Estimation of the parameters of a single equation in a complete system of stochastic equations. The Annals of mathematical statistics, 20(1):46–63.
- Jackknife instrumental variables estimation. Journal of Applied Econometrics, 14(1):57–67.
- Split-sample instrumental variables estimates of the return to schooling. Journal of Business & Economic Statistics, 13(2):225–235.
- Arellano, M. (2003). Panel Data Econometrics. Oxford University Press.
- Some tests of specification for panel data: Monte carlo evidence and an application to employment equations. Review of Economic Studies, 58(2):277–297.
- Sparse models and methods for optimal instruments with an application to eminent domain. Econometrica, 80(6):2369–2429.
- Least squares after model selection in high-dimensional sparse models. Bernoulli, 19(2):521–547.
- High-dimensional econometrics and regularized GMM. arXiv preprint arXiv:1806.01888.
- Inference for high-dimensional sparse econometric models. arXiv preprint arXiv:1201.0220.
- Simultaneous analysis of lasso and Dantzig selector. The Annals of statistics, 37(4):1705–1732.
- Initial conditions and moment restrictions in dynamic panel data models. Journal of econometrics, 87(1):115–143.
- Bond, S. R. (2002). Dynamic panel data models: a guide to micro data methods and practice. Portuguese Economic Journal, 1(2):141–162.
- Burkholder, D. L. (1988). Sharp inequalities for martingales and stochastic integrals. Astérisque, (157-158):75–94.
- High dimensional generalized empirical likelihood for moment restrictions with dependent data. Journal of Econometrics, 185(1):283–304.
- High-dimensional empirical likelihood inference. Biometrika, 108(1):127–147.
- Inference of breakpoints in high-dimensional time series. Journal of the American Statistical Association, 117(540):1951–1963.
- Mastering panel metrics: causal impact of democracy on growth. AEA Papers and Proceedings, 109:77–82.
- Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal, 21(1):C1–C68.
- LASSO-driven inference in time and space. The Annals of Statistics, 49(3):1702–1735.
- The association of opening K-12 schools with the spread of COVID-19 in the United States: County-level panel data analysis. Proceedings of the National Academy of Sciences, 118(42):e2103420118.
- Split-panel jackknife estimation of fixed-effect models. The Review of Economic Studies, 82(3):991–1030.
- Dirksen, S. (2015). Tail bounds via generic chaining. Electronic Journal of Probability, 20:1–29.
- Choosing instrumental variables in conditional moment restriction models. Journal of Econometrics, 152(1):28–36.
- A central limit theorem for stationary random fields. Stochastic Processes and their Applications, 123(1):1–14.
- Uniform inference in high-dimensional dynamic panel data models with approximately sparse fixed effects. Econometric Theory, 35(2):295–359.
- Luo, Y. (2016). Selecting informative moments via LASSO. Unpublished manuscript.
- Higher order properties of GMM and generalized empirical likelihood estimators. Econometrica, 72(1):219–255.
- Generalized method of moments with many weak moment conditions. Econometrica, 77(3):687–719.
- Selecting instrumental variables in a data rich environment. Journal of Time Series Econometrics, 1(1).
- The bias of instrumental variable estimators of simultaneous equation systems. International Economic Review, pages 219–228.
- Rio, E. (2009). Moment inequalities for sums of dependent random variables under projective conditions. Journal of Theoretical Probability, 22(1):146–163.
- Reconstruction from anisotropic random measurements. In Proceedings of the 25th Annual Conference on Learning Theory, Proceedings of Machine Learning Research, pages 10.1–10.24.
- Sambale, H. (2020). Some notes on concentration for alpha-subexponential random variables. arXiv preprint arXiv:2002.10761.
- Estimation and inference on heterogeneous treatment effects in high-dimensional dynamic panels. Quantitative Economics, 14(2):471–510.
- Shi, Z. (2016). Econometric estimation with high-dimensional moment equalities. Journal of Econometrics, 195(1):104–119.
- Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Methodological), 58(1):267–288.
- Wu, W. B. (2005). Nonlinear system theory: Another look at dependence. In Proceedings of the National Academy of Sciences of the United States of America, volume 102, pages 14150–14154. National Acad Sciences.
- Gaussian approximation for high dimensional time series. The Annals of Statistics, 45(5):1895–1919.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.