Stability and L2-penalty in Model Averaging (2311.13827v1)
Abstract: Model averaging, which integrates available information by averaging over potential models, has received much attention in the past two decades. Although various model averaging methods have been developed, there is little literature on the theoretical properties of model averaging from the perspective of stability, and most of these methods constrain the model weights to a simplex. The aim of this paper is to introduce stability from statistical learning theory into model averaging. To this end, we define the stability, asymptotic empirical risk minimizer, generalization, and consistency of model averaging and study the relationships among them. Our results indicate that stability can ensure that model averaging has good generalization performance and consistency under reasonable conditions, where consistency means that the model averaging estimator asymptotically minimizes the mean squared prediction error. We also propose an L2-penalty model averaging method that does not restrict the model weights, and we prove that it is stable and consistent. To reduce the impact of tuning parameter selection, we use 10-fold cross-validation to select a candidate set of tuning parameters and take a weighted average of the resulting model weight estimators based on their estimation errors. A Monte Carlo simulation and an illustrative application demonstrate the usefulness of the proposed method.
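To make the idea of L2-penalized, unconstrained model weights concrete, the following is a minimal sketch and not the paper's exact criterion. It assumes that the candidate models are linear regressions on nested column subsets, that the weights are chosen by ridge-type penalized least squares on the in-sample fitted values, and that the tuning parameter `lam` is fixed. The function names, the simulated data, and the value of `lam` are all hypothetical. The cross-validation step and the weighted average over tuning parameters described in the abstract are not shown.

```python
import numpy as np

def candidate_predictions(X, y, subsets):
    """Fit each candidate model by OLS on its column subset.

    Returns the n x M matrix whose m-th column holds the fitted values
    of the m-th candidate model (in-sample fits: an assumption of this sketch).
    """
    cols = []
    for s in subsets:
        Xs = X[:, s]
        beta, *_ = np.linalg.lstsq(Xs, y, rcond=None)
        cols.append(Xs @ beta)
    return np.column_stack(cols)

def l2_penalized_weights(P, y, lam):
    """Weights minimizing ||y - P w||^2 + lam * ||w||^2.

    No simplex constraint is imposed: the weights may be negative
    and need not sum to one.
    """
    M = P.shape[1]
    return np.linalg.solve(P.T @ P + lam * np.eye(M), P.T @ y)

# Simulated data (hypothetical, for illustration only).
rng = np.random.default_rng(0)
n = 200
X = rng.normal(size=(n, 4))
y = X @ np.array([1.0, 0.5, 0.25, 0.0]) + rng.normal(size=n)

# Four nested candidate models.
subsets = [[0], [0, 1], [0, 1, 2], [0, 1, 2, 3]]
P = candidate_predictions(X, y, subsets)

w = l2_penalized_weights(P, y, lam=1.0)  # lam chosen arbitrarily here
y_avg = P @ w                            # model averaging prediction
```

Because the fitted values of nested models are strongly correlated, the penalty term keeps the linear system well conditioned even without restricting the weights to a simplex, which is one informal way to read the stabilizing role of the L2 penalty.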