Dice Question Streamline Icon: https://streamlinehq.com

Sharpness of error bounds for cross-validated Lasso

Ascertain whether the established error bounds for the full-sample Lasso estimator with V-fold cross-validation selection of the penalty parameter, namely ||β̂^{FVCV} − β||_2 = O_P(√(s (log p)^2 / n)) and ||β̂^{FVCV} − β||_1 = O_P(√(s^2 (log p)^5 / n)), are sharp.

Information Square Streamline Icon: https://streamlinehq.com

Background

The paper discusses V-fold cross-validation for selecting the Lasso penalty parameter, noting that while average estimators admit oracle inequalities, analyzing the full-sample estimator is more intricate. Using a relationship between full-sample and subsample solutions plus degrees-of-freedom arguments, current theory yields bounds for the full-sample Lasso that include additional log p factors compared to the standard high-dimensional rates.

The authors highlight that these bounds may be suboptimal relative to bootstrap- or deviation-based tuning rules, and explicitly state that it is uncertain whether the cross-validation-based bounds are tight, leaving open the question of their sharpness.

References

These bounds, however, are by some \log p factors worse than those in eq: lasso rate of convergence, which are guaranteed for the Lasso estimator with the penalty parameter selected either by the BCCH method or by the bootstrap method. It is therefore not clear whether the bounds eq: clc bounds are sharp.

eq: lasso rate of convergence:

β^β2=OP(slogpn)andβ^β1=OP(s2logpn).\|\widehat \beta - \beta\|_{2} = O_P\left(\sqrt{\frac{s\log p}{n}}\right)\quad\text{and}\quad \|\widehat \beta - \beta\|_{1} = O_P\left(\sqrt{\frac{s^2\log p}{n}}\right).

eq: clc bounds:

β^β2=OP(s(logp)2n)andβ^β1=OP(s2(logp)5n).\|\widehat \beta - \beta\|_{2} = O_P\left(\sqrt{\frac{s(\log p)^2}{n}}\right)\quad\text{and}\quad \|\widehat \beta - \beta\|_{1} = O_P\left(\sqrt{\frac{s^2(\log p)^5}{n}}\right).

Tuning parameter selection in econometrics (2405.03021 - Chetverikov, 5 May 2024) in Section 3.5 (Selection via Cross-Validation)