Tight bounds for maximum $\ell_1$-margin classifiers (2212.03783v2)

Published 7 Dec 2022 in stat.ML and cs.LG

Abstract: Popular iterative algorithms such as boosting methods and coordinate descent on linear models converge to the maximum $\ell_1$-margin classifier, a.k.a. sparse hard-margin SVM, in high dimensional regimes where the data is linearly separable. Previous works consistently show that many estimators relying on the $\ell_1$-norm achieve improved statistical rates for hard sparse ground truths. We show that surprisingly, this adaptivity does not apply to the maximum $\ell_1$-margin classifier for a standard discriminative setting. In particular, for the noiseless setting, we prove tight upper and lower bounds for the prediction error that match existing rates of order $\frac{|w^{*|_1^{{2/3}}{n^{1/3}}$}} for general ground truths. To complete the picture, we show that when interpolating noisy observations, the error vanishes at a rate of order $\frac{1}{\sqrt{\log(d/n)}}$. We are therefore first to show benign overfitting for the maximum $\ell_1$-margin classifier.

Authors (3)

Stefan Stojanovic (5 papers)
Konstantin Donhauser (17 papers)
Fanny Yang (38 papers)

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tight bounds for maximum $\ell_1$-margin classifiers (2212.03783v2)

Summary

Related Papers