2000 character limit reached
Robustly Learning a Single Neuron via Sharpness (2306.07892v1)
Published 13 Jun 2023 in cs.LG, cs.DS, math.OC, math.ST, stat.ML, and stat.TH
Abstract: We study the problem of learning a single neuron with respect to the $L_22$-loss in the presence of adversarial label noise. We give an efficient algorithm that, for a broad family of activations including ReLUs, approximates the optimal $L_22$-error within a constant factor. Our algorithm applies under much milder distributional assumptions compared to prior work. The key ingredient enabling our results is a novel connection to local error bounds from optimization theory.