Implicit bias of optimization in general neural network architectures
Characterize the implicit bias induced by gradient-based optimization methods (such as gradient descent, stochastic gradient descent, and Adam) on general neural network architectures, determining which solutions are selected among multiple empirical risk minimizers.
Sponsor
References
The algorithm biases induced by the optimization methods, known as implicit bias, on general neural network architectures are still largely unknown.
— Geometry of Polynomial Neural Networks
(2402.00949 - Kubjas et al., 1 Feb 2024) in Section 7 (Optimization), opening paragraphs