2000 character limit reached
Stochastic Modified Flows, Mean-Field Limits and Dynamics of Stochastic Gradient Descent (2302.07125v1)
Published 14 Feb 2023 in math.PR, cs.LG, math.AP, and stat.ML
Abstract: We propose new limiting dynamics for stochastic gradient descent in the small learning rate regime called stochastic modified flows. These SDEs are driven by a cylindrical Brownian motion and improve the so-called stochastic modified equations by having regular diffusion coefficients and by matching the multi-point statistics. As a second contribution, we introduce distribution dependent stochastic modified flows which we prove to describe the fluctuating limiting dynamics of stochastic gradient descent in the small learning rate - infinite width scaling regime.