Finite-horizon impact of Adam’s bias correction
Characterize the finite-time effects of the bias-correction mechanism in Adam-type methods—specifically the time-dependent coefficients c_a(t) and c_b(t) in the continuous-time SDE system (eq:cts-x)–(eq:cts-y)—on optimization dynamics, including their influence on convergence behavior and stability, and derive nonasymptotic bounds that quantify this role over finite horizons.
References
Nevertheless, important open questions remain, including the role of bias correction at finite horizons, convergence rates beyond convex or Polyak-Lojasiewicz regimes, robustness under heavy-tailed or state-dependent gradient noise, the structure of invariant measures induced by coordinatewise preconditioning, and metastability near saddle points in high dimensions.