Vanishing learning rate regime for stochastic FTRL
Determine the long-run behavior of stochastic follow-the-regularized-leader (S-FTRL) dynamics when run with a vanishing learning rate. In particular, ascertain whether the closedness–stochastic asymptotic stability equivalence persists, and whether the boundary-drift phenomenon in harmonic games fails, potentially yielding convergence to a constant Bregman-distance neighborhood of a strategic center.
Sponsor
References
Several important directions remain open in the general context of stochastic \ac{FTRL} dynamics. The first is what happens if the dynamics are run with a vanishing learning rate, as in . In this case, we conjecture that an analogue of \cref{thm:club} continues to hold, but \cref{thm:harmonic} fails: in games where the deterministic dynamics are recurrent (e.g., harmonic games), eq:FTRL-stoch will most likely result in the sequence of play converging to some constant (Bregman) distance from a ``central'' equilibrium of the game (the game's strategic center in the case of harmonic games).