Explicit form of the potential Φ for stationary SGD on the unit sphere
Determine the explicit functional form of the potential Φ over unit weight directions ȳ that defines the stationary Gibbs distribution ρ(ȳ) ∝ exp(−Φ(ȳ)/T0) for stochastic gradient descent (SGD) with weight decay in scale-invariant neural networks, in the case where the gradient-noise covariance Σ(ȳ) is anisotropic and spatially dependent. An explicit Φ would allow predictions beyond tests V1 and V3 to be verified directly.
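To illustrate why Φ need not coincide with the training loss once the noise depends on position, the following is a minimal one-dimensional sketch, not the setting of the paper. It assumes a flat coordinate x rather than the unit sphere, an Itô diffusion dx = −U'(x) dt + sqrt(2 D(x)) dW, and a zero-flux stationary state, for which the density is ρ(x) ∝ D(x)^(−1) exp(−∫ U'(s)/D(s) ds). The quadratic U, the tanh-shaped D(x), and all function names are hypothetical choices made only for illustration. In more than one dimension with anisotropic Σ, the stationary state can carry probability currents and no such closed form is available in general.

```python
import numpy as np

def trapezoid(y, x):
    """Trapezoid-rule integral of samples y over grid x."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))

def stationary_density_1d(grad_U, D, x):
    """Zero-flux stationary density of dx = -U'(x) dt + sqrt(2 D(x)) dW (Ito).

    Uses rho(x) ∝ exp(-∫ U'/D dx) / D(x), valid in 1D under a zero-flux assumption.
    """
    g = grad_U(x) / D(x)
    steps = 0.5 * (g[1:] + g[:-1]) * np.diff(x)           # trapezoid increments
    integral = np.concatenate(([0.0], np.cumsum(steps)))  # running integral of U'/D
    integral -= integral.min()                            # shift for numerical stability
    rho = np.exp(-integral) / D(x)
    return rho / trapezoid(rho, x)

T0 = 0.5                                  # hypothetical temperature
x = np.linspace(-3.0, 3.0, 2001)
grad_U = lambda x: x                      # toy loss U(x) = x^2 / 2

D_const = lambda x: np.full_like(x, T0)                # isotropic, constant noise
D_var = lambda x: T0 * (1.0 + 0.5 * np.tanh(x))        # position-dependent noise

rho_const = stationary_density_1d(grad_U, D_const, x)
rho_var = stationary_density_1d(grad_U, D_var, x)

# Reference Gibbs density exp(-U / T0) built from the loss alone.
gibbs = np.exp(-0.5 * x**2 / T0)
gibbs /= trapezoid(gibbs, x)

# Effective potential (up to an additive constant) implied by the variable-noise density.
phi_var = -T0 * np.log(rho_var)
```

With constant D the result reproduces exp(−U/T0), so Φ = U. With position-dependent D the implied potential `phi_var` picks up both a rescaled drift integral and a ln D term, which is the one-dimensional analogue of the difficulty described above.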
References
However, since Φ is unknown, we can only directly verify V1 and V3.
— Can Training Dynamics of Scale-Invariant Neural Networks Be Explained by the Thermodynamics of an Ideal Gas?
(2511.07308 - Sadrtdinov et al., 10 Nov 2025) in Section 6.1 (Generalizing isotropic noise model)