Existence conditions for mobility–potential factorization in non-SGD learning dynamics
Determine necessary and sufficient conditions under which a non-conservative update force field F(W) in the parameter space of a non–stochastic-gradient-descent learning algorithm admits a factorization F(W) = μ(W) ∇U(W), where μ(W) is a symmetric positive-definite mobility matrix and U(W) is a scalar potential, so that the stationary-distribution analysis analogous to SGD applies.
References
The existence conditions of such factorization for any given force might be an open question, but even in the absence of such factorization, the only difference occurs in the Hessian matrix.
— On Networks and their Applications: Stability of Gene Regulatory Networks and Gene Function Prediction using Autoencoders
(2408.07064 - Coban, 13 Aug 2024) in Additional Discussion, Dynamics of Non-SGD Based Learning Algorithms, The Stationary Distribution of General Learning