Understanding training dynamics of deep neural networks
Establish a rigorous, general theory explaining the training dynamics of deep neural networks, characterizing how optimization processes evolve and under what conditions they converge or reach stationary behavior, in order to clarify the mechanisms that govern empirical performance and guide principled choices of training hyperparameters.
References
Understanding the training dynamics of deep neural networks remains a major open problem, with physics-inspired approaches offering promising insights.
— Can Training Dynamics of Scale-Invariant Neural Networks Be Explained by the Thermodynamics of an Ideal Gas?
(2511.07308 - Sadrtdinov et al., 10 Nov 2025) in Abstract (page 1)