Distance‑to‑manifold criterion for feature learning strength
Establish whether, in architectures that exhibit saddle-to-saddle dynamics, the Euclidean distance from the initial weights to invariant manifolds associated with low effective width determines the strength of feature learning, including the prominence of plateaus and the trajectory’s proximity to saddles.
Sponsor
References
In architectures that have saddle-to-saddle dynamics, we conjecture that the distance from the initial weights to invariant manifolds associated with low effective width determines the strength of feature learning.
— Saddle-to-Saddle Dynamics Explains A Simplicity Bias Across Neural Network Architectures
(2512.20607 - Zhang et al., 23 Dec 2025) in Section 6 — Effect of initialization scale