Markovianity of saddle‑to‑saddle transitions

Ascertain whether the sequence of saddle points visited during gradient flow training in networks defined by Equation (1) is Markovian—i.e., whether the next saddle can be inferred solely from the current one independent of earlier saddles.

Background

Understanding whether saddle visitation is Markovian would clarify the memory dependence of stage-like learning and facilitate predictive models of progression through the hierarchy of saddles.

The authors note that certain heteroclinic networks are known to be non-Markovian in dynamical systems, but the status in neural training dynamics remains unresolved.

References

Several interesting technical questions remain open. Second, is the sequence of saddles visited during training Markovian? That is, can the next saddle be inferred solely from the current one, independent of earlier ones?

Saddle-to-Saddle Dynamics Explains A Simplicity Bias Across Neural Network Architectures (2512.20607 - Zhang et al., 23 Dec 2025) in Appendix C — Technical future directions