Generalization across graph topologies and scales
Determine whether the implicit in-weights reasoning and geometric memory observed in Transformer and Mamba models extend beyond path-star and tree-star graphs to other graph topologies and to graphs of different sizes.
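One way to make the question concrete is to generate graphs whose topology and size can be varied independently. The sketch below is only an illustration under stated assumptions: it assumes the common reading of a path-star graph as a central node with several disjoint path "arms", and it uses a grid as a hypothetical alternative topology that is not taken from the source. The function names (path_star, grid, summarize) are invented for this example and do not come from the paper.

```python
from collections import defaultdict


def path_star(num_arms, arm_length):
    """Edge list of a path-star graph (assumed convention): node 0 is the
    center, and each of the num_arms arms is a path of arm_length nodes."""
    edges = []
    next_id = 1
    for _ in range(num_arms):
        prev = 0  # every arm starts at the central node
        for _ in range(arm_length):
            edges.append((prev, next_id))
            prev = next_id
            next_id += 1
    return edges


def grid(rows, cols):
    """Edge list of a rows x cols grid graph. Hypothetical alternative
    topology for comparison; not one studied in the source."""
    edges = []
    for r in range(rows):
        for c in range(cols):
            v = r * cols + c
            if c + 1 < cols:
                edges.append((v, v + 1))      # edge to right neighbor
            if r + 1 < rows:
                edges.append((v, v + cols))   # edge to neighbor below
    return edges


def summarize(edges):
    """Basic size statistics, useful for matching graphs across topologies."""
    degree = defaultdict(int)
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    return {
        "nodes": len(degree),
        "edges": len(edges),
        "max_degree": max(degree.values()),
    }


if __name__ == "__main__":
    # Vary size within one topology, then switch topology at a similar size.
    for arms, length in [(3, 4), (5, 10)]:
        print("path_star", arms, length, summarize(path_star(arms, length)))
    print("grid", 3, 3, summarize(grid(3, 3)))
```

Sweeping num_arms and arm_length probes scale within one topology, while swapping in a different generator at a matched node or edge count probes topology. How graphs are serialized for the model and how generalization is scored are left open here, since the source does not specify them.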
References
It is unclear how well this generalizes to other topologies, and to graphs of other sizes.
— Deep sequence models tend to memorize geometrically; it is unclear why
(2510.26745 - Noroozizadeh et al., 30 Oct 2025) in Section: Limitations (bullet 1)