Generalization of structural error fingerprints across architectures and scales
Determine whether the structural fingerprints of reasoning errors identified by Circuit-based Reasoning Verification on attribution graphs generalize to different model architectures, such as Mixture-of-Experts Transformers, and to substantially larger model scales (e.g., 70B parameters and above).
References
Whether the precise structural fingerprints we identified generalize to different architectural paradigms, such as Mixture-of-Experts, or across significant model scales (e.g., 70B and larger) remains an open question.
Source: Verifying Chain-of-Thought Reasoning via Its Computational Graph (arXiv:2510.09312, Zhao et al., 10 Oct 2025), Limitations, subsection "Generalizability of Error Signatures"