Purpose of non-reasoning tokens in chain-of-thought traces
Determine whether tokens not directly related to reasoning (such as grammatical or stylistic tokens) in chain-of-thought outputs serve any function beyond enabling additional forward passes before the model produces a final answer.
References
Two key questions remain open: whether this is the most efficient approach, and whether non-reasoning tokens serve any purpose beyond providing additional forward passes for the model to “ponder” before producing an answer.
— Tiny Recursive Reasoning with Mamba-2 Attention Hybrid
(2602.12078 - Wang et al., 12 Feb 2026) in Section 1 (Introduction)