Explaining why reinforcement learning induces verification but not meta-cognitive monitoring
Explain the mechanisms by which reinforcement learning (RL) training induces verification behaviors in large language models but fails to produce meta-cognitive monitoring or representational restructuring.
References
We cannot explain why RL produces verification but not meta-cognitive monitoring, or whether architectural prerequisites exist for specific behaviors versus emerging from scale \citep{le2025reasoning, das2025can}.
— Cognitive Foundations for Reasoning and Their Manifestation in LLMs
(2511.16660 - Kargupta et al., 20 Nov 2025) in Section: Opportunities and Challenges — Predicting cognitive capabilities from training procedure