Manifestation of Confidence in LLM Reasoning
Determine how confidence is manifested within the reasoning process of large language models by identifying and characterizing the internal signals present during token generation that reflect confidence in intermediate steps and final answers.
References
While using confidence as a correctness proxy aligns with cognitive principles, a critical question remains: how is confidence actually manifested in the reasoning process of LLMs?
— Rewarding the Journey, Not Just the Destination: A Composite Path and Answer Self-Scoring Reward Mechanism for Test-Time Reinforcement Learning
(2510.17923 - Tang et al., 20 Oct 2025) in Section 1 (Introduction)