Turn-budget allocation in multi-turn agentic reasoning
Determine the optimal allocation of interaction turn budgets between internal reasoning tokens and external tool calls for large language model agents performing multi-turn agentic reasoning.
References
Open questions remain regarding the allocation of turn budgets, the trade-off between response length and tool-call efficiency, and the impact of long-CoT predispositions on multi-turn reasoning.
— Demystifying Reinforcement Learning in Agentic Reasoning
(2510.11701 - Yu et al., 13 Oct 2025) in Introduction, Reasoning Mode-wise paragraph