Turn-budget allocation in multi-turn agentic reasoning

Determine the optimal allocation of interaction turn budgets between internal reasoning tokens and external tool calls for large language model agents performing multi-turn agentic reasoning.

Background

Agentic reasoning introduces a budget across multiple rounds of interaction, where LLM agents must balance internal inference and external tool invocations. Poor allocation can lead to overthinking or premature tool reliance.

The authors identify unresolved questions around this allocation, motivating empirical analyses of deliberative versus reactive tool-use strategies and their effects on accuracy and efficiency.

References

Open puzzles are unsolved regarding the allocation of turn budgets, the trade-off between response length and tool-call efficiency, and the impact of long-CoT predispositions on multi-turn reasoning.

— Demystifying Reinforcement Learning in Agentic Reasoning (2510.11701 - Yu et al., 13 Oct 2025) in Introduction, Reasoning Mode-wise paragraph (#1{3})

Turn-budget allocation in multi-turn agentic reasoning

Sponsor

Background

References

Related Problems