Agent models: Internalizing Chain-of-Action Generation into Reasoning models (2503.06580v1)

Published 9 Mar 2025 in cs.AI

Abstract: Traditional agentic workflows rely on external prompts to manage interactions with tools and the environment, which limits the autonomy of reasoning models. We position Large Agent Models (LAMs) that internalize the generation of Chain-of-Action (CoA), enabling the model to autonomously decide when and how to use external tools. Our proposed AutoCoA framework combines supervised fine-tuning (SFT) and reinforcement learning (RL), allowing the model to seamlessly switch between reasoning and action while efficiently managing environment interactions. Main components include step-level action triggering, trajectory-level CoA optimization, and an internal world model to reduce real-environment interaction costs. Evaluations on open-domain QA tasks demonstrate that AutoCoA-trained agent models significantly outperform ReAct-based workflows in task completion, especially in tasks that require long-term reasoning and multi-step actions. Code and dataset are available at https://github.com/ADaM-BJTU/AutoCoA
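To make the idea of interleaving reasoning and action concrete, here is a minimal toy sketch of a loop in which the policy itself decides at each step whether to think, call a tool, or answer, rather than following an externally prompted template. It is not the AutoCoA implementation: the step tags, `toy_policy`, `toy_search`, and `run_chain_of_action` are all hypothetical names invented for illustration, and the "model" is a hand-written stub with no SFT or RL training.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical step kinds; illustrative only, not AutoCoA's actual tokens.
THINK, ACT, OBSERVATION, ANSWER = "think", "act", "observation", "answer"


@dataclass
class Trajectory:
    """A question plus the ordered (kind, content) steps produced so far."""
    question: str
    steps: List[Tuple[str, str]] = field(default_factory=list)


def toy_policy(traj: Trajectory) -> Tuple[str, str]:
    """Stub standing in for the model: it chooses the next step kind itself."""
    kinds = [kind for kind, _ in traj.steps]
    if OBSERVATION in kinds:
        # A tool result is available, so answer with the latest observation.
        last_obs = [c for k, c in traj.steps if k == OBSERVATION][-1]
        return ANSWER, last_obs
    if THINK in kinds:
        # Already reasoned once; trigger an action (a search on the question).
        return ACT, traj.question
    return THINK, "I am not sure of the answer; I should look it up."


def toy_search(query: str) -> str:
    """Stub tool backed by a tiny in-memory lookup table."""
    knowledge = {"capital of France": "Paris"}
    return knowledge.get(query, "no result")


def run_chain_of_action(
    question: str,
    policy: Callable[[Trajectory], Tuple[str, str]],
    tools: Dict[str, Callable[[str], str]],
    max_steps: int = 8,
) -> Trajectory:
    """Alternate policy steps and tool calls until an answer or the step limit."""
    traj = Trajectory(question)
    for _ in range(max_steps):
        kind, content = policy(traj)
        traj.steps.append((kind, content))
        if kind == ACT:
            # The environment responds to the action; record the observation.
            traj.steps.append((OBSERVATION, tools["search"](content)))
        elif kind == ANSWER:
            break
    return traj


traj = run_chain_of_action("capital of France", toy_policy, {"search": toy_search})
for kind, content in traj.steps:
    print(f"{kind}: {content}")
```

In this sketch the decision to act lives inside `policy`, loosely mirroring the abstract's description of step-level action triggering; in a prompt-driven workflow that decision would instead be imposed by the surrounding scaffold.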

GitHub

GitHub - ADaM-BJTU/AutoCoA (11 stars)

Tweets

https://twitter.com/GptMaestro/status/1902267372003824080
