Training Strategy for a Central Coordination Unit in LLM-Based Agents
Determine an effective training methodology for a central coordination unit that manages interactions among heterogeneous components in LLM-based agents, including whether to optimize the coordination unit end-to-end jointly with the rest of the agent or to train it using supervised labels, reinforcement learning signals, or meta-learning strategies.
References
With this, determining how to train this coordination unit remains an open question -- should it be optimized end-to-end with the rest of the agent? Should it use supervised labels, reinforcement signals, or meta-learning strategies?
— Generalizability of Large Language Model-Based Agents: A Comprehensive Survey
(2509.16330 - Zhang et al., 19 Sep 2025) in Section 2, Subsubsection "Limitations in LLM-Based Agent Architectures and Future Directions"