Propagation of Template Collapse in Multi‑Agent Reinforcement Learning
Determine how template collapse—defined as the degradation where reasoning becomes input‑agnostic across inputs while within‑input diversity remains high—propagates in multi‑agent reinforcement learning settings, identifying the mechanisms and dynamics by which this phenomenon emerges and spreads among interacting agents.
References
All experiments are single-agent; how template collapse propagates in multi-agent RL remains open.
— RAGEN-2: Reasoning Collapse in Agentic RL
(2604.06268 - Wang et al., 7 Apr 2026) in Conclusions and Limitations