Action Volume as an Escalation Indicator in LLM‑Based Wargames
Ascertain whether increases in the number of actions taken over time by large language model agents in simulated wargames constitute a reliable indicator of escalation by quantitatively characterizing the relationship, if any, between action volume and escalation severity in LLM‑driven scenarios.
References
In previous, human-based wargames, more actions over time were an additional indicator of escalation in wargames. Given our results, we can neither confirm nor reject this notion in LLM-based wargames.
                — Escalation Risks from Language Models in Military and Diplomatic Decision-Making
                
                (2401.03408 - Rivera et al., 7 Jan 2024) in Appendix, Total Action Counts Over Time figure caption