War and Peace (WarAgent): Large Language Model-based Multi-Agent Simulation of World Wars (2311.17227v2)
Abstract: Can we avoid wars at the crossroads of history? This question has been pursued by individuals, scholars, policymakers, and organizations throughout human history. In this research, we attempt to answer the question based on the recent advances of AI and LLMs. We propose \textbf{WarAgent}, an LLM-powered multi-agent AI system, to simulate the participating countries, their decisions, and the consequences, in historical international conflicts, including the World War I (WWI), the World War II (WWII), and the Warring States Period (WSP) in Ancient China. By evaluating the simulation effectiveness, we examine the advancements and limitations of cutting-edge AI systems' abilities in studying complex collective human behaviors such as international conflicts under diverse settings. In these simulations, the emergent interactions among agents also offer a novel perspective for examining the triggers and conditions that lead to war. Our findings offer data-driven and AI-augmented insights that can redefine how we approach conflict resolution and peacekeeping strategies. The implications stretch beyond historical analysis, offering a blueprint for using AI to understand human history and possibly prevent future international conflicts. Code and data are available at \url{https://github.com/agiresearch/WarAgent}.
- Robert B Smith. Presidential decision-making during the cuban missile crisis: a computer simulation. Simulation & Games, 1(2):173–201, 1970.
- An attempt to simulate the outbreak of world war i. American Political Science Review, 61(2):400–416, 1967.
- Generative agents: Interactive simulacra of human behavior. arXiv preprint arXiv:2304.03442, 2023.
- Exploring large language models for communication games: An empirical study on werewolf. arXiv preprint arXiv:2309.04658, 2023.
- Put your money where your mouth is: Evaluating strategic planning and execution of llm agents in an auction arena. arXiv preprint arXiv:2310.05746, 2023.
- OpenAGI: When LLM meets domain experts. In Thirty-seventh Conference on Neural Information Processing Systems, 2023.
- Improving factuality and reasoning in language models through multiagent debate, 2023.
- Chateval: Towards better llm-based evaluators through multi-agent debate, 2023.
- Corex: Pushing the boundaries of complex reasoning through multi-model collaboration, 2023.
- Encouraging divergent thinking in large language models through multi-agent debate, 2023.
- Humanoid agents: Platform for simulating human-like generative agents, 2023.
- Improving language model negotiation with self-play and in-context learning from ai feedback. arXiv preprint arXiv:2305.10142, 2023.
- Metagpt: Meta programming for multi-agent collaborative framework. arXiv preprint arXiv:2308.00352, 2023.
- Bolaa: Benchmarking and orchestrating llm-augmented autonomous agents. arXiv preprint arXiv:2308.05960, 2023.
- Communicative agents for software development, 2023.
- Yohei Nakajima. Babyagi. https://github.com/yoheinakajima/babyagi, 2023.
- Agentverse: Facilitating multi-agent collaboration and exploring emergent behaviors in agents. arXiv preprint arXiv:2308.10848, 2023.
- Camel: Communicative agents for "mind" exploration of large scale language model society, 2023.
- Ted Dickson. The road to united states involvement in world war i: A simulation. OAH Magazine of History, 17(1):48–56, 2002.
- Simulation in international relations: Developments for research and teaching. (No Title), 1963.
- Onesaf objective system (oos) behavior model verification. US Army TRADOC Analysis Center–Monterey, Monterey, CA, 2008.
- Creating a world war ii combat simulator using onesaf objective system. In Proceedings of the Interservice/Industry Training, Simulation, and Education Conference, pages 510–520, 2006.
- Using agent-based simulation and game theory to examine the wwii bay of biscay u-boat campaign. The Journal of Defense Modeling and Simulation, 1(2):99–109, 2004.
- Lost in the middle: How language models use long contexts. arXiv preprint arXiv:2307.03172, 2023.
- Can you follow me? testing situational understanding in chatgpt. arXiv preprint arXiv:2310.16135, 2023.
- Annika Mombauer. The origins of the First World War: controversies and consensus. Routledge, 2013.
- Otto Pick. Who pulled the trigger: Soviet historians and the origins of world war ii. Probs. Communism, 9:64, 1960.
- Annette L Juliano. The warring states period—the state of qin, yan, chu, and pazyryk: A historical footnote. Source: Notes in the History of Art, 10(4):25–29, 1991.
- OpenAI. Gpt-4 technical report, 2023.
- John Keegan. The first world war. Random House, 2014.
- Epps Vinh. Bailey,(2010). Information Theoretic Measures for Clusterings Comparison: Variants, Properties, Normalization and Correction for Chance. JMLR, pages 2837–2854, 2010.
- Mining of massive datasets. Cambridge University Press, 2011.