Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Arena: a toolkit for Multi-Agent Reinforcement Learning (1907.09467v1)

Published 20 Jul 2019 in cs.LG, cs.AI, and cs.MA

Abstract: We introduce Arena, a toolkit for multi-agent reinforcement learning (MARL) research. In MARL, it usually requires customizing observations, rewards and actions for each agent, changing cooperative-competitive agent-interaction, and playing with/against a third-party agent, etc. We provide a novel modular design, called Interface, for manipulating such routines in essentially two ways: 1) Different interfaces can be concatenated and combined, which extends the OpenAI Gym Wrappers concept to MARL scenarios. 2) During MARL training or testing, interfaces can be embedded in either wrapped OpenAI Gym compatible Environments or raw environment compatible Agents. We offer off-the-shelf interfaces for several popular MARL platforms, including StarCraft II, Pommerman, ViZDoom, Soccer, etc. The interfaces effectively support self-play RL and cooperative-competitive hybrid MARL. Also, Arena can be conveniently extended to your own favorite MARL platform.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Qing Wang (341 papers)
  2. Jiechao Xiong (21 papers)
  3. Lei Han (91 papers)
  4. Meng Fang (100 papers)
  5. Xinghai Sun (4 papers)
  6. Zhuobin Zheng (4 papers)
  7. Peng Sun (210 papers)
  8. Zhengyou Zhang (21 papers)
Citations (4)