Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Agent Environment Cycle Games (2009.13051v3)

Published 28 Sep 2020 in cs.LG, cs.AI, cs.GT, cs.MA, and stat.ML

Abstract: Partially Observable Stochastic Games (POSGs) are the most general and common model of games used in Multi-Agent Reinforcement Learning (MARL). We argue that the POSG model is conceptually ill suited to software MARL environments, and offer case studies from the literature where this mismatch has led to severely unexpected behavior. In response to this, we introduce the Agent Environment Cycle Games (AEC Games) model, which is more representative of software implementation. We then prove it's as an equivalent model to POSGs. The AEC games model is also uniquely useful in that it can elegantly represent both all forms of MARL environments, whereas for example POSGs cannot elegantly represent strictly turn based games like chess.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Nathaniel Grammel (13 papers)
  2. Benjamin Black (7 papers)
  3. Ananth Hari (4 papers)
  4. Caroline Horsch (4 papers)
  5. Luis Santos (72 papers)
  6. J K Terry (2 papers)
Citations (6)