Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers (2106.09435v3)

Published 17 Jun 2021 in cs.MA, cs.AI, cs.GT, and cs.LG

Abstract: Two-player, constant-sum games are well studied in the literature, but there has been limited progress outside of this setting. We propose Joint Policy-Space Response Oracles (JPSRO), an algorithm for training agents in n-player, general-sum extensive form games, which provably converges to an equilibrium. We further suggest correlated equilibria (CE) as promising meta-solvers, and propose a novel solution concept Maximum Gini Correlated Equilibrium (MGCE), a principled and computationally efficient family of solutions for solving the correlated equilibrium selection problem. We conduct several experiments using CE meta-solvers for JPSRO and demonstrate convergence on n-player, general-sum games.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Luke Marris (23 papers)
  2. Paul Muller (25 papers)
  3. Marc Lanctot (60 papers)
  4. Karl Tuyls (58 papers)
  5. Thore Graepel (48 papers)
Citations (33)
X Twitter Logo Streamline Icon: https://streamlinehq.com