Observable Perfect Equilibrium (2210.16506v9)
Abstract: While Nash equilibrium has emerged as the central game-theoretic solution concept, many important games contain several Nash equilibria and we must determine how to select between them in order to create real strategic agents. Several Nash equilibrium refinement concepts have been proposed and studied for sequential imperfect-information games, the most prominent being trembling-hand perfect equilibrium, quasi-perfect equilibrium, and recently one-sided quasi-perfect equilibrium. These concepts are robust to certain arbitrarily small mistakes, and are guaranteed to always exist; however, we argue that neither of these is the correct concept for developing strong agents in sequential games of imperfect information. We define a new equilibrium refinement concept for extensive-form games called observable perfect equilibrium in which the solution is robust over trembles in publicly-observable action probabilities (not necessarily over all action probabilities that may not be observable by opposing players). Observable perfect equilibrium correctly captures the assumption that the opponent is playing as rationally as possible given mistakes that have been observed (while previous solution concepts do not). We prove that observable perfect equilibrium is always guaranteed to exist, and demonstrate that it leads to a different solution than the prior extensive-form refinements in no-limit poker. We expect observable perfect equilibrium to be a useful equilibrium refinement concept for modeling many important imperfect-information games of interest in artificial intelligence.
- Nick Abou Risk and Duane Szafron. Using counterfactual regret minimization to create competitive multiplayer poker agents. In Proceedings of the International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), 2010.
- The Mathematics of Poker. ConJelCo LLC, Pittsburgh, PA, USA, 2006.
- Approximating game-theoretic optimal strategies for full-scale poker. In Proceedings of the 18th International Joint Conference on Artificial Intelligence (IJCAI), 2003.
- Superhuman AI for heads-up no-limit poker: Libratus beats top professionals. Science, 359:418–424, 2017.
- Superhuman AI for multiplayer poker. Science, 365:885–890, 2019.
- Xi Chen and Xiaotie Deng. 3-Nash is PPAD-complete. Electronic Colloquium on Computational Complexity, Report No. 134:1–12, 2005.
- Xi Chen and Xiaotie Deng. Settling the complexity of 2-player Nash equilibrium. In Proceedings of the Annual Symposium on Foundations of Computer Science (FOCS), 2006.
- The complexity of computing a Nash equilibrium. SIAM Journal on Computing, 1(39):195–259, 2009.
- Equilibrium refinement for the age of machines: The one-sided quasi-perfect equilibrium. In Conference on Neural Information Processing Systems (NeurIPS), 2021.
- Sam Ganzfried. Reflections on the first man vs. machine no-limit Texas hold ’em competition. AI Magazine, 38(2), 2017.
- Game theory-based opponent modeling in large imperfect-information games. In Proceedings of the International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), pages 533–540, 2011.
- Action translation in extensive-form games with large action spaces: Axioms, paradoxes, and the pseudo-harmonic mapping. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), 2013.
- Endgame solving in large imperfect-information games. In Proceedings of the International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), 2015.
- Safe opponent exploitation. ACM Transactions on Economics and Computation (TEAC), 3(8):1–28, 2015.
- A competitive Texas Hold’em poker player via automated abstraction and real-time equilibrium computation. In Proceedings of the National Conference on Artificial Intelligence (AAAI), 2006.
- Better automated abstraction techniques for imperfect information games, with application to Texas Hold’em poker. In Proceedings of the International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), 2007.
- A heads-up no-limit Texas Hold’em poker player: Discretized betting models and automatically generated equilibrium-finding programs. In Proceedings of the International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), 2008.
- Smoothing techniques for computing Nash equilibria of sequential games. Mathematics of Operations Research, 35(2):494–512, 2010.
- Eric Jackson. Slumbot NL: Solving large games with counterfactual regret minimization using sampling and distributed processing. In AAAI Workshop on Computer Poker and Incomplete Information, 2013.
- Evaluating state-space abstractions in extensive-form games. In Proceedings of the International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), 2013.
- Fast algorithms for finding randomized strategies in game trees. In Proceedings of the 26th ACM Symposium on Theory of Computing (STOC), pages 750–760, 1994.
- Deepstack: Expert-level artificial intelligence in heads-up no-limit poker. Science, 356:508–513, 2017.
- John Nash. Equilibrium points in n-person games. Proceedings of the National Academy of Sciences, 36:48–49, 1950.
- John Nash. Non-cooperative games. Annals of Mathematics, 54:289–295, 1951.
- Aviad Rubinstein. Settling the complexity of computing approximate two-player Nash equilibria. In Proceedings of the Annual Symposium on Theory of Computing (STOC), 2017.
- Reinhard Selten. Reexamination of the perfectness concept for equilibrium points in extensive games. International Journal of Game Theory, 4:25–55, 1975.
- Abstraction methods for game theoretic poker. In CG ’00: Revised Papers from the Second International Conference on Computers and Games, London, UK, 2002. Springer-Verlag.
- Eric van Damme. A relation between perfect equilibria in extensive form games and proper equilibria in normal form games. International Journal of Game Theory, 13:1–13, 1984.
- Eric van Damme. Stability and Perfection of Nash Equilibrium. Springer-Verlag, New York, New York, 1987.
- A practical use of imperfect recall. In Proceedings of the Symposium on Abstraction, Reformulation and Approximation (SARA), 2009.
- Regret minimization in games with incomplete information. In Proceedings of the Annual Conference on Neural Information Processing Systems (NIPS), 2007.