Papers
Topics
Authors
Recent
Search
2000 character limit reached

A Generic Multi-Player Transformation Algorithm for Solving Large-Scale Zero-Sum Extensive-Form Adversarial Team Games

Published 4 Jul 2023 in cs.GT | (2307.01441v1)

Abstract: Many recent practical and theoretical breakthroughs focus on adversarial team multi-player games (ATMGs) in ex ante correlation scenarios. In this setting, team members are allowed to coordinate their strategies only before the game starts. Although there existing algorithms for solving extensive-form ATMGs, the size of the game tree generated by the previous algorithms grows exponentially with the number of players. Therefore, how to deal with large-scale zero-sum extensive-form ATMGs problems close to the real world is still a significant challenge. In this paper, we propose a generic multi-player transformation algorithm, which can transform any multi-player game tree satisfying the definition of AMTGs into a 2-player game tree, such that finding a team-maxmin equilibrium with correlation (TMECor) in large-scale ATMGs can be transformed into solving NE in 2-player games. To achieve this goal, we first introduce a new structure named private information pre-branch, which consists of a temporary chance node and coordinator nodes and aims to make decisions for all potential private information on behalf of the team members. We also show theoretically that NE in the transformed 2-player game is equivalent TMECor in the original multi-player game. This work significantly reduces the growth of action space and nodes from exponential to constant level. This enables our work to outperform all the previous state-of-the-art algorithms in finding a TMECor, with 182.89, 168.47, 694.44, and 233.98 significant improvements in the different Kuhn Poker and Leduc Poker cases (21K3, 21K4, 21K6 and 21L33). In addition, this work first practically solves the ATMGs in a 5-player case which cannot be conducted by existing algorithms.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (33)
  1. The hanabi challenge: A new frontier for ai research. Artificial Intelligence 280 (2020), 103216.
  2. Computing the Team-maxmin Equilibrium in Single-Team Single-Adversary Team Games. Intelligenza Artificiale 11, 1 (2017), 67–79. https://doi.org/10.3233/IA-170107
  3. Team-Maxmin Equilibrium: Efficiency Bounds and Algorithms. In Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, February 4-9, 2017, San Francisco, California, USA, Satinder Singh and Shaul Markovitch (Eds.). AAAI Press, 356–362. http://aaai.org/ocs/index.php/AAAI/AAAI17/paper/view/14264
  4. Noam Brown. 2020. Equilibrium finding for large adversarial imperfect-information games. PhD thesis (2020).
  5. Deep Counterfactual Regret Minimization. In Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA (Proceedings of Machine Learning Research, Vol. 97), Kamalika Chaudhuri and Ruslan Salakhutdinov (Eds.). PMLR, 793–802.
  6. Noam Brown and Tuomas Sandholm. 2017. Safe and nested subgame solving for imperfect-information games. Advances in neural information processing systems 30 (2017).
  7. Noam Brown and Tuomas Sandholm. 2019a. Solving Imperfect-Information Games via Discounted Regret Minimization. In The Thirty-Third AAAI Conference on Artificial Intelligence, AAAI 2019, The Thirty-First Innovative Applications of Artificial Intelligence Conference, IAAI 2019, The Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019, Honolulu, Hawaii, USA, January 27 - February 1, 2019. AAAI Press, 1829–1836. https://doi.org/10.1609/aaai.v33i01.33011829
  8. Noam Brown and Tuomas Sandholm. 2019b. Superhuman AI for multiplayer poker. Science 365, 6456 (2019), 885–890.
  9. Multi-Agent Coordination in Adversarial Environments through Signal Mediated Strategies. In AAMAS ’21: 20th International Conference on Autonomous Agents and Multiagent Systems, Virtual Event, United Kingdom, May 3-7, 2021. ACM, 269–278. https://doi.org/10.5555/3463952.3463989
  10. A Marriage between Adversarial Team Games and 2-player Games: Enabling Abstractions, No-regret Learning, and Subgame Solving. In International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA (Proceedings of Machine Learning Research, Vol. 162), Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvári, Gang Niu, and Sivan Sabato (Eds.). PMLR, 2638–2657.
  11. Coordination in adversarial sequential team games via multi-agent deep reinforcement learning. arXiv preprint arXiv:1912.07712 (2019).
  12. Andrea Celli and Nicola Gatti. 2018. Computational Results for Extensive-Form Adversarial Team Games. In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, (AAAI-18), the 30th innovative Applications of Artificial Intelligence (IAAI-18), and the 8th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI-18), New Orleans, Louisiana, USA, February 2-7, 2018, Sheila A. McIlraith and Kilian Q. Weinberger (Eds.). AAAI Press, 965–972.
  13. Xi Chen and Xiaotie Deng. 2005. 3-Nash is PPAD-complete. In Electronic Colloquium on Computational Complexity, Vol. 134. Citeseer, 2–29.
  14. Ex ante coordination and collusion in zero-sum multi-player extensive-form games. Advances in Neural Information Processing Systems 31 (2018).
  15. Ex ante coordination and collusion in zero-sum multi-player extensive-form games. In Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, NeurIPS 2018, December 3-8, 2018, Montréal, Canada. 9661–9671. https://proceedings.neurips.cc/paper/2018/hash/c17028c9b6e0c5deaad29665d582284a-Abstract.html
  16. Connecting Optimal Ex-Ante Collusion in Teams to Extensive-Form Correlation: Faster Algorithms and Positive Complexity Results. In Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event (Proceedings of Machine Learning Research, Vol. 139), Marina Meila and Tong Zhang (Eds.). PMLR, 3164–3173. http://proceedings.mlr.press/v139/farina21a.html
  17. Approximability and parameterized complexity of minmax values. In Internet and Network Economics: 4th International Workshop, WINE 2008, Shanghai, China, December 17-20, 2008. Proceedings 4. Springer, 684–695.
  18. Approximability and Parameterized Complexity of Minmax Values. In Internet and Network Economics, 4th International Workshop, WINE 2008, Shanghai, China, December 17-20, 2008. Proceedings (Lecture Notes in Computer Science, Vol. 5385), Christos H. Papadimitriou and Shuzhong Zhang (Eds.). Springer, 684–695. https://doi.org/10.1007/978-3-540-92185-1_74
  19. Strategic path reliability in information networks. Technical Report. DIW Discussion Papers.
  20. Harold W Kuhn. 1950a. Extensive games. Proceedings of the National Academy of Sciences 36, 10 (1950), 570–576.
  21. Harold W Kuhn. 1950b. A simplified two-person poker. Contributions to the Theory of Games 1 (1950), 97–103.
  22. Monte Carlo sampling for regret minimization in extensive games. Advances in neural information processing systems 22 (2009).
  23. John Nash. 1951. Non-cooperative games. Annals of mathematics (1951), 286–295.
  24. Bayes? Bluff: Opponent Modelling in Poker. In UAI ’05, Proceedings of the 21st Conference in Uncertainty in Artificial Intelligence, Edinburgh, Scotland, July 26-29, 2005. AUAI Press, 550–558.
  25. Solving Heads-Up Limit Texas Hold’em. In Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, IJCAI 2015, Buenos Aires, Argentina, July 25-31, 2015, Qiang Yang and Michael J. Wooldridge (Eds.). AAAI Press, 645–652.
  26. Bernhard von Stengel and Daphne Koller. 1997. Team-maxmin equilibria. Games and Economic Behavior 21, 1-2 (1997), 309–321.
  27. Brian Hu Zhang and Tuomas Sandholm. 2022. Team Correlated Equilibria in Zero-Sum Extensive-Form Games via Tree Decompositions. In Thirty-Sixth AAAI Conference on Artificial Intelligence, AAAI 2022, Thirty-Fourth Conference on Innovative Applications of Artificial Intelligence, IAAI 2022, The Twelveth Symposium on Educational Advances in Artificial Intelligence, EAAI 2022 Virtual Event, February 22 - March 1, 2022. AAAI Press, 5252–5259.
  28. Youzhi Zhang and Bo An. 2020a. Computing Team-Maxmin Equilibria in Zero-Sum Multiplayer Extensive-Form Games. In The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020, The Thirty-Second Innovative Applications of Artificial Intelligence Conference, IAAI 2020, The Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2020, New York, NY, USA, February 7-12, 2020. AAAI Press, 2318–2325. https://ojs.aaai.org/index.php/AAAI/article/view/5610
  29. Youzhi Zhang and Bo An. 2020b. Converging to Team-Maxmin Equilibria in Zero-Sum Multiplayer Games. In Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13-18 July 2020, Virtual Event (Proceedings of Machine Learning Research, Vol. 119). PMLR, 11033–11043. http://proceedings.mlr.press/v119/zhang20c.html
  30. Computing Ex Ante Coordinated Team-Maxmin Equilibria in Zero-Sum Multiplayer Extensive-Form Games. In Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI 2021, Thirty-Third Conference on Innovative Applications of Artificial Intelligence, IAAI 2021, The Eleventh Symposium on Educational Advances in Artificial Intelligence, EAAI 2021, Virtual Event, February 2-9, 2021. AAAI Press, 5813–5821.
  31. Correlation-Based Algorithm for Team-Maxmin Equilibrium in Multiplayer Extensive-Form Games. In Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, IJCAI 2022, Vienna, Austria, 23-29 July 2022, Luc De Raedt (Ed.). ijcai.org, 606–612. https://doi.org/10.24963/ijcai.2022/86
  32. Lazy-CFR: fast and near-optimal regret minimization for extensive games with imperfect information. In 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020. OpenReview.net. https://openreview.net/forum?id=rJx4p3NYDB
  33. Regret minimization in games with incomplete information. Advances in neural information processing systems 20 (2007).

Summary

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.