Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
167 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Query-Efficient Algorithm to Find all Nash Equilibria in a Two-Player Zero-Sum Matrix Game (2310.16236v3)

Published 24 Oct 2023 in cs.GT

Abstract: We study the query complexity of finding the set of all Nash equilibria $\mathcal X_\star \times \mathcal Y_\star$ in two-player zero-sum matrix games. Fearnley and Savani (2016) showed that for any randomized algorithm, there exists an $n \times n$ input matrix where it needs to query $\Omega(n2)$ entries in expectation to compute a single Nash equilibrium. On the other hand, Bienstock et al. (1991) showed that there is a special class of matrices for which one can query $O(n)$ entries and compute its set of all Nash equilibria. However, these results do not fully characterize the query complexity of finding the set of all Nash equilibria in two-player zero-sum matrix games. In this work, we characterize the query complexity of finding the set of all Nash equilibria $\mathcal X_\star \times \mathcal Y_\star$ in terms of the number of rows $n$ of the input matrix $A \in \mathbb{R}{n \times n}$, row support size $k_1 := |\bigcup_{x \in \mathcal X_\star} \text{supp}(x)|$, and column support size $k_2 := |\bigcup_{y \in \mathcal Y_\star} \text{supp}(y)|$. We design a simple yet non-trivial randomized algorithm that, with probability $1 - \delta$, returns the set of all Nash equilibria $\mathcal X_\star \times \mathcal Y_\star$ by querying at most $O(nk5 \cdot \text{polylog}(n / \delta))$ entries of the input matrix $A \in \mathbb{R}{n \times n}$, where $k := \max{k_1, k_2}$. This upper bound is tight up to a factor of $\text{poly}(k)$, as we show that for any randomized algorithm, there exists an $n \times n$ input matrix with $\min{k_1, k_2} = 1$, for which it needs to query $\Omega(nk)$ entries in expectation in order to find the set of all Nash equilibria $\mathcal X_\star \times \mathcal Y_\star$.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (34)
  1. Ilan Adler. The equivalence of linear programs and zero-sum games. International Journal of Game Theory, 42(1):165, 2013.
  2. Sparse binary zero-sum games. In ACML, 2014.
  3. Yakov Babichenko. Query complexity of approximate nash equilibria. Journal of the ACM (JACM), 63(4):1–24, 2016.
  4. Yakov Babichenko. Informational bounds on equilibria (a survey). ACM SIGecom Exchanges, 17(2):25–45, 2020.
  5. Solutions of discrete, two-person games. Contributions to the Theory of Games, 1:51–72, 1950.
  6. A canonical game–nearly 75 years in the making–showing the equivalence of matrix games and linear programming. Available at SSRN 3851583, 2021.
  7. Regret analysis of stochastic and nonstochastic multi-armed bandit problems. Foundations and Trends® in Machine Learning, 5(1):1–122, 2012.
  8. Sébastien Bubeck et al. Convex optimization: Algorithms and complexity. Foundations and Trends® in Machine Learning, 8(3-4):231–357, 2015.
  9. Fast policy extragradient methods for competitive games with entropy regularization. Advances in Neural Information Processing Systems, 34:27952–27964, 2021.
  10. Prediction, learning, and games. Cambridge university press, 2006.
  11. George B Dantzig. A proof of the equivalence of the programming problem and the game problem. Activity analysis of production and allocation, 13, 1951.
  12. C Daskalakis and Ioannis Panageas. Last-iterate convergence: Zero-sum games and constrained min-max optimization. In 10th Innovations in Theoretical Computer Science (ITCS) conference, ITCS 2019, 2019.
  13. Progress in approximate nash equilibria. In Proceedings of the 8th ACM Conference on Electronic Commerce, pages 355–358, 2007.
  14. The complexity of computing a nash equilibrium. Communications of the ACM, 52(2):89–97, 2009a.
  15. A note on approximate nash equilibria. Theoretical Computer Science, 410(17):1581–1588, 2009b.
  16. Near-optimal no-regret algorithms for zero-sum games. In Proceedings of the twenty-second annual ACM-SIAM symposium on Discrete Algorithms, pages 235–254. SIAM, 2011.
  17. Training gans with optimism. In International Conference on Learning Representations (ICLR 2018), 2018.
  18. A polynomial-time algorithm for 1/3-approximate nash equilibria in bimatrix games. arXiv preprint arXiv:2204.11525, 2022.
  19. A polynomial-time algorithm for 1/2-well-supported nash equilibria in bimatrix games. In Proceedings of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 3777–3787. SIAM, 2023.
  20. Finding approximate nash equilibria of bimatrix games via payoff queries. ACM Transactions on Economics and Computation (TEAC), 4(4):1–19, 2016.
  21. Learning equilibria of games via payoff queries. In Proceedings of the fourteenth ACM conference on Electronic commerce, pages 397–414, 2013.
  22. The theory of learning in games, volume 2. MIT press, 1998.
  23. Bounds for the query complexity of approximate equilibria. ACM Transactions on Economics and Computation (TEAC), 4(4):1–25, 2016.
  24. A sublinear-time randomized approximation algorithm for matrix games. Operations Research Letters, 18(2):53–58, 1995.
  25. Elad Hazan et al. Introduction to online convex optimization. Foundations and Trends® in Optimization, 2(3-4):157–325, 2016.
  26. Game theory, alive, volume 101. American Mathematical Soc., 2017.
  27. Last iterate convergence in no-regret learning: constrained min-max optimization for convex-concave landscapes. In International Conference on Artificial Intelligence and Statistics, pages 1441–1449. PMLR, 2021.
  28. Optimistic mirror descent in saddle-point problems: Going the extra (gradient) mile. In ICLR 2019-7th International Conference on Learning Representations, pages 1–23, 2019.
  29. A unified analysis of extra-gradient and optimistic gradient methods for saddle point problems: Proximal point approach. In International Conference on Artificial Intelligence and Statistics, pages 1497–1507. PMLR, 2020.
  30. Shai Shalev-Shwartz et al. Online learning and online convex optimization. Foundations and Trends® in Machine Learning, 4(2):107–194, 2012.
  31. J v. Neumann. Zur theorie der gesellschaftsspiele. Mathematische annalen, 100(1):295–320, 1928.
  32. Eric Van Damme. Stability and perfection of Nash equilibria, volume 339. Springer, 1991.
  33. Theory of games and economic behavior. Princeton university press, 2nd edition, 1947.
  34. Linear last-iterate convergence in constrained saddle-point optimization. In International Conference on Learning Representations, 2021.
Citations (1)

Summary

We haven't generated a summary for this paper yet.