Sample-Efficient Regret-Minimizing Double Oracle in Extensive-Form Games (2411.00954v1)

Published 1 Nov 2024 in cs.GT

Abstract: Extensive-Form Game (EFG) represents a fundamental model for analyzing sequential interactions among multiple agents and the primary challenge to solve it lies in mitigating sample complexity. Existing research indicated that Double Oracle (DO) can reduce the sample complexity dependence on the information set number $|S|$ to the final restricted game size $X$ in solving EFG. This is attributed to the early convergence of full-game Nash Equilibrium (NE) through iteratively solving restricted games. However, we prove that the state-of-the-art Extensive-Form Double Oracle (XDO) exhibits \textit{exponential} sample complexity of $X$, due to its exponentially increasing restricted game expansion frequency. Here we introduce Adaptive Double Oracle (AdaDO) to significantly alleviate sample complexity to \textit{polynomial} by deploying the optimal expansion frequency. Furthermore, to comprehensively study the principles and influencing factors underlying sample complexity, we introduce a novel theoretical framework Regret-Minimizing Double Oracle (RMDO) to provide directions for designing efficient DO algorithms. Empirical results demonstrate that AdaDO attains the more superior approximation of NE with less sample complexity than the strong baselines including Linear CFR, MCCFR and existing DO. Importantly, combining RMDO with warm starting and stochastic regret minimization further improves convergence rate and scalability, thereby paving the way for addressing complex multi-agent tasks.

Collections

Sign up for free to add this paper to one or more collections.

Sign Up

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Follow-up Questions

We haven't generated follow-up questions for this paper yet.

Generate Now

Related Papers

Anytime PSRO for Two-Player Zero-Sum Games (2022)
XDO: A Double Oracle Algorithm for Extensive-Form Games (2021)
Online Double Oracle (2021)
Single Deep Counterfactual Regret Minimization (2019)
Regret-Minimizing Double Oracle for Extensive-Form Games (2023)

Sample-Efficient Regret-Minimizing Double Oracle in Extensive-Form Games (2411.00954v1)

Collections

Summary

Follow-up Questions

Authors (6)

Don't miss out on important new AI/ML research