Sample-Efficient Robust Multi-Agent Reinforcement Learning in the Face of Environmental Uncertainty (2404.18909v3)

Published 29 Apr 2024 in cs.LG, stat.ML, and cs.MA

Abstract: To overcome the sim-to-real gap in reinforcement learning (RL), learned policies must maintain robustness against environmental uncertainties. While robust RL has been widely studied in single-agent regimes, in multi-agent environments, the problem remains understudied -- despite the fact that the problems posed by environmental uncertainties are often exacerbated by strategic interactions. This work focuses on learning in distributionally robust Markov games (RMGs), a robust variant of standard Markov games, wherein each agent aims to learn a policy that maximizes its own worst-case performance when the deployed environment deviates within its own prescribed uncertainty set. This results in a set of robust equilibrium strategies for all agents that align with classic notions of game-theoretic equilibria. Assuming a non-adaptive sampling mechanism from a generative model, we propose a sample-efficient model-based algorithm (DRNVI) with finite-sample complexity guarantees for learning robust variants of various notions of game-theoretic equilibria. We also establish an information-theoretic lower bound for solving RMGs, which confirms the near-optimal sample complexity of DRNVI with respect to problem-dependent factors such as the size of the state space, the target accuracy, and the horizon length.
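To make the "worst-case performance within a prescribed uncertainty set" idea concrete, here is a minimal sketch of a single robust backup for one agent. It is not the paper's DRNVI algorithm. It assumes the uncertainty set is a total-variation ball of radius `sigma` around a nominal transition distribution (the abstract does not specify the divergence), and the function names `worst_case_expectation_tv` and `robust_q_value` are hypothetical.

```python
import numpy as np


def worst_case_expectation_tv(p, v, sigma):
    """Minimize q @ v over distributions q with TV(q, p) <= sigma.

    Assumption: TV(q, p) = 0.5 * sum(|q - p|). The minimizer moves up to
    `sigma` probability mass from the highest-value next states onto the
    lowest-value next state.
    """
    q = np.asarray(p, dtype=float).copy()
    v = np.asarray(v, dtype=float)
    worst = int(np.argmin(v))      # state that receives the shifted mass
    budget = float(sigma)          # total mass the adversary may move
    for s in np.argsort(-v):       # visit next states from highest value down
        if budget <= 0.0:
            break
        if s == worst:
            continue
        moved = min(q[s], budget)
        q[s] -= moved
        q[worst] += moved
        budget -= moved
    return float(q @ v)


def robust_q_value(reward, p_nominal, v_next, sigma):
    """One robust backup: immediate reward plus worst-case next-state value.

    Illustrative only: one agent, one (state, joint action) pair, with the
    other agents' policies held fixed and already folded into `p_nominal`.
    """
    return reward + worst_case_expectation_tv(p_nominal, v_next, sigma)
```

For example, with nominal transition probabilities (0.5, 0.5), next-state values (0, 1), and radius 0.2, the adversary shifts 0.2 of the mass onto the low-value state, so the worst-case expectation is 0.3 rather than the nominal 0.5.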

Authors (4)

Laixi Shi
Eric Mazumdar
Yuejie Chi
Adam Wierman


