Bridging the Gap between Partially Observable Stochastic Games and Sparse POMDP Methods (2405.18703v2)

Published 29 May 2024 in cs.GT

Abstract: Many real-world decision problems involve the interaction of multiple self-interested agents with limited sensing ability. The partially observable stochastic game (POSG) provides a mathematical framework for modeling these problems; however, solving a POSG requires difficult reasoning over two critical factors: (1) information revealed by partial observations and (2) decisions other agents make. In the single-agent case, partially observable Markov decision process (POMDP) planning can efficiently address partial observability with particle filtering. In the multi-agent case, extensive-form game solution methods account for other agents' decisions, but preclude belief approximation. We propose a unifying framework that combines POMDP-inspired state distribution approximation and game-theoretic equilibrium search on information sets. This paper lays a theoretical foundation for the approach by bounding errors due to belief approximation, and empirically demonstrates effectiveness with a numerical example. The new approach enables planning in POSGs with very large state spaces, paving the way for reliable autonomous interaction in real-world physical environments and complementing multi-agent reinforcement learning.
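
The abstract combines two ingredients: approximating the belief over hidden states with particles (as in sparse POMDP planners) and searching for an equilibrium over the agents' information sets. The sketch below is only an illustration of how those pieces can fit together, not the paper's actual algorithm: it pairs a bootstrap particle filter with regret matching on a toy, state-dependent matrix game. All model functions (transition, observation_likelihood, payoff) and the action set are hypothetical placeholders.

```python
# Illustrative sketch only: not the authors' algorithm. It shows a particle
# belief update followed by regret matching on a toy two-player zero-sum game
# whose payoffs depend on the hidden state. All models are placeholders.
import random
from collections import Counter

def transition(state, action):
    """Toy stochastic transition: the hidden state drifts by the action plus noise."""
    return state + action + random.choice([-1, 0, 1])

def observation_likelihood(obs, state):
    """Toy observation model: the observation is the state corrupted by unit noise."""
    return 1.0 if abs(obs - state) <= 1 else 1e-6

def particle_filter_update(particles, action, obs, n_particles=500):
    """Bootstrap filter: propagate particles, weight by the observation likelihood, resample."""
    propagated = [transition(s, action) for s in particles]
    weights = [observation_likelihood(obs, s) for s in propagated]
    total = sum(weights)
    if total == 0:
        return propagated  # degenerate belief; keep the propagated particles
    return random.choices(propagated, weights=[w / total for w in weights], k=n_particles)

def payoff(state, a1, a2):
    """Hypothetical zero-sum payoff to player 1, modulated by the hidden state's sign."""
    return (1 if a1 == a2 else -1) * (1 if state >= 0 else -1)

def regret_matching(particles, actions, iters=2000):
    """Approximate an equilibrium of the matrix game induced by averaging the
    payoff over the particle belief, using regret matching."""
    avg = {(a1, a2): sum(payoff(s, a1, a2) for s in particles) / len(particles)
           for a1 in actions for a2 in actions}
    regrets1, regrets2 = Counter(), Counter()
    strat_sum1, strat_sum2 = Counter(), Counter()
    for _ in range(iters):
        def strategy(regrets):
            pos = {a: max(regrets[a], 0.0) for a in actions}
            z = sum(pos.values())
            return {a: (pos[a] / z if z > 0 else 1.0 / len(actions)) for a in actions}
        s1, s2 = strategy(regrets1), strategy(regrets2)
        for a in actions:
            strat_sum1[a] += s1[a]
            strat_sum2[a] += s2[a]
        # Expected payoff to player 1 under the current strategy profile.
        u1 = sum(s1[a1] * s2[a2] * avg[(a1, a2)] for a1 in actions for a2 in actions)
        for a in actions:
            regrets1[a] += sum(s2[a2] * avg[(a, a2)] for a2 in actions) - u1
            regrets2[a] += -sum(s1[a1] * avg[(a1, a)] for a1 in actions) + u1
    n1, n2 = sum(strat_sum1.values()), sum(strat_sum2.values())
    return ({a: strat_sum1[a] / n1 for a in actions},
            {a: strat_sum2[a] / n2 for a in actions})

# Usage: update the belief after acting and observing, then approximate an
# equilibrium over two actions at this information set.
belief = [random.randint(-3, 3) for _ in range(500)]
belief = particle_filter_update(belief, action=0, obs=1)
pi1, pi2 = regret_matching(belief, actions=[0, 1])
print(pi1, pi2)
```

In the paper's setting the game at an information set would be induced by the planner's search tree rather than a fixed payoff matrix; the sketch only illustrates how a particle belief can supply the expected payoffs that an equilibrium-search routine consumes.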

Authors (2)
  1. Tyler Becker (5 papers)
  2. Zachary Sunberg (11 papers)
