Transformer Based Planning in the Observation Space with Applications to Trick Taking Card Games (2404.13150v1)

Published 19 Apr 2024 in cs.AI and cs.LG

Abstract: Traditional search algorithms have issues when applied to games of imperfect information where the number of possible underlying states and trajectories are very large. This challenge is particularly evident in trick-taking card games. While state sampling techniques such as Perfect Information Monte Carlo (PIMC) search has shown success in these contexts, they still have major limitations. We present Generative Observation Monte Carlo Tree Search (GO-MCTS), which utilizes MCTS on observation sequences generated by a game specific model. This method performs the search within the observation space and advances the search using a model that depends solely on the agent's observations. Additionally, we demonstrate that transformers are well-suited as the generative model in this context, and we demonstrate a process for iteratively training the transformer via population-based self-play. The efficacy of GO-MCTS is demonstrated in various games of imperfect information, such as Hearts, Skat, and "The Crew: The Quest for Planet Nine," with promising results.

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/HyperMindAI/status/1782658955212099733

Transformer Based Planning in the Observation Space with Applications to Trick Taking Card Games (2404.13150v1)

Summary

Related Papers

Tweets