Balancing the AI Strength of Roles in Self-Play Training with Regret Matching+ (2401.12557v2)
Published 23 Jan 2024 in cs.AI
Abstract: When training artificial intelligence for games with multiple roles, developing a single generalized model capable of controlling any character in the game is a viable option. This strategy conserves computational resources and time during training and also reduces resource requirements during deployment. However, training such a generalized model often runs into uneven capabilities across the roles it controls. A simple method based on Regret Matching+ is introduced, which helps the model achieve more balanced strength across roles.
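The abstract names Regret Matching+ without spelling out the update. As a minimal sketch, the snippet below shows a generic Regret Matching+ step: cumulative regrets are clipped at zero after every update, and the next distribution is proportional to the positive regrets. Treating roles as the "actions" and a per-role loss rate as the utility is an assumption made here for illustration only; the function name `regret_matching_plus_step` and the sample numbers are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple


def regret_matching_plus_step(
    cum_regrets: List[float],
    utilities: List[float],
    strategy: List[float],
) -> Tuple[List[float], List[float]]:
    """Apply one generic Regret Matching+ update.

    cum_regrets: clipped cumulative regret per action (all >= 0).
    utilities:   utility observed for each action this round.
    strategy:    distribution over actions used this round.
    Returns the updated regrets and the next distribution.
    """
    # Expected utility under the distribution that was actually played.
    expected = sum(p * u for p, u in zip(strategy, utilities))

    # RM+ differs from plain regret matching by clipping at zero
    # after every update, so negative regret never accumulates.
    new_regrets = [
        max(0.0, r + (u - expected)) for r, u in zip(cum_regrets, utilities)
    ]

    total = sum(new_regrets)
    n = len(new_regrets)
    if total > 0:
        new_strategy = [r / total for r in new_regrets]
    else:
        # No action has positive regret: fall back to uniform.
        new_strategy = [1.0 / n] * n
    return new_regrets, new_strategy


# Hypothetical usage: three roles, utility = assumed loss rate of each role,
# so a role that loses more often receives more sampling weight next round.
regrets = [0.0, 0.0, 0.0]
strategy = [1.0 / 3] * 3
loss_rates = [0.6, 0.3, 0.3]  # made-up numbers for illustration
regrets, strategy = regret_matching_plus_step(regrets, loss_rates, strategy)
```

Under this assumed setup, the role with the above-average loss rate is the only one that gains positive regret, so it is the one favored in the next round of sampling.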