Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning (2204.02558v1)

Published 6 Apr 2022 in cs.AI and cs.LG

Abstract: Recent years have witnessed the great breakthrough of deep reinforcement learning (DRL) in various perfect and imperfect information games. Among these games, DouDizhu, a popular card game in China, is very challenging due to the imperfect information, large state space, elements of collaboration and a massive number of possible moves from turn to turn. Recently, a DouDizhu AI system called DouZero has been proposed. Trained using traditional Monte Carlo method with deep neural networks and self-play procedure without the abstraction of human prior knowledge, DouZero has outperformed all the existing DouDizhu AI programs. In this work, we propose to enhance DouZero by introducing opponent modeling into DouZero. Besides, we propose a novel coach network to further boost the performance of DouZero and accelerate its training process. With the integration of the above two techniques into DouZero, our DouDizhu AI system achieves better performance and ranks top in the Botzone leaderboard among more than 400 AI agents, including DouZero.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Youpeng Zhao (16 papers)
  2. Jian Zhao (218 papers)
  3. Xunhan Hu (8 papers)
  4. Wengang Zhou (153 papers)
  5. Houqiang Li (236 papers)
Citations (14)

Summary

We haven't generated a summary for this paper yet.