DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning

04/06/2022
by   Youpeng Zhao, et al.
5

Recent years have witnessed the great breakthrough of deep reinforcement learning (DRL) in various perfect and imperfect information games. Among these games, DouDizhu, a popular card game in China, is very challenging due to the imperfect information, large state space, elements of collaboration and a massive number of possible moves from turn to turn. Recently, a DouDizhu AI system called DouZero has been proposed. Trained using traditional Monte Carlo method with deep neural networks and self-play procedure without the abstraction of human prior knowledge, DouZero has outperformed all the existing DouDizhu AI programs. In this work, we propose to enhance DouZero by introducing opponent modeling into DouZero. Besides, we propose a novel coach network to further boost the performance of DouZero and accelerate its training process. With the integration of the above two techniques into DouZero, our DouDizhu AI system achieves better performance and ranks top in the Botzone leaderboard among more than 400 AI agents, including DouZero.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/11/2021

DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning

Games are abstractions of the real world, where artificial agents learn ...
research
07/27/2020

Combining Deep Reinforcement Learning and Search for Imperfect-Information Games

The combination of deep reinforcement learning and search at both traini...
research
03/30/2022

PerfectDou: Dominating DouDizhu with Perfect Information Distillation

As a challenging multi-player card game, DouDizhu has recently drawn muc...
research
10/31/2022

DanZero: Mastering GuanDan Game with Reinforcement Learning

Card game AI has always been a hot topic in the research of artificial i...
research
12/03/2020

DeepCrawl: Deep Reinforcement Learning for Turn-based Strategy Games

In this paper we introduce DeepCrawl, a fully-playable Roguelike prototy...
research
01/24/2019

Combinational Q-Learning for Dou Di Zhu

Deep reinforcement learning (DRL) has gained a lot of attention in recen...
research
10/10/2022

Exploring Adaptive MCTS with TD Learning in miniXCOM

In recent years, Monte Carlo tree search (MCTS) has achieved widespread ...

Please sign up or login with your details

Forgot password? Click here to reset