PerfectDou: Dominating DouDizhu with Perfect Information Distillation

03/30/2022
by   Guan Yang, et al.
0

As a challenging multi-player card game, DouDizhu has recently drawn much attention for analyzing competition and collaboration in imperfect-information games. In this paper, we propose PerfectDou, a state-of-the-art DouDizhu AI system that dominates the game, in an actor-critic framework with a proposed technique named perfect information distillation. In detail, we adopt a perfect-training-imperfect-execution framework that allows the agents to utilize the global information to guide the training of the policies as if it is a perfect information game and the trained policies can be used to play the imperfect information game during the actual gameplay. To this end, we characterize card and game features for DouDizhu to represent the perfect and imperfect information. To train our system, we adopt proximal policy optimization with generalized advantage estimation in a parallel training paradigm. In experiments we show how and why PerfectDou beats all existing AI programs, and achieves state-of-the-art performance.

READ FULL TEXT

page 7

page 13

research
12/06/2021

Player of Games

Games have a long history of serving as a benchmark for progress in arti...
research
10/01/2021

Dynamics of targeted ransomware negotiations

In this paper, we consider how the development of targeted ransomware ha...
research
04/06/2022

DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning

Recent years have witnessed the great breakthrough of deep reinforcement...
research
08/14/2020

Joint Policy Search for Multi-agent Collaboration with Imperfect Information

To learn good joint policies for multi-agent collaboration with imperfec...
research
06/11/2021

DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning

Games are abstractions of the real world, where artificial agents learn ...
research
12/22/2021

Alpha-Mini: Minichess Agent with Deep Reinforcement Learning

We train an agent to compete in the game of Gardner minichess, a downsiz...
research
06/14/2019

Problems with the EFG formalism: a solution attempt using observations

We argue that the extensive-form game (EFG) model isn't powerful enough ...

Please sign up or login with your details

Forgot password? Click here to reset