Log In Sign Up

ScrofaZero: Mastering Trick-taking Poker Game Gongzhu by Deep Reinforcement Learning

by   Naichen Shi, et al.

People have made remarkable progress in game AIs, especially in domain of perfect information game. However, trick-taking poker game, as a popular form of imperfect information game, has been regarded as a challenge for a long time. Since trick-taking game requires high level of not only reasoning, but also inference to excel, it can be a new milestone for imperfect information game AI. We study Gongzhu, a trick-taking game analogous to, but slightly simpler than contract bridge. Nonetheless, the strategies of Gongzhu are complex enough for both human and computer players. We train a strong Gongzhu AI ScrofaZero from tabula rasa by deep reinforcement learning, while few previous efforts on solving trick-taking poker game utilize the representation power of neural networks. Also, we introduce new techniques for imperfect information game including stratified sampling, importance weighting, integral over equivalent class, Bayesian inference, etc. Our AI can achieve human expert level performance. The methodologies in building our program can be easily transferred into a wide range of trick-taking games.


Suphx: Mastering Mahjong with Deep Reinforcement Learning

Artificial Intelligence (AI) has achieved great success in many domains,...

Building a Computer Mahjong Player via Deep Convolutional Neural Networks

The evaluation function for imperfect information games is always hard t...

OpenHoldem: An Open Toolkit for Large-Scale Imperfect-Information Game Research

Owning to the unremitting efforts by a few institutes, significant progr...

Dota 2 with Large Scale Deep Reinforcement Learning

On April 13th, 2019, OpenAI Five became the first AI system to defeat th...

On the Power of Refined Skat Selection

Skat is a fascinating combinatorial card game, show-casing many of the i...

Competitive Bridge Bidding with Deep Neural Networks

The game of bridge consists of two stages: bidding and playing. While pl...

GIB: Imperfect Information in a Computationally Challenging Game

This paper investigates the problems arising in the construction of a pr...