Learning to Play Stochastic Two-player Perfect-Information Games without Knowledge

02/08/2023
by   Quentin Cohen-Solal, et al.
0

In this paper, we extend the Descent framework, which enables learning and planning in the context of two-player games with perfect information, to the framework of stochastic games. We propose two ways of doing this, the first way generalizes the search algorithm, i.e. Descent, to stochastic games and the second way approximates stochastic games by deterministic games. We then evaluate them on the game EinStein wurfelt nicht! against state-of-the-art algorithms: Expectiminimax and Polygames (i.e. the Alpha Zero algorithm). It is our generalization of Descent which obtains the best results. The approximation by deterministic games nevertheless obtains good results, presaging that it could give better results in particular contexts.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/22/2021

Well-Founded Extensive Games with Perfect Information

We consider extensive games with perfect information with well-founded g...
research
12/06/2021

Player of Games

Games have a long history of serving as a benchmark for progress in arti...
research
10/29/2019

Multiplayer AlphaZero

The AlphaZero algorithm has achieved superhuman performance in two-playe...
research
07/03/2003

BL-WoLF: A Framework For Loss-Bounded Learnability In Zero-Sum Games

We present BL-WoLF, a framework for learnability in repeated zero-sum ga...
research
11/18/2019

The αμ Search Algorithm for the Game of Bridge

αμ is an anytime heuristic search algorithm for incomplete information g...
research
01/26/2022

Different strokes in randomised strategies: Revisiting Kuhn's theorem under finite-memory assumptions

Two-player (antagonistic) games on (possibly stochastic) graphs are a pr...
research
05/27/2019

Policy Based Inference in Trick-Taking Card Games

Trick-taking card games feature a large amount of private information th...

Please sign up or login with your details

Forgot password? Click here to reset