Approximating Poker Probabilities with Deep Learning

08/22/2018
by   Brandon Da Silva, et al.
0

Many poker systems, whether created with heuristics or machine learning, rely on the probability of winning as a key input. However calculating the precise probability using combinatorics is an intractable problem, so instead we approximate it. Monte Carlo simulation is an effective technique that can be used to approximate the probability that a player will win and/or tie a hand. However, without the use of a memory-intensive lookup table or a supercomputer, it becomes infeasible to run millions of times when training an agent with self-play. To combat the space-time tradeoff, we use deep learning to approximate the probabilities obtained from the Monte Carlo simulation with high accuracy. The learned model proves to be a lightweight alternative to Monte Carlo simulation, which ultimately allows us to use the probabilities as inputs during self-play efficiently. The source code and stored neural network can be found at https://github.com/brandinho/Poker-Probability-Approximation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/29/2018

Approximate Bayesian Computation via Population Monte Carlo and Classification

Approximate Bayesian computation (ABC) methods can be used to sample fro...
research
12/22/2022

Scaffolding Generation using a 3D Physarum Polycephalum Simulation

In this demo, we present a novel technique for approximating topological...
research
01/25/2019

Evaluation Function Approximation for Scrabble

The current state-of-the-art Scrabble agents are not learning-based but ...
research
07/29/2023

Neural Classifiers based Monte Carlo simulation

Acceptance-rejection (AR), Independent Metropolis Hastings (IMH) or impo...
research
05/07/2021

Deep Learning Hamiltonian Monte Carlo

We generalize the Hamiltonian Monte Carlo algorithm with a stack of neur...
research
08/03/2019

A hierarchical neural hybrid method for failure probability estimation

Failure probability evaluation for complex physical and engineering syst...
research
12/25/2020

On partial information retrieval: the unconstrained 100 prisoner problem

We consider the classical 100 Prisoner problem and its variant, involvin...

Please sign up or login with your details

Forgot password? Click here to reset