Safe Opponent Exploitation For Epsilon Equilibrium Strategies

07/23/2023
by   Linus Jeary, et al.
0

In safe opponent exploitation players hope to exploit their opponents' potentially sub-optimal strategies while guaranteeing at least the value of the game in expectation for themselves. Safe opponent exploitation algorithms have been successfully applied to small instances of two-player zero-sum imperfect information games, where Nash equilibrium strategies are typically known in advance. Current methods available to compute these strategies are however not scalable to desirable large domains of imperfect information such as No-Limit Texas Hold 'em (NLHE) poker, where successful agents rely on game abstractions in order to compute an equilibrium strategy approximation. This paper will extend the concept of safe opponent exploitation by introducing prime-safe opponent exploitation, in which we redefine the value of the game of a player to be the worst-case payoff their strategy could be susceptible to. This allows weaker epsilon equilibrium strategies to benefit from utilising a form of opponent exploitation with our revised value of the game, still allowing for a practical game-theoretical guaranteed lower-bound. We demonstrate the empirical advantages of our generalisation when applied to the main safe opponent exploitation algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/12/2022

Safe Equilibrium

The standard game-theoretic solution concept, Nash equilibrium, assumes ...
research
06/08/2019

Most Important Fundamental Rule of Poker Strategy

Poker is a large complex game of imperfect information, which has been s...
research
07/17/2011

Computing Strong Game-Theoretic Strategies in Jotto

We develop a new approach that computes approximate equilibrium strategi...
research
04/22/2014

Finding safe strategies for competitive diffusion on trees

We study the two-player safe game of Competitive Diffusion, a game-theor...
research
09/18/2023

Desensitization and Deception in Differential Games with Asymmetric Information

Desensitization addresses safe optimal planning under parametric uncerta...
research
06/30/2018

Modeling Friends and Foes

How can one detect friendly and adversarial behavior from raw data? Dete...
research
06/10/2021

Subgame solving without common knowledge

In imperfect-information games, subgame solving is significantly more ch...

Please sign up or login with your details

Forgot password? Click here to reset