Zero-Sum Two Person Perfect Information Semi-Markov Games: A Reduction

01/01/2022
by   S. Sinha, et al.
0

Look at the play of a Perfect Information Semi-Markov game (PISMG). As the game has perfect information, at each time point in any play, all but one player is a dummy. Hence on any particular time instant, at most one player has more than one action available to himself. Thus such games lack real conflict throughout its play and no player directly antagonizes another ever (in each state, the reward matrix is a row or column vector). Above intuition helps us to show that any zero-sum two person PISMG can be reduced to an one-player game, i.e., to a semi-Markov decision process (SMDP), which has a value (Sinha et al.,(2017) [14]). In this paper, we use limiting ratio average pay-off (but any standard pay-off function will do) and prove that any PISMG under such an undiscounted pay-off has a value and both the maximiser (player-I) and minimiser (player-II) have pure semi-stationary optimal strategies. To solve such an undiscounted PISMG, we apply Mondal's algorithm (2017, [11]) on the reduced SMDP obtained from the PISMG.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/29/2022

On Non-Cooperative Perfect Information Semi-Markov Games

We show that an N-person non-cooperative semi-Markov game under limiting...
research
04/21/2021

Random perfect information games

The paper proposes a natural measure space of zero-sum perfect informati...
research
05/26/2019

SAI: a Sensible Artificial Intelligence that plays with handicap and targets high scores in 9x9 Go (extended version)

We develop a new model that can be applied to any perfect information tw...
research
10/04/2021

A Markov process approach to untangling intention versus execution in tennis

Value functions are used in sports applications to determine the optimal...
research
03/06/2021

Zero-Sum Semi-Markov Games with State-Action-Dependent Discount Factors

Semi-Markov model is one of the most general models for stochastic dynam...
research
10/29/2019

Multiplayer AlphaZero

The AlphaZero algorithm has achieved superhuman performance in two-playe...
research
10/20/2019

Leadership scenarios in prisoner's dilemma game

The prisoner's dilemma game is the most known contribution of game theor...

Please sign up or login with your details

Forgot password? Click here to reset