Optimistic and Topological Value Iteration for Simple Stochastic Games

07/29/2022
by   Muqsit Azeem, et al.
0

While value iteration (VI) is a standard solution approach to simple stochastic games (SSGs), it suffered from the lack of a stopping criterion. Recently, several solutions have appeared, among them also "optimistic" VI (OVI). However, OVI is applicable only to one-player SSGs with no end components. We lift these two assumptions, making it available to general SSGs. Further, we utilize the idea in the context of topological VI, where we provide an efficient precise solution. In order to compare the new algorithms with the state of the art, we use not only the standard benchmarks, but we also design a random generator of SSGs, which can be biased towards various types of models, aiding in understanding the advantages of different algorithms on SSGs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/13/2018

Value Iteration for Simple Stochastic Games: Stopping Criterion and Learning Algorithm

Simple stochastic games can be solved by value iteration (VI), which yie...
research
09/23/2020

Comparison of Algorithms for Simple Stochastic Games

Simple stochastic games are turn-based 2.5-player zero-sum graph games w...
research
08/21/2020

Comparison of Algorithms for Simple Stochastic Games (Full Version)

Simple stochastic games are turn-based 2.5-player zero-sum graph games w...
research
09/18/2019

Stopping Criteria for Value and Strategy Iteration on Concurrent Stochastic Reachability Games

We consider concurrent stochastic games played on graphs with reachabili...
research
04/19/2023

Stopping Criteria for Value Iteration on Stochastic Games with Quantitative Objectives

A classic solution technique for Markov decision processes (MDP) and sto...
research
07/20/2022

A Lattice-Theoretical View of Strategy Iteration

Strategy iteration is a technique frequently used for two-player games i...
research
07/15/2020

Widest Paths and Global Propagation in Bounded Value Iteration for Stochastic Games

Solving stochastic games with the reachability objective is a fundamenta...

Please sign up or login with your details

Forgot password? Click here to reset