Value Iteration for Simple Stochastic Games: Stopping Criterion and Learning Algorithm

04/13/2018
by   Edon Kelmendi, et al.
0

Simple stochastic games can be solved by value iteration (VI), which yields a sequence of under-approximations of the value of the game. This sequence is guaranteed to converge to the value only in the limit. Since no stopping criterion is known, this technique does not provide any guarantees on its results. We provide the first stopping criterion for VI on simple stochastic games. It is achieved by additionally computing a convergent sequence of over-approximations of the value, relying on an analysis of the game graph. Consequently, VI becomes an anytime algorithm returning the approximation of the value and the current error bound. As another consequence, we can provide a simulation-based asynchronous VI algorithm, which yields the same guarantees, but without necessarily exploring the whole game graph.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/18/2019

Stopping Criteria for Value and Strategy Iteration on Concurrent Stochastic Reachability Games

We consider concurrent stochastic games played on graphs with reachabili...
research
07/29/2022

Optimistic and Topological Value Iteration for Simple Stochastic Games

While value iteration (VI) is a standard solution approach to simple sto...
research
07/16/2020

Polyhedral value iteration for discounted games and energy games

We present a deterministic algorithm, solving discounted games with n no...
research
02/09/2021

A Generic Strategy Iteration Method for Simple Stochastic Games

We present a generic strategy iteration algorithm (GSIA) to find an opti...
research
08/14/2019

Approximating Values of Generalized-Reachability Stochastic Games

Simple stochastic games are turn-based 2.5-player games with a reachabil...
research
07/03/2019

Computing Probabilistic Bisimilarity Distances for Probabilistic Automata

The probabilistic bisimilarity distance of Deng et al. has been proposed...
research
09/16/2014

Compute Less to Get More: Using ORC to Improve Sparse Filtering

Sparse Filtering is a popular feature learning algorithm for image class...

Please sign up or login with your details

Forgot password? Click here to reset