Stopping Criteria for Value and Strategy Iteration on Concurrent Stochastic Reachability Games

09/18/2019
by   Julia Eisentraut, et al.
0

We consider concurrent stochastic games played on graphs with reachability and safety objectives. These games can be solved by value iteration as well as strategy iteration, each of them yielding a sequence of under-approximations of the reachability value and a sequence of over-approximation of the safety value, converging to it in the limit. For both approaches, we provide the first (anytime) algorithms with stopping criteria. The stopping criterion for value iteration is based on providing a convergent sequence of over-approximations, which then allows to estimate the distance to the true value. For strategy iteration, we bound the error by complementing the strategy iteration algorithm for reachability by a new strategy iteration algorithm under-approximating the safety-value.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/13/2018

Value Iteration for Simple Stochastic Games: Stopping Criterion and Learning Algorithm

Simple stochastic games can be solved by value iteration (VI), which yie...
research
06/11/2018

Reachability for Branching Concurrent Stochastic Games

We give polynomial time algorithms for deciding almost-sure and limit-su...
research
04/19/2023

Stopping Criteria for Value Iteration on Stochastic Games with Quantitative Objectives

A classic solution technique for Markov decision processes (MDP) and sto...
research
07/15/2020

Widest Paths and Global Propagation in Bounded Value Iteration for Stochastic Games

Solving stochastic games with the reachability objective is a fundamenta...
research
07/04/2012

Point-Based POMDP Algorithms: Improved Analysis and Implementation

Existing complexity bounds for point-based POMDP value iteration algorit...
research
08/21/2020

Comparison of Algorithms for Simple Stochastic Games (Full Version)

Simple stochastic games are turn-based 2.5-player zero-sum graph games w...
research
07/29/2022

Optimistic and Topological Value Iteration for Simple Stochastic Games

While value iteration (VI) is a standard solution approach to simple sto...

Please sign up or login with your details

Forgot password? Click here to reset