Satisfiability Bounds for ω-Regular Properties in Bounded-Parameter Markov Decision Processes

07/27/2022
by   Jan Křetínský, et al.
0

We consider the problem of computing minimum and maximum probabilities of satisfying an ω-regular property in a bounded-parameter Markov decision process (BMDP). BMDP arise from Markov decision processes (MDP) by allowing for uncertainty on the transition probabilities in the form of intervals where the actual probabilities are unknown. ω-regular languages form a large class of properties, expressible as, e.g., Rabin or parity automata, encompassing rich specifications such as linear temporal logic. In a BMDP the probability to satisfy the property depends on the unknown transitions probabilities as well as on the policy. In this paper, we compute the extreme values. This solves the problem specifically suggested by Dutreix and Coogan in CDC 2018, extending their results on interval Markov chains with no adversary. The main idea is to reinterpret their work as analysis of interval MDP and accordingly the BMDP problem as analysis of an ω-regular stochastic game, where a solution is provided. This method extends smoothly further to bounded-parameter stochastic games.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/28/2020

The Complexity of Reachability in Parametric Markov Decision Processes

This article presents the complexity of reachability decision problems f...
research
05/14/2021

Efficient PAC Reinforcement Learning in Regular Decision Processes

Recently regular decision processes have been proposed as a well-behaved...
research
04/27/2022

Bounds for Synchronizing Markov Decision Processes

We consider Markov decision processes with synchronizing objectives, whi...
research
07/10/2020

Improved Analysis of UCRL2 with Empirical Bernstein Inequality

We consider the problem of exploration-exploitation in communicating Mar...
research
10/23/2019

Farkas certificates and minimal witnesses for probabilistic reachability constraints

This paper introduces Farkas certificates for lower and upper bounds on ...
research
03/16/2019

Parameter Synthesis for Markov Models

Markov chain analysis is a key technique in reliability engineering. A p...
research
11/02/2022

Interval Markov Decision Processes with Continuous Action-Spaces

Interval Markov Decision Processes (IMDPs) are uncertain Markov models, ...

Please sign up or login with your details

Forgot password? Click here to reset