PAC Statistical Model Checking of Mean Payoff in Discrete- and Continuous-Time MDP

06/03/2022
by   Chaitanya Agarwal, et al.
6

Markov decision processes (MDP) and continuous-time MDP (CTMDP) are the fundamental models for non-deterministic systems with probabilistic uncertainty. Mean payoff (a.k.a. long-run average reward) is one of the most classic objectives considered in their context. We provide the first algorithm to compute mean payoff probably approximately correctly in unknown MDP; further, we extend it to unknown CTMDP. We do not require any knowledge of the state space, only a lower bound on the minimum transition probability, which has been advocated in literature. In addition to providing probably approximately correct (PAC) bounds for our algorithm, we also demonstrate its practical nature by running experiments on standard benchmarks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/10/2019

PAC Statistical Model Checking for Markov Decision Processes and Stochastic Games

Statistical model checking (SMC) is a technique for analysis of probabil...
research
04/05/2016

Bounded Optimal Exploration in MDP

Within the framework of probably approximately correct Markov decision p...
research
10/26/2020

Multi-objective Optimization of Long-run Average and Total Rewards

This paper presents an efficient procedure for multi-objective model che...
research
06/20/2017

Mean-Payoff Optimization in Continuous-Time Markov Chains with Parametric Alarms

Continuous-time Markov chains with alarms (ACTMCs) allow for alarm event...
research
03/10/2022

Data-driven Abstractions with Probabilistic Guarantees for Linear PETC Systems

We employ the scenario approach to compute probably approximately correc...
research
07/07/2023

PAC bounds of continuous Linear Parameter-Varying systems related to neural ODEs

We consider the problem of learning Neural Ordinary Differential Equatio...
research
11/24/2021

Reinforcement Learning for General LTL Objectives Is Intractable

In recent years, researchers have made significant progress in devising ...

Please sign up or login with your details

Forgot password? Click here to reset