Statistically Model Checking PCTL Specifications on Markov Decision Processes via Reinforcement Learning

04/01/2020
by   Yu Wang, et al.
Probabilistic Computation Tree Logic (PCTL) is frequently used to formally specify control objectives such as probabilistic reachability and safety. In this work, we focus on statistical model checking of PCTL specifications on Markov Decision Processes (MDPs) by sampling, e.g., checking whether there exists a feasible policy under which the probability of reaching certain goal states exceeds a threshold. We use reinforcement learning to search for such a feasible policy, and then develop a statistical model checking (SMC) method with provable guarantees on its error. Specifically, we first use upper-confidence-bound (UCB) based Q-learning to design an SMC algorithm for bounded-time PCTL specifications. We then extend this algorithm to unbounded-time specifications: a proper truncation time is identified by checking the PCTL specification and its negation at the same time. Finally, we evaluate the proposed method on case studies.
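To make the sampling idea concrete, the following is a minimal sketch of checking a bounded-time reachability property by simulation. It is not the algorithm of the paper: the policy is fixed rather than learned by UCB-based Q-learning, and the error bound comes from the standard Hoeffding inequality. The toy MDP, the names `TRANSITIONS`, `GOAL`, `hoeffding_samples`, and `estimate_reachability`, and all numeric values are assumptions made purely for illustration.

```python
import math
import random

# Hypothetical toy MDP (illustration only):
# TRANSITIONS[state][action] = list of (next_state, probability)
TRANSITIONS = {
    0: {"a": [(1, 0.5), (0, 0.5)], "b": [(2, 0.1), (0, 0.9)]},
    1: {"a": [(2, 0.8), (0, 0.2)], "b": [(1, 1.0)]},
    2: {"a": [(2, 1.0)], "b": [(2, 1.0)]},  # goal state, absorbing
}
GOAL = {2}


def hoeffding_samples(eps, delta):
    """Number of i.i.d. samples so that the empirical mean is within eps
    of the true probability with confidence at least 1 - delta."""
    return math.ceil(math.log(2.0 / delta) / (2.0 * eps ** 2))


def step(state, action, rng):
    """Sample one successor state from the transition distribution."""
    r = rng.random()
    acc = 0.0
    for nxt, p in TRANSITIONS[state][action]:
        acc += p
        if r < acc:
            return nxt
    # Guard against floating-point round-off in the cumulative sum.
    return TRANSITIONS[state][action][-1][0]


def reaches_goal(policy, horizon, rng, start=0):
    """Simulate one path and report whether a goal state is reached
    within `horizon` steps."""
    state = start
    for _ in range(horizon):
        if state in GOAL:
            return True
        state = step(state, policy[state], rng)
    return state in GOAL


def estimate_reachability(policy, horizon, eps, delta, seed=0):
    """Monte Carlo estimate of the bounded-time reachability probability
    under a fixed memoryless policy, with a Hoeffding sample size."""
    rng = random.Random(seed)
    n = hoeffding_samples(eps, delta)
    hits = sum(reaches_goal(policy, horizon, rng) for _ in range(n))
    return hits / n, n


if __name__ == "__main__":
    policy = {0: "a", 1: "a", 2: "a"}  # assumed fixed policy
    p_hat, n = estimate_reachability(policy, horizon=10, eps=0.05, delta=0.01)
    threshold = 0.9
    # Accept only if the estimate clears the threshold by the margin eps.
    print(f"samples={n}, estimate={p_hat:.3f}, satisfied={p_hat - 0.05 > threshold}")
```

Under these assumptions, the estimate is within `eps` of the true reachability probability with confidence at least `1 - delta`, so a decision is sound whenever the estimate is more than `eps` away from the threshold. Searching over policies, as the abstract describes, would replace the fixed `policy` with a learned one.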

research
07/04/2021

Linear-Time Model Checking Branching Processes

(Multi-type) branching processes are a natural and well-studied model fo...
research
04/14/2017

Environment-Independent Task Specifications via GLTL

We propose a new task-specification language for Markov decision process...
research
08/15/2023

Formally-Sharp DAgger for MCTS: Lower-Latency Monte Carlo Tree Search using Data Aggregation with Formal Methods

We study how to efficiently combine formal methods, Monte Carlo Tree Sea...
research
07/08/2019

Bounded Model Checking of Max-Plus Linear Systems via Predicate Abstractions

This paper introduces the abstraction of max-plus linear (MPL) systems v...
research
10/05/2021

Adversarial Robustness Verification and Attack Synthesis in Stochastic Systems

Probabilistic model checking is a useful technique for specifying and ve...
research
11/07/2022

Learning Probabilistic Temporal Safety Properties from Examples in Relational Domains

We propose a framework for learning a fragment of probabilistic computat...
research
07/16/2018

Structured Synthesis for Probabilistic Systems

We introduce the concept of structured synthesis for Markov decision pro...
