Learning Probabilistic Temporal Safety Properties from Examples in Relational Domains

11/07/2022
by   Gavin Rens, et al.

We propose a framework for learning a fragment of probabilistic computation tree logic (pCTL) formulae from a set of states that are labeled as safe or unsafe. We work in a relational setting and combine ideas from relational Markov Decision Processes with pCTL model checking. More specifically, we assume that there is an unknown relational pCTL target formula that is satisfied only by safe states and that has a horizon of at most k steps and a threshold probability α. The task is to learn this unknown formula from states that a domain expert has labeled as safe or unsafe. We apply principles of relational learning to induce a pCTL formula that is satisfied by all safe states and none of the unsafe ones. This formula can then be used as a safety specification for the domain, so that the system can avoid getting into dangerous situations in the future. Following relational learning principles, we introduce a process for generating candidate formulae, as well as a method for deciding which candidate formula is a satisfactory specification for the given labeled states. We treat both the case where the expert knows the system policy and the case where the expert does not; much of the learning process is the same in both. We evaluate our approach on a synthetic relational domain.
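
The generate-and-test idea in the abstract can be illustrated with a minimal sketch. It is not the authors' method: it is propositional rather than relational, it assumes a fixed policy (so the model is a plain Markov chain), and it assumes candidates of the single form "the probability of reaching a state labeled with an atom within k steps is at most α". The chain, the atom names, and the functions `bounded_reach_prob` and `consistent_candidates` are all hypothetical.

```python
from itertools import product

# Hypothetical toy Markov chain (fixed policy assumed): state -> {successor: probability}.
TRANSITIONS = {
    "s0": {"s0": 0.5, "s1": 0.5},
    "s1": {"s1": 0.5, "s2": 0.5},
    "s2": {"s2": 1.0},
    "s3": {"s3": 1.0},
}
# Hypothetical labeling of states with propositional atoms.
LABELS = {"s0": set(), "s1": {"near_edge"}, "s2": {"fallen"}, "s3": set()}


def bounded_reach_prob(atom, k):
    """For each state, the probability of reaching a state labeled `atom` within k steps."""
    prob = {s: 1.0 if atom in LABELS[s] else 0.0 for s in TRANSITIONS}
    for _ in range(k):
        # One step of backward induction; states labeled `atom` stay at 1.
        prob = {
            s: 1.0 if atom in LABELS[s]
            else sum(p * prob[t] for t, p in TRANSITIONS[s].items())
            for s in TRANSITIONS
        }
    return prob


def consistent_candidates(safe, unsafe, atoms, horizons, thresholds):
    """Return (atom, k, alpha) triples satisfied by every safe state and by no unsafe state."""
    safe, unsafe = set(safe), set(unsafe)
    found = []
    for atom, k, alpha in product(atoms, horizons, thresholds):
        prob = bounded_reach_prob(atom, k)
        satisfied = {s for s in TRANSITIONS if prob[s] <= alpha}
        if safe <= satisfied and not (unsafe & satisfied):
            found.append((atom, k, alpha))
    return found


result = consistent_candidates(
    safe={"s0", "s3"},
    unsafe={"s1", "s2"},
    atoms=["fallen", "near_edge"],
    horizons=[1, 2],
    thresholds=[0.1, 0.3],
)
print(result)
```

In this toy setting several candidates can separate the labeled states, so a learner still needs a criterion for choosing among them; the abstract mentions such a decision method but does not describe it here.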
