Model-Free Reinforcement Learning for Stochastic Games with Linear Temporal Logic Objectives

10/02/2020
by Alper Kamil Bozkurt, et al.

We study the problem of synthesizing control strategies for Linear Temporal Logic (LTL) objectives in unknown environments. We model this problem as a turn-based zero-sum stochastic game between the controller and the environment, where both the transition probabilities and the model topology are unknown. The winning condition for the controller in this game is the satisfaction of the given LTL specification, which can be captured by the acceptance condition of a deterministic Rabin automaton (DRA) derived directly from the LTL specification. We introduce a model-free reinforcement learning (RL) methodology to find a strategy that maximizes the probability of satisfying a given LTL specification when the Rabin condition of the derived DRA has a single accepting pair. We then generalize this approach to LTL formulas for which the Rabin condition has more than one accepting pair, providing a lower bound on the satisfaction probability. Finally, we illustrate the applicability of our RL method on two motion planning case studies.
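
To make the Rabin acceptance condition mentioned above concrete, here is a minimal sketch in Python. It is not the paper's algorithm. It only checks the standard Rabin condition: a run is accepting if, for some pair (B_i, G_i), no state of B_i is visited infinitely often and some state of G_i is. The function name `rabin_accepts`, the state labels, and the idea of passing in the set of infinitely visited states directly are all assumptions made for illustration.

```python
from typing import FrozenSet, Iterable, List, Tuple

# A Rabin pair (B_i, G_i): states in B_i may be visited only finitely often,
# while at least one state in G_i must be visited infinitely often.
RabinPair = Tuple[FrozenSet[str], FrozenSet[str]]


def rabin_accepts(recurrent_states: Iterable[str], pairs: List[RabinPair]) -> bool:
    """Return True if some pair is satisfied by the infinitely visited states.

    `recurrent_states` is assumed to be the set of automaton states that a run
    visits infinitely often (for example, the states on the loop of a lasso run).
    """
    inf = frozenset(recurrent_states)
    return any(
        inf.isdisjoint(bad) and not inf.isdisjoint(good)
        for bad, good in pairs
    )


# Hypothetical DRA with a single accepting pair, the case the abstract singles out.
single_pair = [(frozenset({"q_bad"}), frozenset({"q_goal"}))]

accepting_run = rabin_accepts({"q0", "q_goal"}, single_pair)
rejecting_run = rabin_accepts({"q_goal", "q_bad"}, single_pair)
```

With a single pair, the check reduces to "avoid B eventually forever and visit G infinitely often". With several pairs, satisfying any one of them suffices, which is the case the abstract's generalization addresses.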

research
09/16/2019

Control Synthesis from Linear Temporal Logic Specifications using Model-Free Reinforcement Learning

We present a reinforcement learning (RL) framework to synthesize a contr...
research
02/08/2021

Learning Optimal Strategies for Temporal Tasks in Stochastic Games

Linear temporal logic (LTL) is widely used to formally specify complex t...
research
11/03/2020

Secure Planning Against Stealthy Attacks via Model-Free Reinforcement Learning

We consider the problem of security-aware planning in an unknown stochas...
research
03/26/2021

Model-Free Learning of Safe yet Effective Controllers

In this paper, we study the problem of learning safe control policies th...
research
09/21/2022

LCRL: Certified Policy Synthesis via Logically-Constrained Reinforcement Learning

LCRL is a software tool that implements model-free Reinforcement Learnin...
research
07/29/2023

Reinforcement Learning Under Probabilistic Spatio-Temporal Constraints with Time Windows

We propose an automata-theoretic approach for reinforcement learning (RL...
research
05/05/2023

Context-triggered Abstraction-based Control Design

We consider the problem of automatically synthesizing a hybrid controlle...
