STL-Based Synthesis of Feedback Controllers Using Reinforcement Learning

12/02/2022
by   Nikhil Kumar Singh, et al.

Deep Reinforcement Learning (DRL) has the potential to synthesize feedback controllers (agents) for complex systems with unknown dynamics. These systems are expected to satisfy diverse safety and liveness properties, which are best captured using temporal logic. In RL, the reward function plays a crucial role in specifying the desired behaviour of these agents. However, the problem of designing a reward function that leads an RL agent to satisfy complex temporal logic specifications has received limited attention in the literature. To address this, we provide a systematic way of generating rewards in real time by using the quantitative semantics of Signal Temporal Logic (STL), a temporal logic widely used to specify the behaviour of cyber-physical systems. We propose a new quantitative semantics for STL with several desirable properties that make it suitable for reward generation. We evaluate our STL-based reinforcement learning mechanism on several complex continuous control benchmarks and compare our STL semantics with those available in the literature in terms of their efficacy in synthesizing the controller agent. The experimental results indicate that, among the semantics compared, our new semantics is the most suitable for synthesizing feedback controllers for complex continuous dynamical systems through reinforcement learning.
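To make the idea of deriving a reward from STL quantitative semantics concrete, here is a minimal sketch. It uses the classical min/max robustness semantics from the STL literature, not the new semantics proposed in the paper, which the abstract does not spell out. The formula (stay within a safe bound and eventually come close to a goal), the function names, and all thresholds are hypothetical and chosen only for illustration.

```python
from typing import Sequence


def rob_always(margins: Sequence[float]) -> float:
    """Classical robustness of 'always phi': the worst margin in the window."""
    return min(margins)


def rob_eventually(margins: Sequence[float]) -> float:
    """Classical robustness of 'eventually phi': the best margin in the window."""
    return max(margins)


def stl_reward(trace: Sequence[float], window: int,
               safe_limit: float, goal: float, tol: float) -> float:
    """Reward = robustness of G(|x| < safe_limit) AND F(|x - goal| < tol),
    evaluated over the last `window` samples of a scalar state trace.
    Positive means the (hypothetical) specification holds on that window,
    negative means it is violated, and the magnitude says by how much."""
    recent = trace[-window:]
    # Margin of each predicate at each sample: > 0 iff the predicate holds.
    safety_margins = [safe_limit - abs(x) for x in recent]
    reach_margins = [tol - abs(x - goal) for x in recent]
    # Conjunction maps to min under the classical semantics.
    return min(rob_always(safety_margins), rob_eventually(reach_margins))


if __name__ == "__main__":
    good = [0.0, 0.5, 0.9]   # stays inside the bound and approaches the goal
    bad = [0.0, 2.5, 0.9]    # leaves the safe bound at the second sample
    print(stl_reward(good, window=3, safe_limit=2.0, goal=1.0, tol=0.2))
    print(stl_reward(bad, window=3, safe_limit=2.0, goal=1.0, tol=0.2))
```

In an RL loop, a value like this could be computed at every step from a sliding window of recent states and returned as the step reward. The first trace yields a positive reward and the second a negative one, because the worst safety margin dominates the conjunction.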


research
06/11/2023

Multi-Agent Reinforcement Learning Guided by Signal Temporal Logic Specifications

There has been growing interest in deep reinforcement learning (DRL) alg...
research
10/25/2015

Safe Control under Uncertainty

Controller synthesis for hybrid systems that satisfy temporal specificat...
research
10/06/2021

From STL Rulebooks to Rewards

The automatic synthesis of neural-network controllers for autonomous age...
research
03/23/2021

Assured Learning-enabled Autonomy: A Metacognitive Reinforcement Learning Framework

Reinforcement learning (RL) agents with pre-specified reward functions c...
research
05/16/2019

A Correctness Result for Synthesizing Plans With Loops in Stochastic Domains

Finite-state controllers (FSCs), such as plans with loops, are powerful ...
research
03/08/2022

Distributed Control using Reinforcement Learning with Temporal-Logic-Based Reward Shaping

We present a computational framework for synthesis of distributed contro...
research
01/21/2022

Deep Q-learning: a robust control approach

In this paper, we place deep Q-learning into a control-oriented perspect...
