Deep Reinforcement Learning Based Networked Control with Network Delays for Signal Temporal Logic Specifications

08/03/2021
by Junya Ikemoto, et al.

We present a novel deep reinforcement learning (DRL)-based design of a networked controller with network delays for signal temporal logic (STL) specifications. We consider the case in which both the system dynamics and the network delays are unknown. The satisfaction of an STL formula depends not only on the current state but also on the behavior of the system over time. We therefore propose an extension of the Markov decision process (MDP), called a τδ-MDP, with which the satisfaction of the STL formula can be evaluated under the network delays. We then construct deep neural networks based on the τδ-MDP and propose a learning algorithm. Finally, we demonstrate the learning performance of the proposed algorithm through simulations.
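
To illustrate why the current state alone is not enough, the following minimal sketch keeps a sliding window of recent states and actions and scores a simple "always stay in a band" requirement over that window. It is not the authors' construction: the abstract does not define the τδ-MDP, so reading τ as a number of remembered states and δ as a number of remembered actions is an assumption, and the names `HistoryState` and `robustness_always_in_band` are hypothetical.

```python
from collections import deque


def robustness_always_in_band(window, low, high):
    """Quantitative score for 'x stays in [low, high]' over a window.

    Positive means every state in the window is inside the band;
    negative means at least one state violates it.
    """
    return min(min(x - low, high - x) for x in window)


class HistoryState:
    """Hypothetical extended state: last `tau` states and last `delta` actions.

    Assumption for illustration only; the abstract does not give
    the actual definition used in the paper.
    """

    def __init__(self, tau, delta, initial_state, initial_action=0.0):
        # Pre-fill both buffers so the extended state has a fixed size.
        self.states = deque([initial_state] * tau, maxlen=tau)
        self.actions = deque([initial_action] * delta, maxlen=delta)

    def step(self, new_state, new_action):
        # Appending to a full deque drops the oldest entry automatically.
        self.states.append(new_state)
        self.actions.append(new_action)
        return tuple(self.states) + tuple(self.actions)


history = HistoryState(tau=3, delta=2, initial_state=0.0)
extended = history.step(1.0, 0.5)
score_inside = robustness_always_in_band(history.states, -1.0, 2.0)

history.step(3.0, 0.1)  # this state leaves the band [-1, 2]
score_violated = robustness_always_in_band(history.states, -1.0, 2.0)
```

In this sketch the score is computed from the whole window, so a policy that sees only the latest state could not tell whether the requirement still holds. Feeding the flattened window to a neural network is one plausible way to restore that information.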


