A reinforcement learning approach to hybrid control design

09/02/2020
by Meet Gandhi, et al.

In this paper we design hybrid control policies for hybrid systems whose mathematical models are unknown. Our contributions are threefold. First, we propose a framework for modelling the hybrid control design problem as a single Markov Decision Process (MDP). This formulation allows off-the-shelf algorithms from the Reinforcement Learning (RL) literature to be applied to the design of optimal control policies. Second, we model a set of benchmark hybrid control design problems in the proposed MDP framework. Third, we adapt the recently proposed Proximal Policy Optimisation (PPO) algorithm to the hybrid action space and apply it to these benchmark problems. In each case, we observe that the algorithm converges and finds the optimal policy.
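
To make the idea of a single MDP with a hybrid action space concrete, the following is a minimal sketch and not the authors' formulation. The abstract does not specify the benchmark systems or how PPO is adapted, so everything here is an assumption made for illustration: the two-mode scalar system `ToyHybridEnv`, the `HybridAction` container, the quadratic reward, and the factorised policy (a categorical distribution over modes times a Gaussian over the continuous input). Only the clipped surrogate term itself is the standard PPO expression.

```python
import math
import random
from dataclasses import dataclass


@dataclass
class HybridAction:
    mode: int   # discrete part: which controller mode to use (hypothetical)
    u: float    # continuous part: control input applied in that mode


class ToyHybridEnv:
    """Hypothetical two-mode scalar system, for illustration only.

    mode 0: x <- x + dt * (-x + u)
    mode 1: x <- x + dt * (0.5 * x + u)
    """

    def __init__(self, dt=0.1, horizon=50, seed=0):
        self.dt = dt
        self.horizon = horizon
        self.rng = random.Random(seed)
        self.x = 0.0
        self.t = 0

    def reset(self):
        self.x = self.rng.uniform(-1.0, 1.0)
        self.t = 0
        return self.x

    def step(self, action):
        # Clip the continuous input to an assumed actuator range [-1, 1].
        u = max(-1.0, min(1.0, action.u))
        drift = -self.x if action.mode == 0 else 0.5 * self.x
        self.x += self.dt * (drift + u)
        self.t += 1
        # Assumed reward: penalise distance from the origin and control effort.
        reward = -(self.x ** 2) - 0.01 * u ** 2
        done = self.t >= self.horizon
        return self.x, reward, done


def joint_log_prob(mode_probs, mean, std, action):
    """Log-probability of a hybrid action under an assumed factorised policy:
    categorical over modes times a Gaussian over the continuous input."""
    log_p_mode = math.log(mode_probs[action.mode])
    z = (action.u - mean) / std
    log_p_u = -0.5 * z * z - math.log(std) - 0.5 * math.log(2 * math.pi)
    return log_p_mode + log_p_u


def ppo_clipped_term(logp_new, logp_old, advantage, eps=0.2):
    """Standard PPO clipped surrogate for one sample, using joint log-probs."""
    ratio = math.exp(logp_new - logp_old)
    clipped = max(1.0 - eps, min(1.0 + eps, ratio))
    return min(ratio * advantage, clipped * advantage)
```

Under this assumed factorisation, the discrete and continuous parts of the action enter the PPO objective only through the sum of their log-probabilities, so the probability ratio and clipping step are the same as in the purely discrete or purely continuous case.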

research
04/01/2022

Hysteresis-Based RL: Robustifying Reinforcement Learning-based Control Policies via Hybrid Control

Reinforcement learning (RL) is a promising approach for deriving control...
research
02/20/2021

Importance of Environment Design in Reinforcement Learning: A Study of a Robotic Environment

An in-depth understanding of the particular environment is crucial in re...
research
08/08/2012

Hybrid systems modeling for gas transmission network

Gas Transmission Networks are large-scale complex systems, and correspon...
research
11/03/2020

Control with adaptive Q-learning

This paper evaluates adaptive Q-learning (AQL) and single-partition adap...
research
05/01/2022

Processing Network Controls via Deep Reinforcement Learning

Novel advanced policy gradient (APG) algorithms, such as proximal policy...
research
01/03/2022

Hybrid intelligence for dynamic job-shop scheduling with deep reinforcement learning and attention mechanism

The dynamic job-shop scheduling problem (DJSP) is a class of scheduling ...
research
01/02/2022

Reinforcement Learning for Task Specifications with Action-Constraints

In this paper, we use concepts from supervisory control theory of discre...
