Reinforcement Learning Control of Constrained Dynamic Systems with Uniformly Ultimate Boundedness Stability Guarantee

11/13/2020
by Minghao Han, et al.

Reinforcement learning (RL) is promising for complicated stochastic nonlinear control problems. Without using a mathematical model, an optimal controller can be learned through trial and error from data evaluated against given performance criteria. However, data-based learning is notorious for not guaranteeing stability, the most fundamental property of any control system. In this paper, the classic Lyapunov method is explored to analyze uniformly ultimate boundedness (UUB) stability based solely on data, without using a mathematical model. It is further shown how RL with a UUB guarantee can be applied to control dynamic systems with safety constraints. Based on the theoretical results, both off-policy and on-policy learning algorithms are proposed. As a result, optimal controllers can be learned that guarantee UUB of the closed-loop system both at convergence and during learning. The proposed algorithms are evaluated on a series of robotic continuous control tasks with safety constraints. Compared with existing RL algorithms, the proposed method achieves superior performance in terms of maintaining safety. As a qualitative evaluation of stability, our method shows impressive resilience even in the presence of external disturbances.
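
For readers unfamiliar with the term, the following is a sketch of the standard textbook definition of uniform ultimate boundedness for a deterministic system with state x(t). It is given here only as background: the symbols a, b, c, T and t_0 are the usual textbook notation and are not taken from this abstract, and the paper's own data-based, stochastic formulation may differ.

```latex
% Textbook (deterministic) definition, stated as background only.
% The solutions x(t) are uniformly ultimately bounded with ultimate bound b
% if there exist constants b > 0 and c > 0, independent of t_0, such that
% for every a \in (0, c) there is a time T = T(a, b) \ge 0 with
\[
  \|x(t_0)\| \le a
  \;\Longrightarrow\;
  \|x(t)\| \le b \quad \text{for all } t \ge t_0 + T .
\]
```

Informally, every trajectory that starts within radius a enters the ball of radius b after at most T time units and stays there, which is a weaker requirement than convergence to an equilibrium.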

research
04/29/2020

Actor-Critic Reinforcement Learning for Control with Stability Guarantee

Deep Reinforcement Learning (DRL) has achieved impressive performance in...
research
08/31/2022

A stabilizing reinforcement learning approach for sampled systems with partially unknown models

Reinforcement learning is commonly associated with training of reward-ma...
research
12/18/2021

Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles

Recently, barrier function-based safe reinforcement learning (RL) with t...
research
07/11/2021

Stabilizing Neural Control Using Self-Learned Almost Lyapunov Critics

The lack of stability guarantee restricts the practical use of learning-...
research
03/05/2021

Lyapunov-Regularized Reinforcement Learning for Power System Transient Stability

Transient stability of power systems is becoming increasingly important ...
research
05/13/2021

Online Algorithms and Policies Using Adaptive and Machine Learning Approaches

This paper considers the problem of real-time control and learning in dy...
research
12/20/2019

Soft Q-network

When DQN was announced by DeepMind in 2013, the whole world was surprised ...
