Learning a Low-dimensional Representation of a Safe Region for Safe Reinforcement Learning on Dynamical Systems

10/19/2020
by Zhehua Zhou, et al.

To apply reinforcement learning algorithms safely to high-dimensional nonlinear dynamical systems, a simplified system model is used to formulate a safe reinforcement learning framework. Based on the simplified system model, a low-dimensional representation of the safe region is identified and used to provide safety estimates for learning algorithms. However, finding a satisfactory simplified system model for a complex dynamical system usually requires considerable effort. To overcome this limitation, we propose a general data-driven approach that efficiently learns a low-dimensional representation of the safe region. Through an online adaptation method, the low-dimensional representation is updated with feedback data so that more accurate safety estimates are obtained. The performance of the proposed approach in identifying the low-dimensional representation of the safe region is demonstrated on a quadcopter example. The results show that, compared to previous work, the approach yields a more reliable and representative low-dimensional representation of the safe region, which extends the applicability of the safe reinforcement learning framework.

research
09/10/2021

Data Generation Method for Learning a Low-dimensional Safe Region in Safe Reinforcement Learning

Safe reinforcement learning aims to learn a control policy while ensurin...
research
08/23/2023

How Safe Am I Given What I See? Calibrated Prediction of Safety Chances for Image-Controlled Autonomy

End-to-end learning has emerged as a major paradigm for developing auton...
research
11/24/2020

Safely Learning Dynamical Systems from Short Trajectories

A fundamental challenge in learning to control an unknown dynamical syst...
research
12/13/2018

Safe exploration of nonlinear dynamical systems: A predictive safety filter for reinforcement learning

Despite fast progress in Reinforcement Learning (RL), the transfer into ...
research
02/02/2023

Convolutional Autoencoders, Clustering and POD for Low-dimensional Parametrization of Navier-Stokes Equations

Simulations of large-scale dynamical systems require expensive computati...
research
06/12/2020

SAMBA: Safe Model-Based Active Reinforcement Learning

In this paper, we propose SAMBA, a novel framework for safe reinforcemen...
research
01/24/2022

Scalable Safe Exploration for Global Optimization of Dynamical Systems

Learning optimal control policies directly on physical systems is challe...
