Safe Exploration in Model-based Reinforcement Learning using Control Barrier Functions

04/16/2021
by   Max H. Cohen, et al.
0

This paper studies the problem of developing an approximate dynamic programming (ADP) framework for learning online the value function of an infinite-horizon optimal problem while obeying safety constraints expressed as control barrier functions (CBFs). Our approach is facilitated by the development of a novel class of CBFs, termed Lyapunov-like CBFs (LCBFs), that retain the beneficial properties of CBFs for developing minimally-invasive safe control policies while also possessing desirable Lyapunov-like qualities such as positive semi-definiteness. We show how these LCBFs can be used to augment a learning-based control policy so as to guarantee safety and then leverage this approach to develop a safe exploration framework in a model-based reinforcement learning setting. We demonstrate that our developed approach can handle more general safety constraints than state-of-the-art safe ADP methods through a variety of numerical examples.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/11/2021

Safe Model-Based Reinforcement Learning Using Robust Control Barrier Functions

Reinforcement Learning (RL) is effective in many scenarios. However, it ...
research
02/24/2021

Towards Safe Continuing Task Reinforcement Learning

Safety is a critical feature of controller design for physical systems. ...
research
03/23/2019

Temporal Logic Guided Safe Reinforcement Learning Using Control Barrier Functions

Using reinforcement learning to learn control policies is a challenge wh...
research
06/12/2020

SAMBA: Safe Model-Based Active Reinforcement Learning

In this paper, we propose SAMBA, a novel framework for safe reinforcemen...
research
11/04/2021

Infinite Time Horizon Safety of Bayesian Neural Networks

Bayesian neural networks (BNNs) place distributions over the weights of ...
research
11/26/2019

Control-Tutored Reinforcement Learning: an application to the Herding Problem

In this extended abstract we introduce a novel control-tutored Q-learni...
research
09/26/2022

FORESEE: Model-based Reinforcement Learning using Unscented Transform with application to Tuning of Control Barrier Functions

In this paper, we introduce a novel online model-based reinforcement lea...

Please sign up or login with your details

Forgot password? Click here to reset