Safe Reinforcement Learning with Chance-constrained Model Predictive Control

12/27/2021
by   Samuel Pfrommer, et al.
0

Real-world reinforcement learning (RL) problems often demand that agents behave safely by obeying a set of designed constraints. We address the challenge of safe RL by coupling a safety guide based on model predictive control (MPC) with a modified policy gradient framework in a linear setting with continuous actions. The guide enforces safe operation of the system by embedding safety requirements as chance constraints in the MPC formulation. The policy gradient training step then includes a safety penalty which trains the base policy to behave safely. We show theoretically that this penalty allows for the safety guide to be removed after training and illustrate our method using experiments with a simulator quadrotor.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/06/2023

Safe Reinforcement Learning via Probabilistic Logic Shields

Safe Reinforcement learning (Safe RL) aims at learning optimal policies ...
research
04/02/2020

Safe Reinforcement Learning via Projection on a Safe Set: How to Achieve Optimality?

For all its successes, Reinforcement Learning (RL) still struggles to de...
research
07/26/2023

Reinforcement Learning by Guided Safe Exploration

Safety is critical to broadening the application of reinforcement learni...
research
07/19/2021

Constrained Policy Gradient Method for Safe and Fast Reinforcement Learning: a Neural Tangent Kernel Based Approach

This paper presents a constrained policy gradient algorithm. We introduc...
research
12/13/2018

Safe exploration of nonlinear dynamical systems: A predictive safety filter for reinforcement learning

Despite fast progress in Reinforcement Learning (RL), the transfer into ...
research
11/11/2019

Driving Reinforcement Learning with Models

Over the years, Reinforcement Learning (RL) established itself as a conv...
research
04/06/2021

MPC-based Reinforcement Learning for Economic Problems with Application to Battery Storage

In this paper, we are interested in optimal control problems with purely...

Please sign up or login with your details

Forgot password? Click here to reset