Safe reinforcement learning for probabilistic reachability and safety specifications: A Lyapunov-based approach

02/24/2020
by Subin Huh, et al.

Emerging applications in robotics and autonomous systems, such as autonomous driving and robotic surgery, often involve critical safety constraints that must be satisfied even when information about system models is limited. To address this, we propose a model-free safety specification method that learns the maximal probability of safe operation by carefully combining probabilistic reachability analysis and safe reinforcement learning (RL). Our approach constructs a Lyapunov function with respect to a safe policy to restrain each policy improvement stage. As a result, it yields a sequence of safe policies that determine the range of safe operation, called the safe set, which monotonically expands and gradually converges. We also develop an efficient safe exploration scheme that accelerates the process of identifying the safety of unexamined states. By exploiting Lyapunov shielding, our method regulates the exploratory policy to avoid dangerous states with high confidence. To handle high-dimensional systems, we further extend our approach to deep RL by introducing a Lagrangian relaxation technique to establish a tractable actor-critic algorithm. The empirical performance of our method is demonstrated through continuous control benchmark problems, such as a reaching task on a planar robot arm.
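The notions of "maximal probability of safe operation" and "safe set" can be made concrete with a small sketch. The code below is not the paper's model-free, Lyapunov-based algorithm. It is a plain finite-horizon dynamic program on a hypothetical five-state chain whose transition model is assumed known, and it only illustrates what quantity such a method would be estimating. The chain, the slip probability, the horizon, the tolerance `alpha`, and all function names are assumptions introduced for illustration.

```python
def build_chain(n_states=5, slip=0.1):
    """Hypothetical chain MDP: P[s][a] maps next_state -> probability.

    State 0 is unsafe and absorbing. Action 0 tries to move left,
    action 1 tries to move right; with probability `slip` the move
    goes the opposite way. The right end of the chain is a wall.
    """
    P = []
    for s in range(n_states):
        if s == 0:
            P.append([{0: 1.0}, {0: 1.0}])  # unsafe state is absorbing
            continue
        left, right = s - 1, min(s + 1, n_states - 1)
        P.append([
            {left: 1 - slip, right: slip},   # action 0: intended left
            {right: 1 - slip, left: slip},   # action 1: intended right
        ])
    return P


def max_safety_probability(P, unsafe, horizon):
    """Maximal probability of avoiding `unsafe` for `horizon` steps.

    Backward recursion: V(s) = 0 for unsafe s, otherwise
    V(s) = max_a sum_{s'} P(s' | s, a) * V_next(s').
    """
    n = len(P)
    V = [0.0 if s in unsafe else 1.0 for s in range(n)]
    for _ in range(horizon):
        V = [
            0.0 if s in unsafe else max(
                sum(p * V[s2] for s2, p in P[s][a].items())
                for a in range(len(P[s]))
            )
            for s in range(n)
        ]
    return V


def safe_set(V, alpha):
    """States whose maximal safety probability is at least 1 - alpha."""
    return [s for s, v in enumerate(V) if v >= 1 - alpha]


if __name__ == "__main__":
    P = build_chain()
    V = max_safety_probability(P, unsafe={0}, horizon=3)
    print("safety probabilities:", [round(v, 3) for v in V])
    print("safe set (alpha=0.05):", safe_set(V, alpha=0.05))
```

In this toy setting the state next to the unsafe one is excluded from the safe set because a single slip ends the episode there. A model-free method of the kind the abstract describes would have to estimate these probabilities from sampled transitions rather than from `P`.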

research
07/04/2022

Safe Reinforcement Learning via Confidence-Based Filters

Ensuring safety is a crucial challenge when deploying reinforcement lear...
research
10/14/2021

Safety-aware Policy Optimisation for Autonomous Racing

To be viable for safety-critical applications, such as autonomous drivin...
research
05/16/2023

Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions

Reinforcement learning (RL) exhibits impressive performance when managin...
research
06/17/2022

SafeRL-Kit: Evaluating Efficient Reinforcement Learning Methods for Safe Autonomous Driving

Safe reinforcement learning (RL) has achieved significant success on ris...
research
10/20/2021

Bootstrapping confidence in future safety based on past safe operation

With autonomous vehicles (AVs), a major concern is the inability to give...
research
06/15/2020

Neural Certificates for Safe Control Policies

This paper develops an approach to learn a policy of a dynamical system ...
research
04/27/2023

Appropriateness is all you need!

The strive to make AI applications "safe" has led to the development of ...
