Safer Reinforcement Learning through Transferable Instinct Networks

07/14/2021
by Djordje Grbic, et al.

Random exploration is one of the main mechanisms through which reinforcement learning (RL) finds well-performing policies. However, it can lead to undesirable or catastrophic outcomes when learning online in safety-critical environments. In fact, safe learning is one of the major obstacles to real-world agents that can learn during deployment. One way of ensuring that agents respect hard limitations is to explicitly configure boundaries within which they can operate. While this might work in some cases, we do not always have clear a priori information about which states and actions can lead dangerously close to hazardous states. Here, we present an approach where an additional policy can override the main policy and offer a safer alternative action. In our instinct-regulated RL (IR^2L) approach, an "instinctual" network is trained to recognize undesirable situations while guarding the learning policy against entering them. The instinct network is pre-trained on a single task where it is safe to make mistakes, and then transferred to environments in which learning a new task safely is critical. We demonstrate IR^2L in the OpenAI Safety Gym domain, where it incurs significantly fewer safety violations during training than a baseline RL approach while reaching similar task performance.
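
The override idea described in the abstract can be illustrated with a minimal sketch. The abstract does not specify how the instinct network's output is combined with the main policy's action, so everything below is an assumption made for illustration: the names `InstinctOutput`, `regulated_action`, `toy_policy`, and `toy_instinct`, the scalar danger score, and the hard threshold are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import Callable, Sequence, Tuple

Action = Tuple[float, ...]
Observation = Sequence[float]


@dataclass
class InstinctOutput:
    """Hypothetical instinct output (an assumption, not the paper's interface)."""
    danger: float        # assumed score in [0, 1]; higher means more hazardous
    safe_action: Action  # alternative action proposed by the instinct


def regulated_action(
    obs: Observation,
    policy: Callable[[Observation], Action],
    instinct: Callable[[Observation, Action], InstinctOutput],
    threshold: float = 0.5,  # assumed hard gate; the real rule may differ
) -> Tuple[Action, bool]:
    """Return the action to execute and whether the instinct overrode the policy."""
    proposed = policy(obs)
    out = instinct(obs, proposed)
    if out.danger >= threshold:
        return out.safe_action, True
    return proposed, False


# Toy 1-D example: the agent's position is obs[0] and a hazard lies near x = 1.0.
def toy_policy(obs: Observation) -> Action:
    # Stand-in for the learning policy: always step toward larger x.
    return (0.2,)


def toy_instinct(obs: Observation, proposed: Action) -> InstinctOutput:
    # Stand-in for a pre-trained instinct: flag moves that would end near the hazard
    # and propose stepping in the opposite direction instead.
    next_x = obs[0] + proposed[0]
    danger = 1.0 if next_x >= 0.9 else 0.0
    return InstinctOutput(danger=danger, safe_action=(-proposed[0],))


if __name__ == "__main__":
    for x in (0.0, 0.8):
        action, overridden = regulated_action([x], toy_policy, toy_instinct)
        print(f"x={x}: action={action}, overridden={overridden}")
```

In this sketch the instinct is a fixed function, which loosely mirrors the abstract's setup of an instinct that is pre-trained where mistakes are safe and then transferred alongside a policy that is still learning.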

research
02/21/2021

Safe Reinforcement Learning Using Robust Action Governor

Reinforcement Learning (RL) is essentially a trial-and-error learning pr...
research
06/05/2023

Conformal Predictive Safety Filter for RL Controllers in Dynamic Environments

The interest in using reinforcement learning (RL) controllers in safety-...
research
02/19/2022

Learning a Shield from Catastrophic Action Effects: Never Repeat the Same Mistake

Agents that operate in an unknown environment are bound to make mistakes...
research
05/06/2020

Safe Reinforcement Learning through Meta-learned Instincts

An important goal in reinforcement learning is to create agents that can...
research
09/19/2022

Measuring Interventional Robustness in Reinforcement Learning

Recent work in reinforcement learning has focused on several characteris...
research
07/15/2021

Minimizing Safety Interference for Safe and Comfortable Automated Driving with Distributional Reinforcement Learning

Despite recent advances in reinforcement learning (RL), its application ...
research
02/02/2023

Imitating careful experts to avoid catastrophic events

RL is increasingly being used to control robotic systems that interact c...