Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement Learning

01/30/2023
by   James Queeney, et al.
0

Many real-world domains require safe decision making in the presence of uncertainty. In this work, we propose a deep reinforcement learning framework for approaching this important problem. We consider a risk-averse perspective towards model uncertainty through the use of coherent distortion risk measures, and we show that our formulation is equivalent to a distributionally robust safe reinforcement learning problem with robustness guarantees on performance and safety. We propose an efficient implementation that only requires access to a single training environment, and we demonstrate that our framework produces robust, safe performance on a variety of continuous control tasks with safety constraints in the Real-World Reinforcement Learning Suite.

READ FULL TEXT
research
01/31/2023

Optimal Transport Perturbations for Safe Reinforcement Learning with Robustness Guarantees

Robustness and safety are critical for the trustworthy deployment of dee...
research
06/14/2022

Robust Reinforcement Learning with Distributional Risk-averse formulation

Robust Reinforcement Learning tries to make predictions more robust to c...
research
09/22/2017

OptLayer - Practical Constrained Optimization for Deep Reinforcement Learning in the Real World

While deep reinforcement learning techniques have recently produced cons...
research
05/18/2023

Bayesian Risk-Averse Q-Learning with Streaming Observations

We consider a robust reinforcement learning problem, where a learning ag...
research
09/16/2022

Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities: Robustness, Safety, and Generalizability

A trustworthy reinforcement learning algorithm should be competent in so...
research
08/13/2021

Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning

The last half-decade has seen a steep rise in the number of contribution...
research
10/09/2019

Ctrl-Z: Recovering from Instability in Reinforcement Learning

When learning behavior, training data is often generated by the learner ...

Please sign up or login with your details

Forgot password? Click here to reset