Benchmark for Out-of-Distribution Detection in Deep Reinforcement Learning

12/05/2021
by Aaqib Parvez Mohammed, et al.

Reinforcement Learning (RL) based solutions are being adopted in a variety of domains, including robotics, health care and industrial automation. Most attention goes to the conditions under which these solutions work well, yet RL policies share a weakness common to most machine learning models: they can fail when presented with out-of-distribution (OOD) inputs. OOD detection for RL is not well covered in the literature, and benchmarks for the task are lacking. In this work we propose a benchmark to evaluate OOD detection methods in an RL setting. OOD data is generated by modifying the physical parameters of standard non-visual environments or by corrupting the state observations of visual environments. We discuss ways to generate custom RL environments that can produce OOD data, and evaluate three uncertainty methods on the OOD detection task. Our results show that ensemble methods have the best OOD detection performance, with a lower standard deviation across multiple environments.
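
To make the ensemble idea concrete, the following is a minimal sketch under stated assumptions, not the benchmark's implementation. It uses a toy 2-D state instead of a real RL environment, a made-up target function (`target_action`) standing in for a trained policy, and an ensemble of bootstrap polynomial regressors instead of deep networks. OOD inputs are produced by corrupting observations with Gaussian noise; modifying physical parameters is not shown. All function names, the noise levels and the use of AUROC as the score are illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)

def features(x):
    """Cubic polynomial features of a 2-D state (hypothetical model class)."""
    x0, x1 = x[:, 0], x[:, 1]
    return np.stack(
        [np.ones_like(x0), x0, x1, x0**2, x0 * x1, x1**2, x0**3, x1**3], axis=1
    )

def target_action(x):
    """Made-up stand-in for the action a trained policy would output."""
    return np.sin(3.0 * x[:, 0]) + x[:, 1] ** 2

def fit_ensemble(x, y, n_members=10):
    """Fit each member by least squares on its own bootstrap resample."""
    members = []
    for _ in range(n_members):
        idx = rng.integers(0, len(x), size=len(x))
        w, *_ = np.linalg.lstsq(features(x[idx]), y[idx], rcond=None)
        members.append(w)
    return np.stack(members)  # shape: (n_members, n_features)

def ood_score(members, x):
    """Ensemble disagreement: std of member predictions for each state."""
    preds = features(x) @ members.T  # shape: (n_states, n_members)
    return preds.std(axis=1)

def auroc(scores_id, scores_ood):
    """Probability that a random OOD state scores above a random ID state."""
    diff = scores_ood[:, None] - scores_id[None, :]
    return float((diff > 0).mean() + 0.5 * (diff == 0).mean())

# In-distribution training data: states in [-1, 1]^2 with noisy targets.
x_train = rng.uniform(-1.0, 1.0, size=(200, 2))
y_train = target_action(x_train) + rng.normal(0.0, 0.1, size=200)
members = fit_ensemble(x_train, y_train)

# Test states: clean (ID) versus corrupted with Gaussian noise (OOD).
x_id = rng.uniform(-1.0, 1.0, size=(500, 2))
x_ood = x_id + rng.normal(0.0, 2.0, size=x_id.shape)

auroc_value = auroc(ood_score(members, x_id), ood_score(members, x_ood))
print(f"AUROC of ensemble disagreement: {auroc_value:.3f}")
```

An AUROC near 0.5 would mean the disagreement score cannot separate clean from corrupted states, while values approaching 1.0 indicate good separation. In this sketch the separation comes from the members extrapolating differently outside the training range.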


research
09/27/2021

MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

The progress in deep reinforcement learning (RL) is heavily driven by th...
research
01/19/2023

A Survey of Meta-Reinforcement Learning

While deep reinforcement learning (RL) has fueled multiple high-profile ...
research
11/14/2018

Natural Environment Benchmarks for Reinforcement Learning

While current benchmark reinforcement learning (RL) tasks have been usef...
research
07/11/2021

Out-of-Distribution Dynamics Detection: RL-Relevant Benchmarks and Results

We study the problem of out-of-distribution dynamics (OODD) detection, w...
research
03/08/2021

Comparing Popular Simulation Environments in the Scope of Robotics and Reinforcement Learning

This letter compares the performance of four different, popular simulati...
research
06/01/2023

Augmented Modular Reinforcement Learning based on Heterogeneous Knowledge

In order to mitigate some of the inefficiencies of Reinforcement Learnin...
research
06/29/2020

Concept and the implementation of a tool to convert industry 4.0 environments modeled as FSM to an OpenAI Gym wrapper

Industry 4.0 systems have a high demand for optimization in their tasks,...
