Investigating Generalisation in Continuous Deep Reinforcement Learning

02/19/2019
by   Chenyang Zhao, et al.

Deep Reinforcement Learning has shown great success in a variety of control tasks. However, it is unclear how close we are to the vision of putting Deep RL into practice to solve real-world problems. In particular, common practice in the field is to train policies on largely deterministic simulators and to evaluate algorithms through training performance alone, without a train/test distinction to ensure that models generalise and are not overfitted. Moreover, it is not standard practice to check for generalisation under domain shift, although robustness to such system change between training and testing would be necessary for real-world Deep RL control, for example in robotics. In this paper we study these issues by first characterising the sources of uncertainty that pose generalisation challenges in Deep RL. We then provide a new benchmark and a thorough empirical evaluation of generalisation challenges for state-of-the-art Deep RL methods. In particular, we show that, if generalisation is the goal, then the common practice of evaluating algorithms based on their training performance leads to the wrong conclusions about algorithm choice. Finally, we evaluate several techniques for improving generalisation and draw conclusions about the most robust techniques to date.
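The train/test distinction under domain shift described in the abstract can be illustrated with a minimal sketch. This is not the paper's benchmark or code: the toy 1-D point-mass dynamics, the fixed controller standing in for a trained policy, and every name here (`rollout`, `evaluate`, `train_masses`, `test_masses`) are assumptions made purely for illustration.

```python
import random
from statistics import mean


def rollout(gain, mass, noise_std, rng, steps=50, dt=0.1):
    """Return the total reward of one episode on a toy 1-D point mass.

    Hypothetical environment: the reward at each step is the negative
    squared distance from the origin.
    """
    pos, vel, total = 1.0, 0.0, 0.0
    for _ in range(steps):
        # Fixed controller standing in for a trained policy (assumption).
        force = -gain * pos - vel
        # Optional actuation noise models a stochastic test-time system.
        force += rng.gauss(0.0, noise_std)
        vel += (force / mass) * dt
        pos += vel * dt
        total -= pos * pos
    return total


def evaluate(gain, masses, noise_std=0.0, episodes=5, seed=0):
    """Average return over several episodes for each mass setting."""
    rng = random.Random(seed)
    return mean(
        rollout(gain, m, noise_std, rng)
        for m in masses
        for _ in range(episodes)
    )


# Training condition: a single, deterministic system configuration.
train_masses = [1.0]
# Test condition: shifted dynamics (unseen masses) plus actuation noise.
test_masses = [0.5, 2.0, 4.0]

gain = 2.0
train_score = evaluate(gain, train_masses)
test_score = evaluate(gain, test_masses, noise_std=0.1)
gap = train_score - test_score

print(f"train return: {train_score:.3f}")
print(f"test return:  {test_score:.3f}")
print(f"gap:          {gap:.3f}")
```

Reporting the test return and the gap alongside the training return, rather than the training return alone, is the kind of evaluation protocol the abstract argues for. A real study would replace the toy system with a simulator and the fixed controller with learned policies.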
