Is Deep Reinforcement Learning Really Superhuman on Atari?

08/13/2019
by   Marin Toromanoff, et al.
7

Consistent and reproducible evaluation of Deep Reinforcement Learning (DRL) is not straightforward. In the Arcade Learning Environment (ALE), small changes in environment parameters such as stochasticity or the maximum allowed play time can lead to very different performance. In this work, we discuss the difficulties of comparing different agents trained on ALE. In order to take a step further towards reproducible and comparable DRL, we introduce SABER, a Standardized Atari BEnchmark for general Reinforcement learning algorithms. Our methodology extends previous recommendations and contains a complete set of environment parameters as well as train and test procedures. We then use SABER to evaluate the current state of the art, Rainbow. Furthermore, we introduce a human world records baseline, and argue that previous claims of expert or superhuman performance of DRL might not be accurate. Finally, we propose Rainbow-IQN by extending Rainbow with Implicit Quantile Networks (IQN) leading to new state-of-the-art performance. Source code will be made available for reproducibility.

READ FULL TEXT

page 2

page 7

page 8

page 14

research
06/24/2019

Modern Deep Reinforcement Learning Algorithms

Recent advances in Reinforcement Learning, grounded on combining classic...
research
12/26/2018

A New Concept of Deep Reinforcement Learning based Augmented General Sequence Tagging System

In this paper, a new deep reinforcement learning based augmented general...
research
12/19/2020

Minimax Strikes Back

Deep Reinforcement Learning (DRL) reaches a superhuman level of play in ...
research
11/22/2020

Distributed Deep Reinforcement Learning: An Overview

Deep reinforcement learning (DRL) is a very active research area. Howeve...
research
06/30/2020

Evaluating the Performance of Reinforcement Learning Algorithms

Performance evaluations are critical for quantifying algorithmic advance...
research
07/02/2021

SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents

Building embodied autonomous agents capable of participating in social i...
research
07/16/2018

Toward Interpretable Deep Reinforcement Learning with Linear Model U-Trees

Deep Reinforcement Learning (DRL) has achieved impressive success in man...

Please sign up or login with your details

Forgot password? Click here to reset