Measuring Progress in Deep Reinforcement Learning Sample Efficiency

02/09/2021
by   Florian E. Dorner, et al.
0

Sampled environment transitions are a critical input to deep reinforcement learning (DRL) algorithms. Current DRL benchmarks often allow for the cheap and easy generation of large amounts of samples such that perceived progress in DRL does not necessarily correspond to improved sample efficiency. As simulating real world processes is often prohibitively hard and collecting real world experience is costly, sample efficiency is an important indicator for economically relevant applications of DRL. We investigate progress in sample efficiency on Atari games and continuous control tasks by comparing the number of samples that a variety of algorithms need to reach a given performance level according to training curves in the corresponding publications. We find exponential progress in sample efficiency with estimated doubling times of around 10 to 18 months on Atari, 5 to 24 months on state-based continuous control and of around 4 to 9 months on pixel-based continuous control depending on the specific task and performance level.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/06/2021

Deep Reinforcement Learning with Quantum-inspired Experience Replay

In this paper, a novel training paradigm inspired by quantum computation...
research
10/15/2020

Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control

While Deep Reinforcement Learning (DRL) has emerged as a promising appro...
research
10/16/2022

The Impact of Task Underspecification in Evaluating Deep Reinforcement Learning

Evaluations of Deep Reinforcement Learning (DRL) methods are an integral...
research
01/27/2023

Neural Episodic Control with State Abstraction

Existing Deep Reinforcement Learning (DRL) algorithms suffer from sample...
research
07/07/2021

Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research

Deep Reinforcement Learning (DRL) is considered a potential framework to...
research
10/19/2019

Towards More Sample Efficiency inReinforcement Learning with Data Augmentation

Deep reinforcement learning (DRL) is a promising approach for adaptive r...
research
12/13/2017

Multi-focus Attention Network for Efficient Deep Reinforcement Learning

Deep reinforcement learning (DRL) has shown incredible performance in le...

Please sign up or login with your details

Forgot password? Click here to reset