A Review for Deep Reinforcement Learning in Atari:Benchmarks, Challenges, and Solutions

12/08/2021
by   Jiajun Fan, et al.
0

The Arcade Learning Environment (ALE) is proposed as an evaluation platform for empirically assessing the generality of agents across dozens of Atari 2600 games. ALE offers various challenging problems and has drawn significant attention from the deep reinforcement learning (RL) community. From Deep Q-Networks (DQN) to Agent57, RL agents seem to achieve superhuman performance in ALE. However, is this the case? In this paper, to explore this problem, we first review the current evaluation metrics in the Atari benchmarks and then reveal that the current evaluation criteria of achieving superhuman performance are inappropriate, which underestimated the human performance relative to what is possible. To handle those problems and promote the development of RL research, we propose a novel Atari benchmark based on human world records (HWR), which puts forward higher requirements for RL agents on both final performance and learning efficiency. Furthermore, we summarize the state-of-the-art (SOTA) methods in Atari benchmarks and provide benchmark results over new evaluation metrics based on human world records. We concluded that at least four open challenges hinder RL agents from achieving superhuman performance from those new benchmark results. Finally, we also discuss some promising ways to handle those problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/29/2018

Assessing Generalization in Deep Reinforcement Learning

Deep reinforcement learning (RL) has achieved breakthrough results on ma...
research
09/21/2022

Evaluation of Look-ahead Economic Dispatch Using Reinforcement Learning

Modern power systems are experiencing a variety of challenges driven by ...
research
12/06/2018

ToyBox: Better Atari Environments for Testing Reinforcement Learning Agents

It is a widely accepted principle that software without tests has bugs. ...
research
06/02/2023

Deep Q-Learning versus Proximal Policy Optimization: Performance Comparison in a Material Sorting Task

This paper presents a comparison between two well-known deep Reinforceme...
research
08/30/2021

Deep Reinforcement Learning at the Edge of the Statistical Precipice

Deep reinforcement learning (RL) algorithms are predominantly evaluated ...
research
11/09/2020

Challenges of Applying Deep Reinforcement Learning in Dynamic Dispatching

Dynamic dispatching aims to smartly allocate the right resources to the ...
research
02/27/2020

Review, Analyze, and Design a Comprehensive Deep Reinforcement Learning Framework

Reinforcement learning (RL) has emerged as a standard approach for build...

Please sign up or login with your details

Forgot password? Click here to reset