Evaluating the Performance of Reinforcement Learning Algorithms

06/30/2020
by   Scott M. Jordan, et al.
0

Performance evaluations are critical for quantifying algorithmic advances in reinforcement learning. Recent reproducibility analyses have shown that reported performance results are often inconsistent and difficult to replicate. In this work, we argue that the inconsistency of performance stems from the use of flawed evaluation metrics. Taking a step towards ensuring that reported results are consistent, we propose a new comprehensive evaluation methodology for reinforcement learning algorithms that produces reliable measurements of performance both on a single environment and when aggregated across environments. We demonstrate this method by evaluating a broad class of reinforcement learning algorithms on standard benchmark tasks.

READ FULL TEXT
research
10/31/2014

A Comparison of learning algorithms on the Arcade Learning Environment

Reinforcement learning agents have traditionally been evaluated on small...
research
12/12/2019

The PlayStation Reinforcement Learning Environment (PSXLE)

We propose a new benchmark environment for evaluating Reinforcement Lear...
research
06/12/2020

A Brief Look at Generalization in Visual Meta-Reinforcement Learning

Due to the realization that deep reinforcement learning algorithms train...
research
08/13/2019

Is Deep Reinforcement Learning Really Superhuman on Atari?

Consistent and reproducible evaluation of Deep Reinforcement Learning (D...
research
09/09/2019

A Survey on Reproducibility by Evaluating Deep Reinforcement Learning Algorithms on Real-World Robots

As reinforcement learning (RL) achieves more success in solving complex ...
research
04/07/2018

Scalable Sentiment for Sequence-to-sequence Chatbot Response with Performance Analysis

Conventional seq2seq chatbot models only try to find the sentences with ...
research
02/27/2021

Multi-agent Reinforcement Learning in OpenSpiel: A Reproduction Report

In this report, we present results reproductions for several core algori...

Please sign up or login with your details

Forgot password? Click here to reset