Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari

02/24/2018
by   Patryk Chrabaszcz, et al.

Evolution Strategies (ES) have recently been demonstrated to be a viable alternative to reinforcement learning (RL) algorithms on a set of challenging deep RL problems, including Atari games and MuJoCo humanoid locomotion benchmarks. While the ES algorithms in that work belonged to the specialized class of natural evolution strategies (which resemble approximate gradient RL algorithms, such as REINFORCE), we demonstrate that even a very basic canonical ES algorithm can achieve the same or even better performance. This success of a basic ES algorithm suggests that the state of the art can be advanced further by integrating the many advances made in the field of ES in the last decades. We also demonstrate qualitatively that ES algorithms have very different performance characteristics than traditional RL algorithms: on some games, they learn to exploit the environment and perform much better, while on others they can get stuck in suboptimal local minima. Combining their strengths with those of traditional RL algorithms is therefore likely to lead to new advances in the state of the art.
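To make the phrase "very basic canonical ES algorithm" concrete, the following is a minimal sketch of one common form of a canonical (mu, lambda) evolution strategy. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not give the update rule, so the log-rank recombination weights, the fixed step size `sigma`, the population sizes, and the toy objective `toy_fitness` are all hypothetical choices made here for the example. A real Atari setup would replace `toy_fitness` with the episode return of a policy network whose weights are `theta`.

```python
import numpy as np


def canonical_es(fitness, theta0, sigma=0.1, lam=20, mu=5, iterations=200, seed=0):
    """Sketch of a canonical (mu, lambda)-ES with a fixed step size.

    All hyperparameter defaults are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    theta = np.asarray(theta0, dtype=float).copy()

    # Log-rank recombination weights (assumed form): better ranks weigh more.
    ranks = np.arange(1, mu + 1)
    weights = np.log(mu + 0.5) - np.log(ranks)
    weights /= weights.sum()

    for _ in range(iterations):
        # Sample lambda offspring as Gaussian perturbations of the parent.
        noise = rng.standard_normal((lam, theta.size))
        scores = np.array([fitness(theta + sigma * eps) for eps in noise])

        # Keep the mu best offspring (higher fitness is better).
        best = np.argsort(-scores)[:mu]

        # Move the parent along the weighted average of the best perturbations.
        theta = theta + sigma * (weights @ noise[best])
    return theta


def toy_fitness(x):
    # Hypothetical stand-in for an episode return: maximized at x = 3.
    return -np.sum((x - 3.0) ** 2)


result = canonical_es(toy_fitness, theta0=np.zeros(5))
```

Unlike a natural evolution strategy, this sketch never forms a gradient estimate from all offspring; it only ranks them, discards all but the top `mu`, and recombines those. That ranking-and-truncation step is what makes the method "canonical" in the sense used here.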


