The Primacy Bias in Deep Reinforcement Learning

05/16/2022
by Evgenii Nikishin, et al.

This work identifies a common flaw of deep reinforcement learning (RL) algorithms: a tendency to rely on early interactions and ignore useful evidence encountered later. Because they train on progressively growing datasets, deep RL agents risk overfitting to earlier experiences, which negatively affects the rest of the learning process. Inspired by cognitive science, we refer to this effect as the primacy bias. Through a series of experiments, we dissect the algorithmic aspects of deep RL that exacerbate this bias. We then propose a simple yet generally applicable mechanism that tackles the primacy bias by periodically resetting a part of the agent. We apply this mechanism to algorithms in both discrete (Atari 100k) and continuous action (DeepMind Control Suite) domains, consistently improving their performance.
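The periodic-reset idea can be illustrated with a toy sketch. The abstract does not say which part of the agent is reset, how often, or what is kept, so everything concrete here is an assumption made for illustration: the hypothetical `TinyAgent` class, the choice to re-initialise only the final layer (`"head"`), the `reset_every` interval, and the choice to preserve the replay buffer across resets. The gradient update is left as a placeholder comment.

```python
import random


def init_layer(n_in, n_out, rng):
    """Return a freshly initialised weight matrix as a list of rows."""
    scale = 1.0 / n_in ** 0.5
    return [[rng.uniform(-scale, scale) for _ in range(n_in)]
            for _ in range(n_out)]


class TinyAgent:
    """Hypothetical two-layer agent standing in for a deep RL network."""

    def __init__(self, n_obs, n_hidden, n_actions, seed=0):
        self.rng = random.Random(seed)
        self.shapes = {
            "encoder": (n_obs, n_hidden),
            "head": (n_hidden, n_actions),
        }
        self.params = {name: init_layer(*shape, self.rng)
                       for name, shape in self.shapes.items()}
        # Assumption: collected experience survives every reset.
        self.replay_buffer = []

    def reset_part(self, names=("head",)):
        """Re-initialise only the named layers; all other state is kept."""
        for name in names:
            self.params[name] = init_layer(*self.shapes[name], self.rng)


def train(agent, total_steps, reset_every):
    """Toy training loop that resets part of the agent at a fixed interval."""
    resets = 0
    for step in range(1, total_steps + 1):
        # Placeholder transition; a real agent would act in an environment.
        agent.replay_buffer.append(("obs", "action", "reward", step))
        # A gradient update on a batch sampled from the buffer would go here.
        if step % reset_every == 0:
            agent.reset_part()
            resets += 1
    return resets
```

For example, `train(TinyAgent(4, 8, 2), total_steps=10, reset_every=5)` re-initialises the head twice while leaving the encoder weights and all ten stored transitions in place.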


