Deterministic Implementations for Reproducibility in Deep Reinforcement Learning

09/15/2018
by   prabhatn, et al.
0

While deep reinforcement learning (DRL) has led to numerous successes in recent years, reproducing these successes can be extremely challenging. One reproducibility challenge particularly relevant to DRL is nondeterminism in the training process, which can substantially affect the results. Motivated by this challenge, we study the positive impacts of deterministic implementations in eliminating nondeterminism in training. To do so, we consider the particular case of the deep Q-learning algorithm, for which we produce a deterministic implementation by identifying and controlling all sources of nondeterminism in the training process. One by one, we then allow individual sources of nondeterminism to affect our otherwise deterministic implementation, and measure the impact of each source on the variance in performance. We find that individual sources of nondeterminism can substantially impact the performance of agent, illustrating the benefits of deterministic implementations. In addition, we also discuss the important role of deterministic implementations in achieving exact replicability of results.

READ FULL TEXT

page 13

page 14

page 15

page 16

page 17

research
04/05/2018

A Human Mixed Strategy Approach to Deep Reinforcement Learning

In 2015, Google's DeepMind announced an advancement in creating an auton...
research
03/11/2019

Stroke-based Artistic Rendering Agent with Deep Reinforcement Learning

Excellent painters can use only a few strokes to create a fantastic pain...
research
09/18/2020

RLzoo: A Comprehensive and Adaptive Reinforcement Learning Library

Recently, we have seen a rapidly growing adoption of Deep Reinforcement ...
research
04/12/2019

Let's Play Again: Variability of Deep Reinforcement Learning Agents in Atari Environments

Reproducibility in reinforcement learning is challenging: uncontrolled s...
research
11/07/2016

Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning

Instability and variability of Deep Reinforcement Learning (DRL) algorit...
research
07/27/2023

The Effect of Third Party Implementations on Reproducibility

Reproducibility of recommender systems research has come under scrutiny ...
research
04/17/2019

On Resolving Non-determinism in Choreographies

Choreographies specify multiparty interactions via message passing. A re...

Please sign up or login with your details

Forgot password? Click here to reset