Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning

07/05/2021
by   Muhammad Rizki Maulana, et al.
0

Ensemble and auxiliary tasks are both well known to improve the performance of machine learning models when data is limited. However, the interaction between these two methods is not well studied, particularly in the context of deep reinforcement learning. In this paper, we study the effects of ensemble and auxiliary tasks when combined with the deep Q-learning algorithm. We perform a case study on ATARI games under limited data constraint. Moreover, we derive a refined bias-variance-covariance decomposition to analyze the different ways of learning ensembles and using auxiliary tasks, and use the analysis to help provide some understanding of the case study. Our code is open source and available at https://github.com/NUS-LID/RENAULT.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/25/2020

Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO

We study the roots of algorithmic progress in deep policy gradient algor...
research
02/12/2019

ELF OpenGo: An Analysis and Open Reimplementation of AlphaZero

The AlphaGo, AlphaGo Zero, and AlphaZero series of algorithms are a rema...
research
09/16/2022

Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks

In temporal-difference reinforcement learning algorithms, variance in va...
research
02/25/2021

On The Effect of Auxiliary Tasks on Representation Dynamics

While auxiliary tasks play a key role in shaping the representations lea...
research
02/15/2021

Developing parsimonious ensembles using predictor diversity within a reinforcement learning framework

Heterogeneous ensembles that can aggregate an unrestricted number and va...
research
07/04/2021

Robust Restless Bandits: Tackling Interval Uncertainty with Deep Reinforcement Learning

We introduce Robust Restless Bandits, a challenging generalization of re...
research
03/24/2020

Distributional Reinforcement Learning with Ensembles

It is well-known that ensemble methods often provide enhanced performanc...

Please sign up or login with your details

Forgot password? Click here to reset