Reducing Overestimation Bias by Increasing Representation Dissimilarity in Ensemble Based Deep Q-Learning

06/24/2020
by   Hassam Ullah Sheikh, et al.
0

The first deep RL algorithm, DQN, was limited by the overestimation bias of the learned Q-function. Subsequent algorithms proposed techniques to reduce this problem, without fully eliminating it. Recently, the Maxmin and Ensemble Q-learning algorithms used the different estimates provided by ensembles of learners to reduce the bias. Unfortunately, in many scenarios the learners converge to the same point in the parametric or representation space, falling back to the classic single neural network DQN. In this paper, we describe a regularization technique to increase the dissimilarity in the representation space in these algorithms. We propose and compare five regularization functions inspired from economics theory and consensus optimization. We show that the resulting approach significantly outperforms the Maxmin and Ensemble Q-learning algorithms as well as non-ensemble baselines.

READ FULL TEXT

page 4

page 14

page 15

page 16

page 18

page 19

research
08/29/2019

Deep Neural Network Ensembles against Deception: Ensemble Diversity, Accuracy and Robustness

Ensemble learning is a methodology that integrates multiple DNN learners...
research
01/21/2021

Discussion of Ensemble Learning under the Era of Deep Learning

Due to the dominant position of deep learning (mostly deep neural networ...
research
11/05/2020

Generalized Negative Correlation Learning for Deep Ensembling

Ensemble algorithms offer state of the art performance in many machine l...
research
09/16/2022

Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks

In temporal-difference reinforcement learning algorithms, variance in va...
research
05/30/2019

Deep ensemble learning for Alzheimers disease classification

Ensemble learning use multiple algorithms to obtain better predictive pe...
research
07/10/2021

Is a Single Model Enough? MuCoS: A Multi-Model Ensemble Learning for Semantic Code Search

Recently, deep learning methods have become mainstream in code search si...

Please sign up or login with your details

Forgot password? Click here to reset