A Convergent and Efficient Deep Q Network Algorithm

06/29/2021
by   Zhikang T. Wang, et al.

Despite the empirical success of the deep Q network (DQN) reinforcement learning algorithm and its variants, DQN is still not well understood and does not guarantee convergence. In this work, we show that DQN can diverge and cease to operate in realistic settings. Although gradient-based convergent methods exist, we show that they have inherent problems in learning behaviour, and we elucidate why they often fail in practice. To overcome these problems, we propose a convergent DQN algorithm (C-DQN) obtained by carefully modifying DQN, and we show that the algorithm is convergent and can work with large discount factors (0.9998). It learns robustly in difficult settings and, within a moderate computational budget, can learn several difficult games in the Atari 2600 benchmark where DQN fails. Our code has been publicly released and can be used to reproduce our results.
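
To make the abstract's contrast concrete, the following is a minimal Python sketch of the two textbook update directions it refers to: the semi-gradient used by DQN-style methods, and the full (residual) gradient used by gradient-based convergent methods. It is not the paper's C-DQN algorithm. The linear Q-function, the function names, and the simplification that the target weights equal the online weights are all assumptions made for illustration. The helper `effective_horizon` uses the common rule of thumb 1 / (1 - gamma) to show how far ahead a discount factor of 0.9998 looks.

```python
from typing import List


def dot(u: List[float], v: List[float]) -> float:
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def effective_horizon(gamma: float) -> float:
    """Rule-of-thumb planning horizon 1 / (1 - gamma), in time steps."""
    return 1.0 / (1.0 - gamma)


def td_error(w, phi_sa, reward, phi_next, gamma):
    """delta = r + gamma * Q(s', a') - Q(s, a) for a linear Q(s, a) = w . phi(s, a).

    phi_next is assumed to be the feature vector of the greedy next action.
    """
    return reward + gamma * dot(w, phi_next) - dot(w, phi_sa)


def semi_gradient(w, phi_sa, reward, phi_next, gamma):
    """Gradient of 0.5 * delta^2 with the bootstrap target held constant (DQN-style)."""
    delta = td_error(w, phi_sa, reward, phi_next, gamma)
    return [-delta * x for x in phi_sa]


def residual_gradient(w, phi_sa, reward, phi_next, gamma):
    """Full gradient of 0.5 * delta^2, also differentiating through Q(s', a')."""
    delta = td_error(w, phi_sa, reward, phi_next, gamma)
    return [delta * (gamma * xn - x) for x, xn in zip(phi_sa, phi_next)]


if __name__ == "__main__":
    # Hypothetical two-feature transition, chosen only for illustration.
    w, phi_sa, phi_next = [0.0, 2.0], [1.0, 0.0], [0.0, 1.0]
    print(semi_gradient(w, phi_sa, 1.0, phi_next, 0.5))
    print(residual_gradient(w, phi_sa, 1.0, phi_next, 0.5))
    print(effective_horizon(0.99), effective_horizon(0.9998))
```

In this sketch the two directions agree on the component for the current state-action pair and differ only in the extra term that the residual gradient applies to the next state-action features, since the semi-gradient treats the bootstrap target as a constant.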

research
12/21/2022

Control of Continuous Quantum Systems with Many Degrees of Freedom based on Convergent Reinforcement Learning

With the development of experimental quantum technology, quantum control...
research
07/15/2015

Massively Parallel Methods for Deep Reinforcement Learning

We present the first massively distributed architecture for deep reinfor...
research
08/05/2021

An Elementary Proof that Q-learning Converges Almost Surely

Watkins' and Dayan's Q-learning is a model-free reinforcement learning a...
research
09/09/2015

Continuous control with deep reinforcement learning

We adapt the ideas underlying the success of Deep Q-Learning to the cont...
research
08/18/2015

Distributed Deep Q-Learning

We propose a distributed deep learning model to successfully learn contr...
research
03/20/2018

Natural Gradient Deep Q-learning

This paper presents findings for training a Q-learning reinforcement lea...