An Information-Theoretic Optimality Principle for Deep Reinforcement Learning

08/06/2017
by   Felix Leibfried, et al.
0

In this paper, we methodologically address the problem of cumulative reward overestimation in deep reinforcement learning. We generalise notions from information-theoretic bounded rationality to handle high-dimensional state spaces efficiently. The resultant algorithm encompasses a wide range of learning outcomes that can be demonstrated by tuning a Lagrange multiplier that intrinsically penalises rewards. We show that deep Q-networks arise as a special case of our proposed approach. We introduce a novel scheduling scheme for bounded-rational behaviour that ensures sample efficiency and robustness. In experiments on Atari games, we show that our algorithm outperforms other deep reinforcement learning algorithms (e.g., deep and double deep Q-networks) in terms of both game-play performance and sample complexity.

READ FULL TEXT
research
08/16/2019

Performing Deep Recurrent Double Q-Learning for Atari Games

Currently, many applications in Machine Learning are based on define new...
research
09/06/2018

Model-Based Stabilisation of Deep Reinforcement Learning

Though successful in high-dimensional domains, deep reinforcement learni...
research
11/05/2016

Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening

We propose a novel training algorithm for reinforcement learning which c...
research
02/26/2019

Can Meta-Interpretive Learning outperform Deep Reinforcement Learning of Evaluable Game strategies?

World-class human players have been outperformed in a number of complex ...
research
09/03/2022

Model-Free Deep Reinforcement Learning in Software-Defined Networks

This paper compares two deep reinforcement learning approaches for cyber...
research
01/02/2023

Deep reinforcement learning for irrigation scheduling using high-dimensional sensor feedback

Deep reinforcement learning has considerable potential to improve irriga...

Please sign up or login with your details

Forgot password? Click here to reset