Deep Recurrent Q-Learning vs Deep Q-Learning on a simple Partially Observable Markov Decision Process with Minecraft

03/11/2019
by   Clément Romac, et al.
36

Deep Q-Learning has been successfully applied to a wide variety of tasks in the past several years. However, the architecture of the vanilla Deep Q-Network is not suited to deal with partially observable environments such as 3D video games. For this, recurrent layers had been added to the Deep Q-Network in order to allow it to handle past dependencies. We here use Minecraft for its customization advantages and design two very simple missions that can be frames as Partially Observable Markov Decision Process. We compare on these missions the Deep Q-Network and the Deep Recurrent Q-Network in order to see if the latter, which is trickier and longer to train, is always the best architecture when the agent has to deal with partial observability.

READ FULL TEXT

page 2

page 15

page 16

page 17

page 18

research
07/29/2023

Dynamic deep-reinforcement-learning algorithm in Partially Observed Markov Decision Processes

Reinforcement learning has been greatly improved in recent studies and a...
research
05/17/2017

Learning to Represent Haptic Feedback for Partially-Observable Tasks

The sense of touch, being the earliest sensory system to develop in a hu...
research
10/18/2021

Lifting DecPOMDPs for Nanoscale Systems – A Work in Progress

DNA-based nanonetworks have a wide range of promising use cases, especia...
research
02/22/2022

SIPOMDPLite-Net: Lightweight, Self-Interested Learning and Planning in POSGs with Sparse Interactions

This work introduces sIPOMDPLite-net, a deep neural network (DNN) archit...
research
12/10/2021

Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning

This paper proposes a new sequential model learning architecture to solv...
research
06/23/2022

Formalizing the Problem of Side Effect Regularization

AI objectives are often hard to specify properly. Some approaches tackle...
research
11/29/2022

Airfoil Shape Optimization using Deep Q-Network

The feasibility of using reinforcement learning for airfoil shape optimi...

Please sign up or login with your details

Forgot password? Click here to reset