On Improving Deep Reinforcement Learning for POMDPs

04/17/2018
by   Pengfei Zhu, et al.

Deep Reinforcement Learning (RL) has recently emerged as one of the most competitive approaches for learning in sequential decision-making problems with fully observable environments, e.g., computer Go. However, very little work in deep RL has addressed partially observable environments. We propose a new architecture called Action-specific Deep Recurrent Q-Network (ADRQN) to enhance learning performance in partially observable domains. Each action is encoded by a fully connected layer and coupled with a convolutionally encoded observation to form an action-observation pair. The time series of action-observation pairs is then integrated by an LSTM layer that learns latent states, from which a fully connected layer computes Q-values as in conventional Deep Q-Networks (DQNs). We demonstrate the effectiveness of our new architecture in several partially observable domains, including flickering Atari games.
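The data flow described in the abstract (action encoding, convolutional observation encoding, LSTM integration, Q-value head) can be illustrated with a minimal forward-pass sketch. This is not the authors' implementation: the class name `TinyADRQN`, the one-hot action input, the single 3x3 convolution kernel, all layer sizes, and the random untrained weights are assumptions chosen only to keep the example small and dependent on nothing beyond the Python standard library.

```python
import math
import random

def linear(x, W, b):
    """Fully connected layer: y = W x + b."""
    return [sum(w * xi for w, xi in zip(row, x)) + bi for row, bi in zip(W, b)]

def conv2d_valid(img, kernel):
    """Single-channel 'valid' 2D cross-correlation, returned as a flat list."""
    k, h, w = len(kernel), len(img), len(img[0])
    return [sum(img[i + di][j + dj] * kernel[di][dj]
                for di in range(k) for dj in range(k))
            for i in range(h - k + 1) for j in range(w - k + 1)]

def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))

def lstm_step(x, h, c, W, b):
    """One LSTM step; W maps the concatenation [x; h] to stacked gates (i, f, g, o)."""
    n = len(h)
    z = linear(x + h, W, b)
    i = [sigmoid(a) for a in z[:n]]
    f = [sigmoid(a) for a in z[n:2 * n]]
    g = [math.tanh(a) for a in z[2 * n:3 * n]]
    o = [sigmoid(a) for a in z[3 * n:]]
    c_new = [fi * ci + ii * gi for fi, ci, ii, gi in zip(f, c, i, g)]
    h_new = [oi * math.tanh(ci) for oi, ci in zip(o, c_new)]
    return h_new, c_new

def rand_matrix(rng, rows, cols, scale=0.1):
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]

class TinyADRQN:
    """Hypothetical, untrained toy network; all sizes are illustrative assumptions."""

    def __init__(self, n_actions, obs_size=6, kernel=3, act_dim=4, hidden=8, seed=0):
        rng = random.Random(seed)
        self.n_actions, self.hidden = n_actions, hidden
        self.kernel = rand_matrix(rng, kernel, kernel)
        conv_dim = (obs_size - kernel + 1) ** 2
        self.W_act = rand_matrix(rng, act_dim, n_actions)
        self.b_act = [0.0] * act_dim
        self.W_lstm = rand_matrix(rng, 4 * hidden, act_dim + conv_dim + hidden)
        self.b_lstm = [0.0] * (4 * hidden)
        self.W_q = rand_matrix(rng, n_actions, hidden)
        self.b_q = [0.0] * n_actions

    def initial_state(self):
        return [0.0] * self.hidden, [0.0] * self.hidden

    def step(self, prev_action, obs, state):
        h, c = state
        # Action branch: one-hot action (assumption) through a fully connected layer.
        one_hot = [1.0 if a == prev_action else 0.0 for a in range(self.n_actions)]
        act_feat = linear(one_hot, self.W_act, self.b_act)
        # Observation branch: convolution followed by ReLU.
        obs_feat = [max(0.0, v) for v in conv2d_valid(obs, self.kernel)]
        # The action-observation pair is integrated into the latent state by the LSTM.
        h, c = lstm_step(act_feat + obs_feat, h, c, self.W_lstm, self.b_lstm)
        # A fully connected head maps the latent state to one Q-value per action.
        return linear(h, self.W_q, self.b_q), (h, c)

# Usage: roll the network over a short sequence of random 6x6 observations.
net = TinyADRQN(n_actions=3)
state = net.initial_state()
obs_rng = random.Random(1)
prev_action = 0
for t in range(5):
    obs = [[obs_rng.random() for _ in range(6)] for _ in range(6)]
    q_values, state = net.step(prev_action, obs, state)
    prev_action = max(range(net.n_actions), key=lambda a: q_values[a])
```

Because the LSTM state is carried from step to step, the Q-values at time t depend on the whole action-observation history rather than on the current frame alone, which is the property that matters when individual observations are incomplete.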

research
10/15/2020

Recurrent Distributed Reinforcement Learning for Partially Observable Robotic Assembly

In this work we solve for partially observable reinforcement learning (R...
research
10/31/2017

Regret Minimization for Partially Observable Deep Reinforcement Learning

Deep reinforcement learning algorithms that estimate state and state-act...
research
05/25/2016

A PAC RL Algorithm for Episodic POMDPs

Many interesting real world domains involve reinforcement learning (RL) ...
research
01/09/2017

Reinforcement Learning via Recurrent Convolutional Neural Networks

Deep Reinforcement Learning has enabled the learning of policies for com...
research
12/10/2021

Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning

This paper proposes a new sequential model learning architecture to solv...
research
06/26/2019

Rethinking Formal Models of Partially Observable Multiagent Decision Making

Multiagent decision-making problems in partially observable environments...
research
09/21/2019

Deep Reinforcement Learning with Modulated Hebbian plus Q Network Architecture

This paper introduces the modulated Hebbian plus Q network architecture ...
