An Empirical Comparison of Neural Architectures for Reinforcement Learning in Partially Observable Environments

12/17/2015
by Denis Steckelmacher, et al.

This paper explores the performance of fitted neural Q iteration for reinforcement learning in several partially observable environments, using three recurrent neural network architectures: Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), and MUT1, a recurrent neural architecture evolved from a pool of several thousand candidate architectures. A variant of fitted Q iteration, based on Advantage values instead of Q values, is also explored. The results show that GRU performs significantly better than LSTM and MUT1 on most of the problems considered, requiring fewer training episodes and less CPU time before learning a very good policy. Advantage learning also tends to produce better results.
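
To make the difference between the two variants concrete, the following is a minimal sketch of how the regression targets for one fitting step might be computed. The abstract does not state the exact update rule, so the Advantage target here assumes the classic advantage learning formulation of Harmon and Baird, with a scaling factor `kappa`. The function names, the list-based batch layout, and the default values of `gamma` and `kappa` are all hypothetical. In the setting described above, the per-action value estimates would come from a recurrent network (LSTM, GRU, or MUT1) fed the observation history; here they are passed in as plain lists.

```python
from typing import List, Sequence


def q_targets(
    rewards: Sequence[float],
    next_values: Sequence[Sequence[float]],
    terminals: Sequence[bool],
    gamma: float = 0.9,
) -> List[float]:
    """Fitted Q iteration targets: r + gamma * max_a' Q(s', a')."""
    targets = []
    for r, nxt, done in zip(rewards, next_values, terminals):
        # No bootstrapping from the next state when the episode has ended.
        bootstrap = 0.0 if done else gamma * max(nxt)
        targets.append(r + bootstrap)
    return targets


def advantage_targets(
    rewards: Sequence[float],
    values: Sequence[Sequence[float]],
    next_values: Sequence[Sequence[float]],
    terminals: Sequence[bool],
    gamma: float = 0.9,
    kappa: float = 0.5,
) -> List[float]:
    """Advantage learning targets (assumed Harmon and Baird form):

    max_a A(s, a) + (r + gamma * max_a' A(s', a') - max_a A(s, a)) / kappa
    """
    targets = []
    for r, cur, nxt, done in zip(rewards, values, next_values, terminals):
        best_now = max(cur)
        bootstrap = 0.0 if done else gamma * max(nxt)
        # Dividing by kappa < 1 widens the gap between the best action
        # and the others; kappa = 1 recovers the Q-learning target.
        targets.append(best_now + (r + bootstrap - best_now) / kappa)
    return targets
```

Under this assumed formulation, setting `kappa` to 1 makes the two target functions coincide, so the Advantage variant can be read as Q-value fitting with an amplified gap between the greedy action and the alternatives.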

research
04/23/2021

A modularity comparison of Long Short-Term Memory and Morphognosis neural networks

This study compares the modularity performance of two artificial neural ...
research
05/26/2020

Comparison of Recurrent Neural Network Architectures for Wildfire Spread Modelling

Wildfire modelling is an attempt to reproduce fire behaviour. Through ac...
research
09/13/2023

Efficient quantum recurrent reinforcement learning via quantum reservoir computing

Quantum reinforcement learning (QRL) has emerged as a framework to solve...
research
09/30/2022

Efficient LSTM Training with Eligibility Traces

Training recurrent neural networks is predominantly achieved via backpro...
research
10/02/2004

Applying Policy Iteration for Training Recurrent Neural Networks

Recurrent neural networks are often used for learning time-series data. ...
research
12/20/2017

A Flexible Approach to Automated RNN Architecture Generation

The process of designing neural architectures requires expert knowledge ...
research
11/20/2019

Avoiding Jammers: A Reinforcement Learning Approach

This paper investigates the anti-jamming performance of a cognitive rada...
