Obstacle Tower Without Human Demonstrations: How Far a Deep Feed-Forward Network Goes with Reinforcement Learning

04/01/2020
by   Marco Pleines, et al.
0

The Obstacle Tower Challenge is the task to master a procedurally generated chain of levels that subsequently get harder to complete. Whereas the top 6 performing entries of last year's competition all used human demonstrations to learn how to cope with the challenge, we present an approach that performed competitively (placed 7th) but starts completely from scratch by means of Deep Reinforcement Learning with a relatively simple feed-forward deep network structure. We especially look at the generalization performance of the taken approach concerning different seeds and various visual themes that have become available after the competition, and investigate where the agent fails and why. Note that our approach does not possess a short-term memory like employing recurrent hidden states. With this work, we hope to contribute to a better understanding of what is possible with a relatively simple, flexible solution that can be applied to learning in environments featuring complex 3D visual input where the abstract task structure itself is still fairly simple.

READ FULL TEXT

page 3

page 4

page 5

page 6

research
03/12/2020

Sample Efficient Reinforcement Learning through Learning from Demonstrations in Minecraft

Sample inefficiency of deep reinforcement learning methods is a major ob...
research
07/15/2019

PPO Dash: Improving Generalization in Deep Reinforcement Learning

Deep reinforcement learning is prone to overfitting, and traditional ben...
research
11/17/2021

SEIHAI: A Sample-efficient Hierarchical AI for the MineRL Competition

The MineRL competition is designed for the development of reinforcement ...
research
08/01/2017

Natural Language Processing with Small Feed-Forward Networks

We show that small and shallow feed-forward neural networks can achieve ...
research
11/12/2018

Navigating Assistance System for Quadcopter with Deep Reinforcement Learning

In this paper, we present a deep reinforcement learning method for quadc...
research
08/05/2020

Working Memory for Online Memory Binding Tasks: A Hybrid Model

Working Memory is the brain module that holds and manipulates informatio...
research
03/31/2018

Learning to Run challenge: Synthesizing physiologically accurate motion using deep reinforcement learning

Synthesizing physiologically-accurate human movement in a variety of con...

Please sign up or login with your details

Forgot password? Click here to reset