EgoMap: Projective mapping and structured egocentric memory for Deep RL

01/24/2020
by Edward Beeching, et al.

Tasks involving localization, memorization and planning in partially observable 3D environments are an ongoing challenge in Deep Reinforcement Learning. We present EgoMap, a spatially structured neural memory architecture. EgoMap improves the performance of a deep reinforcement learning agent in 3D environments on challenging tasks with multi-step objectives. The EgoMap architecture incorporates several inductive biases, including a differentiable inverse projection of CNN feature vectors onto a top-down spatially structured map. The map is updated with ego-motion measurements through a differentiable affine transform. We show that this architecture outperforms both standard recurrent agents and state-of-the-art agents with structured memory. We demonstrate that incorporating these inductive biases into an agent's architecture allows for stable training with reward alone, circumventing the expense of acquiring and labelling expert trajectories. A detailed ablation study demonstrates the impact of key aspects of the architecture, and through extensive qualitative analysis we show how the agent exploits its structured internal memory to achieve higher performance.
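
The two geometric operations named in the abstract, inverse projection onto a top-down map and an ego-motion update of that map, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a pinhole camera, a single-channel feature map with a known forward distance per feature location, max pooling when several features fall into the same cell, and nearest-neighbour lookup in place of a differentiable sampler. The function names `inverse_project` and `ego_motion_update`, the grid layout (agent at the bottom-centre, facing "up") and all parameters are hypothetical choices made for the illustration.

```python
import math


def inverse_project(features, depth, hfov_deg, map_size, cell_size):
    """Scatter an H x W single-channel feature map onto a top-down grid.

    Assumed layout: the agent sits at the bottom-centre of the grid and
    faces "up", so row map_size - 1 is nearest and row 0 is farthest.
    depth[v][u] is the forward distance (not the ray length) of each feature.
    """
    height, width = len(features), len(features[0])
    focal = (width / 2.0) / math.tan(math.radians(hfov_deg) / 2.0)  # in pixels
    centre_u = (width - 1) / 2.0
    half = map_size // 2
    grid = [[None] * map_size for _ in range(map_size)]
    for v in range(height):
        for u in range(width):
            z = depth[v][u]                  # forward distance
            x = (u - centre_u) * z / focal   # lateral offset, pinhole model
            row = map_size - 1 - math.floor(z / cell_size)
            col = half + math.floor(x / cell_size)
            if 0 <= row < map_size and 0 <= col < map_size:
                old = grid[row][col]
                value = features[v][u]
                # Assumption: collisions in a cell are resolved by max pooling.
                grid[row][col] = value if old is None else max(old, value)
    # Cells that received no feature are filled with zero.
    return [[0.0 if cell is None else cell for cell in r] for r in grid]


def ego_motion_update(grid, dx, dz, dtheta, cell_size):
    """Re-express the map in the agent's new frame.

    The agent first translates by (dx, dz), given in its old frame
    (x to the right, z forward), then turns right by dtheta radians.
    Each target cell looks up its source cell (inverse warping).
    """
    map_size = len(grid)
    half = map_size // 2
    cos_t, sin_t = math.cos(dtheta), math.sin(dtheta)
    out = [[0.0] * map_size for _ in range(map_size)]
    for row in range(map_size):
        for col in range(map_size):
            # Centre of the target cell in the new agent frame.
            x_new = (col - half + 0.5) * cell_size
            z_new = (map_size - 1 - row + 0.5) * cell_size
            # The same point expressed in the old agent frame.
            x_old = dx + x_new * cos_t + z_new * sin_t
            z_old = dz - x_new * sin_t + z_new * cos_t
            src_row = map_size - 1 - math.floor(z_old / cell_size)
            src_col = half + math.floor(x_old / cell_size)
            if 0 <= src_row < map_size and 0 <= src_col < map_size:
                out[row][col] = grid[src_row][src_col]
    return out


# Usage: project three features, then move the agent one cell forward.
ego_map = inverse_project([[1.0, 5.0, 3.0]], [[1.2, 2.5, 1.2]], 90.0, 5, 1.0)
ego_map = ego_motion_update(ego_map, 0.0, 1.0, 0.0, 1.0)
```

After the forward step, every stored feature sits one row closer to the agent, and content that leaves the grid is discarded. The nearest-neighbour lookup used here passes no gradient to the pose. A trainable variant would typically swap it for bilinear interpolation and operate on multi-channel tensors in an autodiff framework.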

research
09/18/2016

Playing FPS Games with Deep Reinforcement Learning

Advances in deep reinforcement learning have allowed autonomous agents t...
research
11/14/2018

Evolving intrinsic motivations for altruistic behavior

Multi-agent cooperation is an important feature of the natural world. Ma...
research
05/23/2022

Generalization, Mayhems and Limits in Recurrent Proximal Policy Optimization

At first sight it may seem straightforward to use recurrent layers in De...
research
06/05/2018

Relational Deep Reinforcement Learning

We introduce an approach for deep reinforcement learning (RL) that impro...
research
07/05/2019

On Inductive Biases in Deep Reinforcement Learning

Many deep reinforcement learning algorithms contain inductive biases tha...
research
02/08/2021

Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning

Although reinforcement learning has been successfully applied in many do...
research
07/12/2023

PID-Inspired Inductive Biases for Deep Reinforcement Learning in Partially Observable Control Tasks

Deep reinforcement learning (RL) has shown immense potential for learnin...