Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks

11/24/2020
by   Fabio Pardo, et al.
0

Being able to reach any desired location in the environment can be a valuable asset for an agent. Learning a policy to navigate between all pairs of states individually is often not feasible. An all-goals updating algorithm uses each transition to learn Q-values towards all goals simultaneously and off-policy. However the expensive numerous updates in parallel limited the approach to small tabular cases so far. To tackle this problem we propose to use convolutional network architectures to generate Q-values and updates for a large number of goals at once. We demonstrate the accuracy and generalization qualities of the proposed method on randomly generated mazes and Sokoban puzzles. In the case of on-screen goal coordinates the resulting mapping from frames to distance-maps directly informs the agent about which places are reachable and in how many steps. As an example of application we show that replacing the random actions in ε-greedy exploration by several actions towards feasible goals generates better exploratory trajectories on Montezuma's Revenge and Super Mario All-Stars games.

READ FULL TEXT

page 3

page 4

page 5

page 6

page 7

research
10/06/2018

Q-map: a Convolutional Approach for Goal-Oriented Reinforcement Learning

Goal-oriented learning has become a core concept in reinforcement learni...
research
07/05/2018

Goal-oriented Trajectories for Efficient Exploration

Exploration is a difficult challenge in reinforcement learning and even ...
research
06/22/2018

Many-Goals Reinforcement Learning

All-goals updating exploits the off-policy nature of Q-learning to updat...
research
05/20/2022

A Fully Controllable Agent in the Path Planning using Goal-Conditioned Reinforcement Learning

The aim of path planning is to reach the goal from starting point by sea...
research
04/04/2019

Self-Adapting Goals Allow Transfer of Predictive Models to New Tasks

A long-standing challenge in Reinforcement Learning is enabling agents t...
research
03/20/2020

Deep Sets for Generalization in RL

This paper investigates the idea of encoding object-centered representat...

Please sign up or login with your details

Forgot password? Click here to reset