Graph neural induction of value iteration

09/26/2020
by Andreea Deac, et al.

Many reinforcement learning tasks can benefit from explicit planning based on an internal model of the environment. Previously, such planning components have been incorporated through a neural network that partially aligns with the computational graph of value iteration. Such networks have so far focused on restrictive environments (e.g. grid-worlds) and have modelled the planning procedure only indirectly. We relax these constraints, proposing a graph neural network (GNN) that executes the value iteration (VI) algorithm across arbitrary environment models, with direct supervision on the intermediate steps of VI. The results indicate that GNNs are able to model value iteration accurately, recovering favourable metrics and policies across a variety of out-of-distribution tests. This suggests that GNN executors with strong supervision are a viable component within deep reinforcement learning systems.
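For readers unfamiliar with the algorithm the GNN is trained to execute, the following is a minimal sketch of tabular value iteration that also records every intermediate value estimate, which is the kind of per-step signal that "direct supervision on the intermediate steps of VI" would presumably use as a target. It is not the authors' code: the function name `value_iteration`, the array layout of `P` and `R`, the discount `gamma`, and the toy two-state environment are all assumptions made for illustration.

```python
import numpy as np


def value_iteration(P, R, gamma=0.9, tol=1e-6, max_steps=1000):
    """Tabular value iteration (illustrative sketch, not the paper's code).

    P: array of shape (A, S, S), P[a, s, t] = probability of moving s -> t under action a.
    R: array of shape (S, A), R[s, a] = immediate reward for taking action a in state s.
    Returns the final values, a greedy policy, and the list of intermediate values.
    """
    num_states = P.shape[1]
    v = np.zeros(num_states)
    trajectory = [v.copy()]  # intermediate estimates v_0, v_1, ...
    for _ in range(max_steps):
        # Bellman backup: q[s, a] = R[s, a] + gamma * sum_t P[a, s, t] * v[t]
        q = R + gamma * np.einsum("ast,t->sa", P, v)
        v_next = q.max(axis=1)
        trajectory.append(v_next.copy())
        converged = np.max(np.abs(v_next - v)) < tol
        v = v_next
        if converged:
            break
    policy = q.argmax(axis=1)  # greedy action per state
    return v, policy, trajectory


# Hypothetical two-state environment: state 1 is absorbing with zero reward;
# in state 0, action 0 stays put (reward 0) and action 1 moves to state 1 (reward 1).
P = np.array([
    [[1.0, 0.0], [0.0, 1.0]],  # action 0
    [[0.0, 1.0], [0.0, 1.0]],  # action 1
])
R = np.array([
    [0.0, 1.0],  # state 0
    [0.0, 0.0],  # state 1
])
values, policy, trajectory = value_iteration(P, R)
```

Each backup only combines a state's reward with the values of its successor states, so the environment model can be read as a graph whose edges are the nonzero transitions. This local, neighbour-aggregating structure is what makes a message-passing GNN a natural candidate for imitating the update.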

