Deep Reinforcement Learning with Graph-based State Representations

04/29/2020
by   Vikram Waradpande, et al.
66

Deep RL approaches build much of their success on the ability of the deep neural network to generate useful internal representations. Nevertheless, they suffer from a high sample-complexity and starting with a good input representation can have a significant impact on the performance. In this paper, we exploit the fact that the underlying Markov decision process (MDP) represents a graph, which enables us to incorporate the topological information for effective state representation learning. Motivated by the recent success of node representations for several graph analytical tasks we specifically investigate the capability of node representation learning methods to effectively encode the topology of the underlying MDP in Deep RL. To this end we perform a comparative analysis of several models chosen from 4 different classes of representation learning algorithms for policy learning in grid-world navigation tasks, which are representative of a large class of RL problems. We find that all embedding methods outperform the commonly used matrix representation of grid-world environments in all of the studied cases. Moreoever, graph convolution based methods are outperformed by simpler random walk based methods and graph linear autoencoders.

READ FULL TEXT

page 10

page 11

page 12

research
06/15/2021

On the Power of Multitask Representation Learning in Linear MDP

While multitask representation learning has become a popular approach in...
research
05/11/2020

TOMA: Topological Map Abstraction for Reinforcement Learning

Animals are able to discover the topological map (graph) of surrounding ...
research
10/26/2020

Reinforcement Learning Enhanced Heterogeneous Graph Neural Network

Heterogeneous Information Networks (HINs), involving a diversity of node...
research
11/19/2020

GL-Coarsener: A Graph representation learning framework to construct coarse grid hierarchy for AMG solvers

In many numerical schemes, the computational complexity scales non-linea...
research
10/04/2019

Learning Robust Representations with Graph Denoising Policy Network

Graph representation learning, aiming to learn low-dimensional represent...
research
10/25/2020

XLVIN: eXecuted Latent Value Iteration Nets

Value Iteration Networks (VINs) have emerged as a popular method to inco...

Please sign up or login with your details

Forgot password? Click here to reset