Representation Learning on Graphs: A Reinforcement Learning Application

01/16/2019
by Sephora Madjiheurem, et al.

In this work, we study value function approximation in reinforcement learning (RL) problems with high-dimensional state or action spaces via a generalized version of representation policy iteration (RPI). We consider the limitations of proto-value functions (PVFs) in accurately approximating the value function in low dimensions, and we highlight the importance of feature learning for improved low-dimensional value function approximation. We then adopt different representation learning algorithms on graphs to learn the basis functions that best represent the value function. We empirically show that node2vec, an algorithm for scalable feature learning in networks, and the Variational Graph Auto-Encoder consistently outperform the commonly used smooth proto-value functions in low-dimensional feature spaces.
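
To make the idea of proto-value functions as basis functions concrete, the following is a minimal sketch and not the authors' implementation. It assumes a 5x5 grid-world state graph, the normalized graph Laplacian as the source of the PVFs, and a hypothetical target value function (a discounted distance to a corner goal state). The helper names `grid_adjacency`, `proto_value_functions`, and `least_squares_fit` are illustrative only. The sketch fits the target by least squares using the k smoothest Laplacian eigenvectors and records the approximation error for several values of k.

```python
import numpy as np


def grid_adjacency(rows, cols):
    """Adjacency matrix of a 4-connected grid graph (one node per state)."""
    n = rows * cols
    A = np.zeros((n, n))
    for r in range(rows):
        for c in range(cols):
            i = r * cols + c
            if c + 1 < cols:  # edge to the right neighbour
                A[i, i + 1] = A[i + 1, i] = 1.0
            if r + 1 < rows:  # edge to the neighbour below
                A[i, i + cols] = A[i + cols, i] = 1.0
    return A


def proto_value_functions(A, k):
    """Return the k smoothest eigenvectors of the normalized graph Laplacian.

    Assumption: the normalized Laplacian I - D^{-1/2} A D^{-1/2} is used;
    other Laplacian variants are possible.
    """
    deg = A.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(deg)
    L = np.eye(len(A)) - d_inv_sqrt[:, None] * A * d_inv_sqrt[None, :]
    _, eigvecs = np.linalg.eigh(L)  # eigenvalues in ascending order
    return eigvecs[:, :k]


def least_squares_fit(Phi, v):
    """Project v onto the column span of the basis matrix Phi."""
    w, *_ = np.linalg.lstsq(Phi, v, rcond=None)
    return Phi @ w


rows, cols = 5, 5
A = grid_adjacency(rows, cols)

# Hypothetical target: value decays with Manhattan distance to the corner goal.
dist = np.array([abs(r - (rows - 1)) + abs(c - (cols - 1))
                 for r in range(rows) for c in range(cols)])
v = 0.9 ** dist

# Approximation error as a function of the number of basis functions.
errors = {}
for k in (2, 5, 10, 25):
    Phi = proto_value_functions(A, k)
    errors[k] = float(np.linalg.norm(v - least_squares_fit(Phi, v)))
```

Under these assumptions, `errors` shrinks as k grows and vanishes when k equals the number of states. The regime of interest in the abstract is small k, where the choice of basis matters most; a learned embedding such as node2vec would replace `proto_value_functions` as the source of `Phi`.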

research
02/05/2020

Deep RBF Value Functions for Continuous Control

A core operation in reinforcement learning (RL) is finding an action tha...
research
08/28/2015

Learning Efficient Representations for Reinforcement Learning

Markov decision processes (MDPs) are a well studied framework for solvin...
research
01/31/2019

A Geometric Perspective on Optimal Representations for Reinforcement Learning

This paper proposes a new approach to representation learning based on g...
research
06/17/2021

Adapting the Function Approximation Architecture in Online Reinforcement Learning

The performance of a reinforcement learning (RL) system depends on the c...
research
01/31/2012

Feature Selection for Value Function Approximation Using Bayesian Model Selection

Feature selection in reinforcement learning (RL), i.e. choosing basis fu...
research
03/22/2019

Symbolic Regression Methods for Reinforcement Learning

Reinforcement learning algorithms can be used to optimally solve dynamic...
research
08/25/2023

Nonparametric Additive Value Functions: Interpretable Reinforcement Learning with an Application to Surgical Recovery

We propose a nonparametric additive model for estimating interpretable v...
