Towards Sample Efficient Agents through Algorithmic Alignment

08/07/2020
by   Mingxuan Li, et al.
0

Deep reinforcement-learning agents have demonstrated great success on various tasks. However, current methods typically suffer from sample complexity problems when learning in high dimensional observation spaces, which limits the application of deep reinforcement-learning agents to complex, uncertain real-world tasks. In this work, we propose and explore Deep Graph Value Network as a promising method to work around this drawback using a message-passing mechanism. The main idea is that the RL agent should be guided by structured non-neural-network algorithms like dynamic programming. According to recent advances in algorithmic alignment, neural networks with structured computation procedures can be trained efficiently. We demonstrate the potential of graph neural network in supporting sample efficient learning by showing that Deep Graph Value Network can outperform unstructured baselines by a large margin with low sample complexity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/18/2021

Combining Pessimism with Optimism for Robust and Efficient Model-Based Deep Reinforcement Learning

In real-world tasks, reinforcement learning (RL) agents frequently encou...
research
11/29/2022

Continuous Neural Algorithmic Planners

Neural algorithmic reasoning studies the problem of learning algorithms ...
research
03/02/2020

Scaling Up Multiagent Reinforcement Learning for Robotic Systems: Learn an Adaptive Sparse Communication Graph

The complexity of multiagent reinforcement learning (MARL) in multiagent...
research
09/21/2021

Graph-based Cluttered Scene Generation and Interactive Exploration using Deep Reinforcement Learning

We introduce a novel method to teach a robotic agent to interactively ex...
research
12/07/2015

How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies

Using deep neural nets as function approximator for reinforcement learni...
research
03/29/2022

Graph Neural Networks are Dynamic Programmers

Recent advances in neural algorithmic reasoning with graph neural networ...
research
09/06/2018

Model-Based Stabilisation of Deep Reinforcement Learning

Though successful in high-dimensional domains, deep reinforcement learni...

Please sign up or login with your details

Forgot password? Click here to reset