Large-Scale Retrieval for Reinforcement Learning

06/10/2022
by   Peter C. Humphreys, et al.
0

Effective decision making involves flexibly relating past experiences and relevant contextual information to a novel situation. In deep reinforcement learning, the dominant paradigm is for an agent to amortise information that helps decision-making into its network weights via gradient descent on training losses. Here, we pursue an alternative approach in which agents can utilise large-scale context-sensitive database lookups to support their parametric computations. This allows agents to directly learn in an end-to-end manner to utilise relevant information to inform their outputs. In addition, new information can be attended to by the agent, without retraining, by simply augmenting the retrieval dataset. We study this approach in Go, a challenging game for which the vast combinatorial state space privileges generalisation over direct matching to past experiences. We leverage fast, approximate nearest neighbor techniques in order to retrieve relevant data from a set of tens of millions of expert demonstration states. Attending to this information provides a significant boost to prediction accuracy and game-play performance over simply using these demonstrations as training trajectories, providing a compelling demonstration of the value of large-scale retrieval in reinforcement learning agents.

READ FULL TEXT

page 3

page 5

page 7

page 8

page 10

page 11

page 13

page 14

research
02/17/2022

Retrieval-Augmented Reinforcement Learning

Most deep reinforcement learning (RL) algorithms distill experience into...
research
04/17/2019

Rogue-Gym: A New Challenge for Generalization in Reinforcement Learning

This paper presents Rogue-Gym, that enables agents to learn and play a s...
research
11/01/2022

Dungeons and Data: A Large-Scale NetHack Dataset

Recent breakthroughs in the development of agents to solve challenging s...
research
09/12/2021

Learning Selective Communication for Multi-Agent Path Finding

Learning communication via deep reinforcement learning (RL) or imitation...
research
06/05/2020

Balancing Reinforcement Learning Training Experiences in Interactive Information Retrieval

Interactive Information Retrieval (IIR) and Reinforcement Learning (RL) ...
research
10/01/2020

How to Motivate Your Dragon: Teaching Goal-Driven Agents to Speak and Act in Fantasy Worlds

We seek to create agents that both act and communicate with other agents...
research
08/18/2021

Explainable Deep Reinforcement Learning Using Introspection in a Non-episodic Task

Explainable reinforcement learning allows artificial agents to explain t...

Please sign up or login with your details

Forgot password? Click here to reset