Model-Free Episodic Control

06/14/2016
by   Charles Blundell, et al.
0

State of the art deep reinforcement learning algorithms take many millions of interactions to attain human-level performance. Humans, on the other hand, can very quickly exploit highly rewarding nuances of an environment upon first discovery. In the brain, such rapid learning is thought to depend on the hippocampus and its capacity for episodic memory. Here we investigate whether a simple model of hippocampal episodic control can learn to solve difficult sequential decision-making tasks. We demonstrate that it not only attains a highly rewarding strategy significantly faster than state-of-the-art deep reinforcement learning algorithms, but also achieves a higher overall reward on some of the more challenging domains.

READ FULL TEXT
research
12/21/2020

Combining Deep Reinforcement Learning And Local Control For The Acrobot Swing-up And Balance Task

In this work we present a novel extension of soft actor critic, a state ...
research
12/29/2021

Sequential Episodic Control

State of the art deep reinforcement learning algorithms are sample ineff...
research
11/28/2022

Continuous Episodic Control

Non-parametric episodic memory can be used to quickly latch onto high-re...
research
03/08/2023

Using Memory-Based Learning to Solve Tasks with State-Action Constraints

Tasks where the set of possible actions depend discontinuously on the st...
research
02/16/2020

Investigating Simple Object Representations in Model-Free Deep Reinforcement Learning

We explore the benefits of augmenting state-of-the-art model-free deep r...
research
01/31/2022

DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning

Application of ensemble of neural networks is becoming an imminent tool ...
research
11/14/2020

Towards Human-Level Learning of Complex Physical Puzzles

Humans quickly solve tasks in novel systems with complex dynamics, witho...

Please sign up or login with your details

Forgot password? Click here to reset