Generalizable Episodic Memory for Deep Reinforcement Learning

03/11/2021
by   Hao Hu, et al.
0

Episodic memory-based methods can rapidly latch onto past successful strategies by a non-parametric memory and improve sample efficiency of traditional reinforcement learning. However, little effort is put into the continuous domain, where a state is never visited twice and previous episodic methods fail to efficiently aggregate experience across trajectories. To address this problem, we propose Generalizable Episodic Memory (GEM), which effectively organizes the state-action values of episodic memory in a generalizable manner and supports implicit planning on memorized trajectories. GEM utilizes a double estimator to reduce the overestimation bias induced by value propagation in the planning process. Empirical evaluation shows that our method significantly outperforms existing trajectory-based methods on various MuJoCo continuous control tasks. To further show the general applicability, we evaluate our method on Atari games with discrete action space, which also shows significant improvement over baseline algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/28/2022

Continuous Episodic Control

Non-parametric episodic memory can be used to quickly latch onto high-re...
research
04/20/2023

Two-Memory Reinforcement Learning

While deep reinforcement learning has shown important empirical success,...
research
06/16/2021

Solving Continuous Control with Episodic Memory

Episodic memory lets reinforcement learning algorithms remember and expl...
research
04/13/2021

Subgoal-based Reward Shaping to Improve Efficiency in Reinforcement Learning

Reinforcement learning, which acquires a policy maximizing long-term rew...
research
02/12/2018

Efficient Model-Based Deep Reinforcement Learning with Variational State Tabulation

Modern reinforcement learning algorithms reach super-human performance i...
research
08/22/2022

Efficient Planning in a Compact Latent Action Space

While planning-based sequence modelling methods have shown great potenti...

Please sign up or login with your details

Forgot password? Click here to reset