Online Contrastive Divergence with Generative Replay: Experience Replay without Storing Data

Conceived in the early 1990s, Experience Replay (ER) has been shown to be a successful mechanism to allow online learning algorithms to reuse past experiences. Traditionally, ER can be applied to all machine learning paradigms (i.e., unsupervised, supervised, and reinforcement learning). Recently, ER has contributed to improving the performance of deep reinforcement learning. Yet, its application to many practical settings is still limited by the memory requirements of ER, necessary to explicitly store previous observations. To remedy this issue, we explore a novel approach, Online Contrastive Divergence with Generative Replay (OCD_GR), which uses the generative capability of Restricted Boltzmann Machines (RBMs) instead of recorded past experiences. The RBM is trained online, and does not require the system to store any of the observed data points. We compare OCD_GR to ER on 9 real-world datasets, considering a worst-case scenario (data points arriving in sorted order) as well as a more realistic one (sequential random-order data points). Our results show that in 64.28% of the cases OCD_GR outperforms ER and in the remaining 35.72% it has an almost equal performance, while having a considerable advantage in space complexity (i.e., memory usage) at a comparable time complexity.
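
To make the idea of replaying generated rather than stored data concrete, the following is a minimal sketch, not the authors' implementation. It assumes a binary RBM trained with one-step contrastive divergence (CD-1), and all names (`TinyRBM`, `generate`, `cd1_update`, `online_step`), the number of Gibbs steps, the random hidden-state initialization, and the learning rate are illustrative choices that do not come from the abstract. At each step, replay samples are drawn from the current model by Gibbs sampling, mixed with the newly arrived points, and used for a single update; no observed data point is kept afterwards.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class TinyRBM:
    """Small binary RBM; a hypothetical helper class for illustration only."""

    def __init__(self, n_visible, n_hidden, seed=0):
        self.rng = random.Random(seed)
        self.nv, self.nh = n_visible, n_hidden
        self.W = [[self.rng.gauss(0.0, 0.01) for _ in range(n_hidden)]
                  for _ in range(n_visible)]
        self.a = [0.0] * n_visible  # visible biases
        self.b = [0.0] * n_hidden   # hidden biases

    def hidden_probs(self, v):
        return [sigmoid(self.b[j] + sum(v[i] * self.W[i][j] for i in range(self.nv)))
                for j in range(self.nh)]

    def visible_probs(self, h):
        return [sigmoid(self.a[i] + sum(h[j] * self.W[i][j] for j in range(self.nh)))
                for i in range(self.nv)]

    def sample(self, probs):
        return [1.0 if self.rng.random() < p else 0.0 for p in probs]

    def generate(self, n_samples, gibbs_steps=3):
        """Draw replay samples from the model instead of from a stored buffer."""
        out = []
        for _ in range(n_samples):
            h = self.sample([0.5] * self.nh)  # random hidden start (assumption)
            for _ in range(gibbs_steps):      # gibbs_steps must be >= 1
                v = self.sample(self.visible_probs(h))
                h = self.sample(self.hidden_probs(v))
            out.append(v)
        return out

    def cd1_update(self, batch, lr=0.1):
        """CD-1, applied point by point with each step scaled by 1/len(batch)."""
        scale = lr / len(batch)
        for v0 in batch:
            h0 = self.hidden_probs(v0)
            v1 = self.visible_probs(self.sample(h0))  # reconstruction probabilities
            h1 = self.hidden_probs(v1)
            for i in range(self.nv):
                for j in range(self.nh):
                    self.W[i][j] += scale * (v0[i] * h0[j] - v1[i] * h1[j])
                self.a[i] += scale * (v0[i] - v1[i])
            for j in range(self.nh):
                self.b[j] += scale * (h0[j] - h1[j])


def online_step(rbm, new_points, n_replay):
    """One online update: fresh data plus model-generated replay samples."""
    replay = rbm.generate(n_replay)
    rbm.cd1_update(list(new_points) + replay)


# Usage: a toy stream in sorted order (the worst-case scenario in the abstract).
rbm = TinyRBM(n_visible=6, n_hidden=3)
stream = [[1.0, 1.0, 1.0, 0.0, 0.0, 0.0]] * 20 + [[0.0, 0.0, 0.0, 1.0, 1.0, 1.0]] * 20
for x in stream:
    online_step(rbm, [x], n_replay=2)  # x is discarded after this call
```

In this sketch the memory footprint is fixed by the RBM parameters (`W`, `a`, `b`) and does not grow with the number of observed points, whereas an ER buffer would have to hold the points themselves.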


