Analysis of Stochastic Processes through Replay Buffers

06/26/2022
by   Shirli Di-Castro Shashua, et al.
0

Replay buffers are a key component in many reinforcement learning schemes. Yet, their theoretical properties are not fully understood. In this paper we analyze a system where a stochastic process X is pushed into a replay buffer and then randomly sampled to generate a stochastic process Y from the replay buffer. We provide an analysis of the properties of the sampled process such as stationarity, Markovity and autocorrelation in terms of the properties of the original process. Our theoretical analysis sheds light on why replay buffer may be a good de-correlator. Our analysis provides theoretical tools for proving the convergence of replay buffer based algorithms which are prevalent in reinforcement learning schemes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/16/2023

Temporal Difference Learning with Experience Replay

Temporal-difference (TD) learning is widely regarded as one of the most ...
research
07/16/2019

Deep Reinforcement Learning Based Robot Arm Manipulation with Efficient Training Data through Simulation

Deep reinforcement learning trains neural networks using experiences sam...
research
10/03/2021

Parallel Actors and Learners: A Framework for Generating Scalable RL Implementations

Reinforcement Learning (RL) has achieved significant success in applicat...
research
10/01/2021

Sim and Real: Better Together

Simulation is used extensively in autonomous systems, particularly in ro...
research
05/04/2023

Rethinking Population-assisted Off-policy Reinforcement Learning

While off-policy reinforcement learning (RL) algorithms are sample effic...
research
04/04/2018

iReplayer: In-situ and Identical Record-and-Replay for Multithreaded Applications

Reproducing executions of multithreaded programs is very challenging due...
research
07/28/2020

A Novel Token-Based Replay Technique to Speed Up Conformance Checking and Process Enhancement

Token-based replay used to be the standard way to conduct conformance ch...

Please sign up or login with your details

Forgot password? Click here to reset