The Role of Diverse Replay for Generalisation in Reinforcement Learning

06/09/2023
by Max Weltevrede, et al.

In reinforcement learning (RL), the exploration strategy and the replay buffer are key components of many algorithms. Together they regulate which environment data is collected and trained on, and both have been studied extensively in the RL literature. In this paper, we investigate the impact of these components on generalisation in multi-task RL. We test the hypothesis that collecting and training on more diverse data from the training environment improves zero-shot generalisation to new environments/tasks. We motivate mathematically and show empirically that increasing the diversity of transitions in the replay buffer improves generalisation to states that are "reachable" during training. Furthermore, we show empirically that the same strategy also improves generalisation to similar but "unreachable" states, which could be due to improved generalisation of the latent representations.
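To make the notion of replay "diversity" concrete, the following is a minimal sketch and not the paper's method. It assumes a toy chain environment, an epsilon-greedy collection loop, and a hypothetical `state_coverage` proxy (the number of distinct states stored in the buffer); none of these names or choices come from the paper. It only illustrates how the exploration strategy determines what the replay buffer contains.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-capacity FIFO buffer of (state, action, reward, next_state, done)."""

    def __init__(self, capacity):
        self.storage = deque(maxlen=capacity)

    def add(self, transition):
        self.storage.append(transition)

    def sample(self, batch_size, rng):
        # Uniform sampling without replacement from the stored transitions.
        return rng.sample(list(self.storage), batch_size)

    def state_coverage(self):
        # Hypothetical diversity proxy (an assumption for illustration):
        # the number of distinct states currently stored.
        return len({t[0] for t in self.storage})


def collect(buffer, epsilon, steps, rng, n_states=10):
    """Collect transitions from a toy chain with states 0..n_states-1.

    The assumed 'greedy' policy always moves left, so it stays at state 0.
    With probability epsilon a uniformly random action is taken instead.
    """
    state = 0
    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.choice([-1, 1])
        else:
            action = -1
        next_state = min(max(state + action, 0), n_states - 1)
        reward = 1.0 if next_state == n_states - 1 else 0.0
        buffer.add((state, action, reward, next_state, False))
        state = next_state


if __name__ == "__main__":
    rng = random.Random(0)
    for eps in (0.0, 0.5, 1.0):
        buf = ReplayBuffer(capacity=500)
        collect(buf, epsilon=eps, steps=200, rng=rng)
        print(f"epsilon={eps}: distinct states in buffer = {buf.state_coverage()}")
```

With `epsilon=0.0` the buffer in this toy setup only ever contains transitions from a single state, whereas more exploratory collection fills it with transitions from more of the chain. Whether that extra diversity helps generalisation is the empirical question the paper addresses; this sketch does not test it.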

research
12/04/2017

A Deeper Look at Experience Replay

Experience replay plays an important role in the success of deep reinfor...
research
07/15/2023

An Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNets

Reinforcement Learning (RL) algorithms aim to learn an optimal policy by...
research
05/04/2023

Rethinking Population-assisted Off-policy Reinforcement Learning

While off-policy reinforcement learning (RL) algorithms are sample effic...
research
04/05/2023

Persuading to Prepare for Quitting Smoking with a Virtual Coach: Using States and User Characteristics to Predict Behavior

Despite their prevalence in eHealth applications for behavior change, pe...
research
10/06/2021

Replay-Guided Adversarial Environment Design

Deep reinforcement learning (RL) agents may successfully generalize to n...
research
08/23/2021

Collect Infer – a fresh look at data-efficient Reinforcement Learning

This position paper proposes a fresh look at Reinforcement Learning (RL)...
research
05/03/2021

Robotic Surgery With Lean Reinforcement Learning

As surgical robots become more common, automating away some of the burde...
