On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness

10/19/2022
by   Haotian Ye, et al.
0

Generalization in Reinforcement Learning (RL) aims to learn an agent during training that generalizes to the target environment. This paper studies RL generalization from a theoretical aspect: how much can we expect pre-training over training environments to be helpful? When the interaction with the target environment is not allowed, we certify that the best we can obtain is a near-optimal policy in an average sense, and we design an algorithm that achieves this goal. Furthermore, when the agent is allowed to interact with the target environment, we give a surprising result showing that asymptotically, the improvement from pre-training is at most a constant factor. On the other hand, in the non-asymptotic regime, we design an efficient algorithm and prove a distribution-based regret bound in the target environment that is independent of the state-action space.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/18/2023

Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated Environments

One of the key challenges of Reinforcement Learning (RL) is the ability ...
research
01/04/2021

A novel policy for pre-trained Deep Reinforcement Learning for Speech Emotion Recognition

Reinforcement Learning (RL) is a semi-supervised learning paradigm which...
research
05/23/2017

Enhanced Experience Replay Generation for Efficient Reinforcement Learning

Applying deep reinforcement learning (RL) on real systems suffers from s...
research
02/09/2023

An Investigation into Pre-Training Object-Centric Representations for Reinforcement Learning

Unsupervised object-centric representation (OCR) learning has recently d...
research
06/02/2023

Efficient RL with Impaired Observability: Learning to Act with Delayed and Missing State Observations

In real-world reinforcement learning (RL) systems, various forms of impa...
research
10/06/2022

Reinforcement Learning with Large Action Spaces for Neural Machine Translation

Applying Reinforcement learning (RL) following maximum likelihood estima...
research
03/02/2022

Evolving Curricula with Regret-Based Environment Design

It remains a significant challenge to train generally capable agents wit...

Please sign up or login with your details

Forgot password? Click here to reset