Sample Efficient Deep Reinforcement Learning via Local Planning

01/29/2023
by   Dong Yin, et al.
0

The focus of this work is sample-efficient deep reinforcement learning (RL) with a simulator. One useful property of simulators is that it is typically easy to reset the environment to a previously observed state. We propose an algorithmic framework, named uncertainty-first local planning (UFLP), that takes advantage of this property. Concretely, in each data collection iteration, with some probability, our meta-algorithm resets the environment to an observed state which has high uncertainty, instead of sampling according to the initial-state distribution. The agent-environment interaction then proceeds as in the standard online RL setting. We demonstrate that this simple procedure can dramatically improve the sample cost of several baseline RL algorithms on difficult exploration tasks. Notably, with our framework, we can achieve super-human performance on the notoriously hard Atari game, Montezuma's Revenge, with a simple (distributional) double DQN. Our work can be seen as an efficient approximate implementation of an existing algorithm with theoretical guarantees, which offers an interpretation of the positive empirical results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/21/2022

Planning with Uncertainty: Deep Exploration in Model-Based Reinforcement Learning

Deep model-based Reinforcement Learning (RL) has shown super-human perfo...
research
09/03/2020

Sample-Efficient Automated Deep Reinforcement Learning

Despite significant progress in challenging problems across various doma...
research
09/13/2017

Automated Cloud Provisioning on AWS using Deep Reinforcement Learning

As the use of cloud computing continues to rise, controlling cost become...
research
10/11/2021

REIN-2: Giving Birth to Prepared Reinforcement Learning Agents Using Reinforcement Learning Agents

Deep Reinforcement Learning (Deep RL) has been in the spotlight for the ...
research
02/23/2021

MUSBO: Model-based Uncertainty Regularized and Sample Efficient Batch Optimization for Deployment Constrained Reinforcement Learning

In many contemporary applications such as healthcare, finance, robotics,...
research
05/13/2019

Distributional Reinforcement Learning for Efficient Exploration

In distributional reinforcement learning (RL), the estimated distributio...
research
04/30/2019

Generative Adversarial Imagination for Sample Efficient Deep Reinforcement Learning

Reinforcement learning has seen great advancements in the past five year...

Please sign up or login with your details

Forgot password? Click here to reset