Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning

10/23/2020
by   Guangxiang Zhu, et al.
5

Sample efficiency has been one of the major challenges for deep reinforcement learning. Recently, model-based reinforcement learning has been proposed to address this challenge by performing planning on imaginary trajectories with a learned world model. However, world model learning may suffer from overfitting to training trajectories, and thus model-based value estimation and policy search will be pone to be sucked in an inferior local policy. In this paper, we propose a novel model-based reinforcement learning algorithm, called BrIdging Reality and Dream (BIRD). It maximizes the mutual information between imaginary and real trajectories so that the policy improvement learned from imaginary trajectories can be easily generalized to real trajectories. We demonstrate that our approach improves sample efficiency of model-based planning, and achieves state-of-the-art performance on challenging visual control benchmarks.

READ FULL TEXT

page 4

page 9

research
11/04/2020

Learning Trajectories for Visual-Inertial System Calibration via Model-based Heuristic Deep Reinforcement Learning

Visual-inertial systems rely on precise calibrations of both camera intr...
research
06/19/2019

Calibrated Model-Based Deep Reinforcement Learning

Estimates of predictive uncertainty are important for accurate model-bas...
research
04/30/2023

Posterior Sampling for Deep Reinforcement Learning

Despite remarkable successes, deep reinforcement learning algorithms rem...
research
07/12/2018

The Bottleneck Simulator: A Model-based Deep Reinforcement Learning Approach

Deep reinforcement learning has recently shown many impressive successes...
research
03/24/2021

Discriminator Augmented Model-Based Reinforcement Learning

By planning through a learned dynamics model, model-based reinforcement ...
research
09/16/2022

Value Summation: A Novel Scoring Function for MPC-based Model-based Reinforcement Learning

This paper proposes a novel scoring function for the planning module of ...
research
10/12/2019

Regularizing Model-Based Planning with Energy-Based Models

Model-based reinforcement learning could enable sample-efficient learnin...

Please sign up or login with your details

Forgot password? Click here to reset