DreamingV2: Reinforcement Learning with Discrete World Models without Reconstruction

03/01/2022
by   Masashi Okada, et al.
0

The present paper proposes a novel reinforcement learning method with world models, DreamingV2, a collaborative extension of DreamerV2 and Dreaming. DreamerV2 is a cutting-edge model-based reinforcement learning from pixels that uses discrete world models to represent latent states with categorical variables. Dreaming is also a form of reinforcement learning from pixels that attempts to avoid the autoencoding process in general world model training by involving a reconstruction-free contrastive learning objective. The proposed DreamingV2 is a novel approach of adopting both the discrete representation of DreamingV2 and the reconstruction-free objective of Dreaming. Compared to DreamerV2 and other recent model-based methods without reconstruction, DreamingV2 achieves the best scores on five simulated challenging 3D robot arm tasks. We believe that DreamingV2 will be a reliable solution for robot learning since its discrete representation is suitable to describe discontinuous environments, and the reconstruction-free fashion well manages complex vision observations.

READ FULL TEXT

page 1

page 3

page 5

research
07/29/2020

Dreaming: Model-based Reinforcement Learning by Latent Imagination without Reconstruction

In the present paper, we propose a decoder-free extension of Dreamer, a ...
research
08/06/2020

Contrastive Variational Model-Based Reinforcement Learning for Complex Observations

Deep model-based reinforcement learning (MBRL) has achieved great sample...
research
03/21/2018

Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement Learning for Planned-Ahead Vision-and-Language Navigation

Existing research studies on vision and language grounding for robot nav...
research
04/18/2020

Modeling Survival in model-based Reinforcement Learning

Although recent model-free reinforcement learning algorithms have been s...
research
04/08/2020

CURL: Contrastive Unsupervised Representations for Reinforcement Learning

We present CURL: Contrastive Unsupervised Representations for Reinforcem...
research
08/04/2023

Learning to Shape by Grinding: Cutting-surface-aware Model-based Reinforcement Learning

Object shaping by grinding is a crucial industrial process in which a ro...

Please sign up or login with your details

Forgot password? Click here to reset