Transformers are Sample Efficient World Models

09/01/2022
by   Vincent Micheli, et al.
19

Deep reinforcement learning agents are notoriously sample inefficient, which considerably limits their application to real-world problems. Recently, many model-based methods have been designed to address this issue, with learning in the imagination of a world model being one of the most prominent approaches. However, while virtually unlimited interaction with a simulated environment sounds appealing, the world model has to be accurate over extended periods of time. Motivated by the success of Transformers in sequence modeling tasks, we introduce IRIS, a data-efficient agent that learns in a world model composed of a discrete autoencoder and an autoregressive Transformer. With the equivalent of only two hours of gameplay in the Atari 100k benchmark, IRIS achieves a mean human normalized score of 1.046, and outperforms humans on 10 out of 26 games. Our approach sets a new state of the art for methods without lookahead search, and even surpasses MuZero. To foster future research on Transformers and world models for sample-efficient reinforcement learning, we release our codebase at https://github.com/eloialonso/iris.

READ FULL TEXT

page 3

page 4

page 5

page 8

page 18

page 19

research
06/30/2022

Deep Reinforcement Learning with Swin Transformer

Transformers are neural network models that utilize multiple layers of s...
research
04/20/2021

MBRL-Lib: A Modular Library for Model-based Reinforcement Learning

Model-based reinforcement learning is a compelling framework for data-ef...
research
04/08/2020

Adaptive Transformers in RL

Recent developments in Transformers have opened new interesting areas of...
research
06/28/2022

Masked World Models for Visual Control

Visual model-based reinforcement learning (RL) has the potential to enab...
research
03/13/2023

Transformer-based World Models Are Happy With 100k Interactions

Deep neural networks have been successful in many reinforcement learning...
research
09/06/2018

Model-Based Stabilisation of Deep Reinforcement Learning

Though successful in high-dimensional domains, deep reinforcement learni...
research
10/05/2021

An Ample Approach to Data and Modeling

In the present work, we describe a framework for modeling how models can...

Please sign up or login with your details

Forgot password? Click here to reset