A model-based approach to meta-Reinforcement Learning: Transformers and tree search

08/24/2022
by   Brieuc Pinon, et al.
0

Meta-learning is a line of research that develops the ability to leverage past experiences to efficiently solve new learning problems. Meta-Reinforcement Learning (meta-RL) methods demonstrate a capability to learn behaviors that efficiently acquire and exploit information in several meta-RL problems. In this context, the Alchemy benchmark has been proposed by Wang et al. [2021]. Alchemy features a rich structured latent space that is challenging for state-of-the-art model-free RL methods. These methods fail to learn to properly explore then exploit. We develop a model-based algorithm. We train a model whose principal block is a Transformer Encoder to fit the symbolic Alchemy environment dynamics. Then we define an online planner with the learned model using a tree search method. This algorithm significantly outperforms previously applied model-free RL methods on the symbolic Alchemy problem. Our results reveal the relevance of model-based approaches with online planning to perform exploration and exploitation successfully in meta-RL. Moreover, we show the efficiency of the Transformer architecture to learn complex dynamics that arise from latent spaces present in meta-RL problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/26/2020

MELD: Meta-Reinforcement Learning from Images via Latent State Models

Meta-reinforcement learning algorithms can enable autonomous agents, suc...
research
01/11/2019

An investigation of model-free planning

The field of reinforcement learning (RL) is facing increasingly challeng...
research
01/15/2023

Neuro-symbolic Meta Reinforcement Learning for Trading

We model short-duration (e.g. day) trading in financial markets as a seq...
research
04/27/2020

Evolving Inborn Knowledge For Fast Adaptation in Dynamic POMDP Problems

Rapid online adaptation to changing tasks is an important problem in mac...
research
04/19/2020

Improving Robot Dual-System Motor Learning with Intrinsically Motivated Meta-Control and Latent-Space Experience Imagination

Combining model-based and model-free learning systems has been shown to ...
research
02/07/2019

Deeper & Sparser Exploration

We address the problem of efficient exploration by proposing a new meta ...
research
12/14/2021

How to Learn and Represent Abstractions: An Investigation using Symbolic Alchemy

Alchemy is a new meta-learning environment rich enough to contain intere...

Please sign up or login with your details

Forgot password? Click here to reset