Meta-Reinforcement Learning Using Model Parameters

10/27/2022
by   Gabriel Hartmann, et al.
0

In meta-reinforcement learning, an agent is trained in multiple different environments and attempts to learn a meta-policy that can efficiently adapt to a new environment. This paper presents RAMP, a Reinforcement learning Agent using Model Parameters that utilizes the idea that a neural network trained to predict environment dynamics encapsulates the environment information. RAMP is constructed in two phases: in the first phase, a multi-environment parameterized dynamic model is learned. In the second phase, the model parameters of the dynamic model are used as context for the multi-environment policy of the model-free reinforcement learning agent.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/17/2019

Iterative Model-Based Reinforcement Learning Using Simulations in the Differentiable Neural Computer

We propose a lifelong learning architecture, the Neural Computer Agent (...
research
05/06/2020

Safe Reinforcement Learning through Meta-learned Instincts

An important goal in reinforcement learning is to create agents that can...
research
05/11/2021

A Meta Reinforcement Learning-based Approach for Self-Adaptive System

A self-learning adaptive system (SLAS) uses machine learning to enable a...
research
09/19/2022

Meta-Reinforcement Learning for Adaptive Control of Second Order Systems

Meta-learning is a branch of machine learning which aims to synthesize d...
research
10/09/2018

Reinforcement Learning for Improving Agent Design

In many reinforcement learning tasks, the goal is to learn a policy to m...
research
09/04/2018

Recurrent World Models Facilitate Policy Evolution

A generative recurrent neural network is quickly trained in an unsupervi...
research
07/07/2023

Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning

Discovering achievements with a hierarchical structure on procedurally g...

Please sign up or login with your details

Forgot password? Click here to reset