DeepAI AI Chat
Log In Sign Up

Improved Context-Based Offline Meta-RL with Attention and Contrastive Learning

02/22/2021
by   Lanqing Li, et al.
1

Meta-learning for offline reinforcement learning (OMRL) is an understudied problem with tremendous potential impact by enabling RL algorithms in many real-world applications. A popular solution to the problem is to infer task identity as augmented state using a context-based encoder, for which efficient learning of task representations remains an open challenge. In this work, we improve upon one of the SOTA OMRL algorithms, FOCAL, by incorporating intra-task attention mechanism and inter-task contrastive learning objectives for more effective task inference and learning of control. Theoretical analysis and experiments are presented to demonstrate the superior performance, efficiency and robustness of our end-to-end and model free method compared to prior algorithms across multiple meta-RL benchmarks.

READ FULL TEXT

page 1

page 2

page 3

page 4

10/02/2020

Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization

We study the offline meta-reinforcement learning (OMRL) problem, a parad...
03/03/2020

Learning Context-aware Task Reasoning for Efficient Meta-reinforcement Learning

Despite recent success of deep network-based Reinforcement Learning (RL)...
04/01/2022

earning Context-aware Task Reasoning for Efficient Meta Reinforcement Learning

Despite recent success of deep network-based Reinforcement Learning (RL)...
06/21/2022

Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning

We study offline meta-reinforcement learning, a practical reinforcement ...
03/17/2022

Meta Reinforcement Learning for Adaptive Control: An Offline Approach

Meta-learning is a branch of machine learning which trains neural networ...
11/15/2021

Learning Representations for Pixel-based Control: What Matters and Why?

Learning representations for pixel-based control has garnered significan...
12/24/2022

An Adaptive Deep RL Method for Non-Stationary Environments with Piecewise Stable Context

One of the key challenges in deploying RL to real-world applications is ...