Hypernetworks for Zero-shot Transfer in Reinforcement Learning

11/28/2022
by Sahand Rezaei-Shoshtari, et al.

In this paper, hypernetworks are trained to generate behaviors across a range of unseen task conditions, via a novel TD-based training objective and data from a set of near-optimal RL solutions for training tasks. This work relates to meta RL, contextual RL, and transfer learning, with a particular focus on zero-shot performance at test time, enabled by knowledge of the task parameters (also known as context). Our technical approach views each RL algorithm as a mapping from the MDP specifics to the near-optimal value function and policy, and we seek to approximate this mapping with a hypernetwork that generates near-optimal value functions and policies given the parameters of the MDP. We show that, under certain conditions, learning this mapping can be treated as a supervised learning problem. We empirically evaluate the effectiveness of our method for zero-shot transfer to new reward and transition dynamics on a series of continuous control tasks from the DeepMind Control Suite. Our method demonstrates significant improvements over baselines from multitask and meta RL approaches.
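To make the idea of a context-to-policy mapping concrete, the following is a minimal toy sketch, not the paper's implementation. It assumes a scalar context, a one-dimensional state, and a hypothetical near-optimal "expert" whose feedback gain depends linearly on the context. The names `expert_action`, `LinearHypernetwork`, `train`, and `zero_shot_error` are invented for illustration. Only the supervised view is shown: the hypernetwork is fit by regression on expert actions from a few training contexts and then queried on an unseen context. The TD-based objective mentioned in the abstract is not reproduced here.

```python
import random


def expert_action(context, state):
    # Hypothetical near-optimal policy for the task identified by `context`:
    # a linear feedback law whose gain depends on the task parameter.
    return -(1.0 + 2.0 * context) * state


class LinearHypernetwork:
    """Maps a scalar context to the single weight of a linear policy a = theta * s."""

    def __init__(self):
        self.w = 0.0
        self.b = 0.0

    def generate(self, context):
        # Generated policy parameter theta for this context.
        return self.w * context + self.b

    def act(self, context, state):
        # Action of the generated policy.
        return self.generate(context) * state

    def sgd_step(self, context, state, target_action, lr):
        # Squared-error regression on the expert action (supervised view).
        error = self.act(context, state) - target_action
        grad_theta = error * state  # d(0.5 * error^2) / d(theta)
        self.w -= lr * grad_theta * context  # chain rule through theta = w * c + b
        self.b -= lr * grad_theta
        return 0.5 * error ** 2


def train(hyper, train_contexts, steps=20000, lr=0.05, seed=0):
    rng = random.Random(seed)
    for _ in range(steps):
        c = rng.choice(train_contexts)
        s = rng.uniform(-1.0, 1.0)
        hyper.sgd_step(c, s, expert_action(c, s), lr)
    return hyper


def zero_shot_error(hyper, context, states):
    # Largest action mismatch against the expert on a context never used in training.
    return max(abs(hyper.act(context, s) - expert_action(context, s)) for s in states)


hyper = train(LinearHypernetwork(), train_contexts=[0.0, 0.25, 0.5, 0.75, 1.0])
unseen = 0.6  # not among the training contexts
err = zero_shot_error(hyper, unseen, [-1.0, -0.5, 0.5, 1.0])
print(f"generated gain {hyper.generate(unseen):.3f}, max action error {err:.4f}")
```

In this toy setting the expert's gain is exactly linear in the context, so a linear hypernetwork can represent the mapping and generalizes to the unseen context. With neural-network policies and richer task variation the mapping would be approximated by a larger hypernetwork, and generalization is not guaranteed in the same way.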

research
10/01/2022

Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning

Humans are capable of abstracting various tasks as different combination...
research
07/06/2020

Jump Operator Planning: Goal-Conditioned Policy Ensembles and Zero-Shot Transfer

In Hierarchical Control, compositionality, abstraction, and task-transfe...
research
07/19/2018

Multitask Reinforcement Learning for Zero-shot Generalization with Subtask Dependencies

We introduce a new RL problem where the agent is required to execute a g...
research
10/26/2018

Transfer of Deep Reactive Policies for MDP Planning

Domain-independent probabilistic planners input an MDP description in a ...
research
09/29/2022

Does Zero-Shot Reinforcement Learning Exist?

A zero-shot RL agent is an agent that can solve any RL task in a given e...
research
02/10/2023

Robust Knowledge Transfer in Tiered Reinforcement Learning

In this paper, we study the Tiered Reinforcement Learning setting, a par...
research
03/05/2023

Bounding the Optimal Value Function in Compositional Reinforcement Learning

In the field of reinforcement learning (RL), agents are often tasked wit...
