Learning Options via Compression

12/08/2022
by Yiding Jiang, et al.

Identifying statistical regularities in solutions to some tasks in multi-task reinforcement learning can accelerate the learning of new tasks. Skill learning offers one way of identifying these regularities by decomposing pre-collected experiences into a sequence of skills. A popular approach to skill learning is to maximize the likelihood of the pre-collected experience with latent variable models, where the latent variables represent the skills. However, there are often many solutions that maximize the likelihood equally well, including degenerate solutions. To address this underspecification, we propose a new objective that combines the maximum likelihood objective with a penalty on the description length of the skills. This penalty incentivizes the skills to maximally extract common structures from the experiences. Empirically, our objective learns skills that solve downstream tasks with fewer samples than skills learned by maximizing likelihood alone. Further, while most prior work in the offline multi-task setting focuses on tasks with low-dimensional observations, our objective can scale to challenging tasks with high-dimensional image observations.
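
To make the idea of a description-length penalty concrete, the following is a minimal toy sketch, not the paper's actual objective or implementation. It assumes discrete skill labels, measures description length as the number of bits needed to encode all labels under their empirical frequencies, and adds that to a given negative log-likelihood. The function names, the `weight` parameter, and the two example segmentations (including what a degenerate solution might look like) are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def description_length_bits(skill_sequences):
    """Bits needed to encode every skill label under the empirical label frequencies.

    Toy stand-in for a description-length penalty (an assumption, not the paper's code).
    """
    counts = Counter(label for seq in skill_sequences for label in seq)
    total = sum(counts.values())
    return -sum(c * math.log2(c / total) for c in counts.values())


def penalized_objective(nll, skill_sequences, weight=1.0):
    """Negative log-likelihood plus a weighted description-length penalty (hypothetical form)."""
    return nll + weight * description_length_bits(skill_sequences)


# Two hypothetical segmentations of three trajectories, assumed to fit the data equally well.
shared = [["a", "b"], ["a", "b"], ["b", "a"]]              # reuses two skills
degenerate = [["s1", "s2"], ["s3", "s4"], ["s5", "s6"]]    # a new skill for every segment

same_nll = 10.0  # assumed identical likelihood term for both segmentations
print(penalized_objective(same_nll, shared))
print(penalized_objective(same_nll, degenerate))
```

Under these assumptions, the likelihood term alone cannot separate the two segmentations, while the penalty favors the one that reuses a small set of skills across trajectories.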

research
10/22/2020

Accelerating Reinforcement Learning with Learned Skill Priors

Intelligent agents rely heavily on prior experience when learning a new ...
research
09/19/2022

Latent Plans for Task-Agnostic Offline Reinforcement Learning

Everyday tasks of long-horizon and comprising a sequence of multiple imp...
research
10/06/2021

The Information Geometry of Unsupervised Reinforcement Learning

How can a reinforcement learning (RL) agent prepare to solve downstream ...
research
05/25/2018

A Scalable Approach to Multi-Context Continual Learning via Lifelong Skill Encoding

Continual or lifelong learning (CL) is one of the most challenging probl...
research
02/10/2016

Adaptive Skills, Adaptive Partitions (ASAP)

We introduce the Adaptive Skills, Adaptive Partitions (ASAP) framework t...
research
02/28/2022

Combining Modular Skills in Multitask Learning

A modular design encourages neural models to disentangle and recombine d...
research
12/09/2021

Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies

For robots operating in the real world, it is desirable to learn reusabl...
