Diverse Offline Imitation via Fenchel Duality

07/21/2023
by   Marin Vlastelica, et al.
0

There has been significant recent progress in the area of unsupervised skill discovery, with various works proposing mutual information based objectives, as a source of intrinsic motivation. Prior works predominantly focused on designing algorithms that require online access to the environment. In contrast, we develop an offline skill discovery algorithm. Our problem formulation considers the maximization of a mutual information objective constrained by a KL-divergence. More precisely, the constraints ensure that the state occupancy of each skill remains close to the state occupancy of an expert, within the support of an offline dataset with good state-action coverage. Our main contribution is to connect Fenchel duality, reinforcement learning and unsupervised skill discovery, and to give a simple offline algorithm for learning diverse skills that are aligned with an expert.

READ FULL TEXT

page 1

page 10

research
02/01/2022

CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery

We introduce Contrastive Intrinsic Control (CIC), an algorithm for unsup...
research
05/08/2023

Behavior Contrastive Learning for Unsupervised Skill Discovery

In reinforcement learning, unsupervised skill discovery aims to learn di...
research
12/14/2020

Relative Variational Intrinsic Control

In the absence of external rewards, agents can still learn useful behavi...
research
02/09/2022

Bayesian Nonparametrics for Offline Skill Discovery

Skills or low-level policies in reinforcement learning are temporally ex...
research
10/28/2021

Wasserstein Distance Maximizing Intrinsic Control

This paper deals with the problem of learning a skill-conditioned policy...
research
05/04/2023

Confidence-Based Skill Reproduction Through Perturbation Analysis

Several methods exist for teaching robots, with one of the most prominen...

Please sign up or login with your details

Forgot password? Click here to reset