Lipschitz-constrained Unsupervised Skill Discovery

02/02/2022
by Seohong Park, et al.

We study the problem of unsupervised skill discovery, whose goal is to learn a set of diverse and useful skills with no external reward. A number of existing skill discovery methods are based on maximizing the mutual information (MI) between skills and states. However, we point out that their MI objectives usually prefer static skills to dynamic ones, which may hinder their application to downstream tasks. To address this issue, we propose Lipschitz-constrained Skill Discovery (LSD), which encourages the agent to discover more diverse, dynamic, and far-reaching skills. Another benefit of LSD is that its learned representation function can be used to solve goal-following downstream tasks even in a zero-shot manner, i.e., without further training or complex planning. Through experiments on various MuJoCo robotic locomotion and manipulation environments, we demonstrate that LSD outperforms previous approaches in terms of skill diversity, state space coverage, and performance on seven downstream tasks, including the challenging task of following multiple goals on Humanoid. Our code and videos are available at https://shpark.me/projects/lsd/.
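The abstract does not spell out how zero-shot goal-following works. One plausible reading, stated here as an assumption and not as the authors' implementation, is that the skill is chosen as the unit vector pointing from the current state's representation toward the goal's representation. The sketch below illustrates that idea in plain Python; `select_skill` and `toy_phi` are hypothetical names, and `toy_phi` is a hand-written stand-in for a learned representation function.

```python
import math
from typing import Callable, List, Sequence

Vector = Sequence[float]


def select_skill(
    phi: Callable[[Vector], Vector],
    state: Vector,
    goal: Vector,
    eps: float = 1e-8,
) -> List[float]:
    """Return a unit-length skill pointing from phi(state) toward phi(goal).

    Assumption: skills are continuous vectors living in the same space as
    the output of phi. This is an illustrative guess, not the paper's code.
    """
    diff = [g - s for g, s in zip(phi(goal), phi(state))]
    norm = math.sqrt(sum(d * d for d in diff))
    if norm < eps:
        # State and goal coincide in representation space: no direction to follow.
        return [0.0] * len(diff)
    return [d / norm for d in diff]


def toy_phi(state: Vector) -> List[float]:
    # Hypothetical stand-in for a learned representation:
    # keep only the first two coordinates (e.g. an x-y position).
    return [state[0], state[1]]


skill = select_skill(toy_phi, state=[0.0, 0.0, 0.3], goal=[3.0, 4.0, 0.3])
print(skill)
```

Under this assumption no extra training or planner is involved at test time: the agent recomputes the direction at each step and conditions its already-trained skill policy on it.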
