Dynamics-Aware Quality-Diversity for Efficient Learning of Skill Repertoires

09/16/2021
by   Bryan Lim, et al.
0

Quality-Diversity (QD) algorithms are powerful exploration algorithms that allow robots to discover large repertoires of diverse and high-performing skills. However, QD algorithms are sample inefficient and require millions of evaluations. In this paper, we propose Dynamics-Aware Quality-Diversity (DA-QD), a framework to improve the sample efficiency of QD algorithms through the use of dynamics models. We also show how DA-QD can then be used for continual acquisition of new skill repertoires. To do so, we incrementally train a deep dynamics model from experience obtained when performing skill discovery using QD. We can then perform QD exploration in imagination with an imagined skill repertoire. We evaluate our approach on three robotic experiments. First, our experiments show DA-QD is 20 times more sample efficient than existing QD approaches for skill discovery. Second, we demonstrate learning an entirely new skill repertoire in imagination to perform zero-shot learning. Finally, we show how DA-QD is useful and effective for solving a long horizon navigation task and for damage adaptation in the real world. Videos and source code are available at: https://sites.google.com/view/da-qd.

READ FULL TEXT

page 1

page 5

research
04/07/2022

Learning to Walk Autonomously via Reset-Free Quality-Diversity

Quality-Diversity (QD) algorithms can discover large and complex behavio...
research
02/02/2022

Lipschitz-constrained Unsupervised Skill Discovery

We study the problem of unsupervised skill discovery, whose goal is to l...
research
02/10/2023

Controllability-Aware Unsupervised Skill Discovery

One of the key capabilities of intelligent agents is the ability to disc...
research
10/23/2022

BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Datasets

To build open-domain chatbots that are able to use diverse communicative...
research
06/08/2022

Mathematical model bridges disparate timescales of lifelong learning

Lifelong learning occurs on timescales ranging from minutes to decades. ...
research
07/03/2023

SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain Adaptation

Domain adaptation (DA) has demonstrated significant promise for real-tim...
research
05/25/2023

Learning Better with Less: Effective Augmentation for Sample-Efficient Visual Reinforcement Learning

Data augmentation (DA) is a crucial technique for enhancing the sample e...

Please sign up or login with your details

Forgot password? Click here to reset