Learning to Walk Autonomously via Reset-Free Quality-Diversity

04/07/2022
by   Bryan Lim, et al.
0

Quality-Diversity (QD) algorithms can discover large and complex behavioural repertoires consisting of both diverse and high-performing skills. However, the generation of behavioural repertoires has mainly been limited to simulation environments instead of real-world learning. This is because existing QD algorithms need large numbers of evaluations as well as episodic resets, which require manual human supervision and interventions. This paper proposes Reset-Free Quality-Diversity optimization (RF-QD) as a step towards autonomous learning for robotics in open-ended environments. We build on Dynamics-Aware Quality-Diversity (DA-QD) and introduce a behaviour selection policy that leverages the diversity of the imagined repertoire and environmental information to intelligently select of behaviours that can act as automatic resets. We demonstrate this through a task of learning to walk within defined training zones with obstacles. Our experiments show that we can learn full repertoires of legged locomotion controllers autonomously without manual resets with high sample efficiency in spite of harsh safety constraints. Finally, using an ablation of different target objectives, we show that it is important for RF-QD to have diverse types solutions available for the behaviour selection policy over solutions optimised with a specific objective. Videos and code available at https://sites.google.com/view/rf-qd.

READ FULL TEXT
research
04/24/2023

Quality-Diversity Optimisation on a Physical Robot Through Dynamics-Aware and Reset-Free Learning

Learning algorithms, like Quality-Diversity (QD), can be used to acquire...
research
09/16/2021

Dynamics-Aware Quality-Diversity for Efficient Learning of Skill Repertoires

Quality-Diversity (QD) algorithms are powerful exploration algorithms th...
research
04/14/2023

Efficient Quality-Diversity Optimization through Diverse Quality Species

A prevalent limitation of optimizing over a single objective is that it ...
research
03/19/2021

Quality Evolvability ES: Evolving Individuals With a Distribution of Well Performing and Diverse Offspring

One of the most important lessons from the success of deep learning is t...
research
09/08/2021

Quality-Diversity Meta-Evolution: customising behaviour spaces to a meta-objective

Quality-Diversity (QD) algorithms evolve behaviourally diverse and high-...
research
10/06/2022

Training Diverse High-Dimensional Controllers by Scaling Covariance Matrix Adaptation MAP-Annealing

Pre-training a diverse set of robot controllers in simulation has enable...
research
07/25/2018

Prototype Discovery using Quality-Diversity

An iterative computer-aided ideation procedure is introduced, building o...

Please sign up or login with your details

Forgot password? Click here to reset