Model-Based Quality-Diversity Search for Efficient Robot Learning

08/11/2020
by   Leon Keller, et al.
0

Despite recent progress in robot learning, it still remains a challenge to program a robot to deal with open-ended object manipulation tasks. One approach that was recently used to autonomously generate a repertoire of diverse skills is a novelty based Quality-Diversity (QD) algorithm. However, as most evolutionary algorithms, QD suffers from sample-inefficiency and, thus, it is challenging to apply it in real-world scenarios. This paper tackles this problem by integrating a neural network that predicts the behavior of the perturbed parameters into a novelty based QD algorithm. In the proposed Model-based Quality-Diversity search (M-QD), the network is trained concurrently to the repertoire and is used to avoid executing unpromising actions in the novelty search process. Furthermore, it is used to adapt the skills of the final repertoire in order to generalize the skills to different scenarios. Our experiments show that enhancing a QD algorithm with such a forward model improves the sample-efficiency and performance of the evolutionary process and the skill adaptation.

READ FULL TEXT

page 1

page 4

research
10/18/2022

Online Damage Recovery for Physical Robots with Hierarchical Quality-Diversity

In real-world environments, robots need to be resilient to damages and r...
research
01/03/2019

From exploration to control: learning object manipulation skills through novelty search and local adaptation

Programming a robot to deal with open-ended tasks remains a challenge, i...
research
06/27/2022

Emergence of Novelty in Evolutionary Algorithms

One of the main problems of evolutionary algorithms is the convergence o...
research
05/02/2023

Evolution of linkages for prototyping of linkage based robots

Prototyping robotic systems is a time consuming process. Computer aided ...
research
09/11/2022

Diversity and Novelty MasterPrints: Generating Multiple DeepMasterPrints for Increased User Coverage

This work expands on previous advancements in genetic fingerprint spoofi...
research
10/14/2022

E2R: a Hierarchical-Learning inspired Novelty-Search method to generate diverse repertoires of grasping trajectories

Robotics grasping refers to the task of making a robotic system pick an ...
research
11/04/2021

Representation Edit Distance as a Measure of Novelty

Adaptation to novelty is viewed as learning to change and augment existi...

Please sign up or login with your details

Forgot password? Click here to reset