Hierarchical Quality-Diversity for Online Damage Recovery

04/12/2022
by   Maxime Allard, et al.
9

Adaptation capabilities, like damage recovery, are crucial for the deployment of robots in complex environments. Several works have demonstrated that using repertoires of pre-trained skills can enable robots to adapt to unforeseen mechanical damages in a few minutes. These adaptation capabilities are directly linked to the behavioural diversity in the repertoire. The more alternatives the robot has to execute a skill, the better are the chances that it can adapt to a new situation. However, solving complex tasks, like maze navigation, usually requires multiple different skills. Finding a large behavioural diversity for these multiple skills often leads to an intractable exponential growth of the number of required solutions. In this paper, we introduce the Hierarchical Trial and Error algorithm, which uses a hierarchical behavioural repertoire to learn diverse skills and leverages them to make the robot more adaptive to different situations. We show that the hierarchical decomposition of skills enables the robot to learn more complex behaviours while keeping the learning of the repertoire tractable. The experiments with a hexapod robot show that our method solves maze navigation tasks with 20 challenging scenarios than the best baseline while having 57 failures.

READ FULL TEXT
research
10/18/2022

Online Damage Recovery for Physical Robots with Hierarchical Quality-Diversity

In real-world environments, robots need to be resilient to damages and r...
research
07/13/2014

Robots that can adapt like animals

As robots leave the controlled environments of factories to autonomously...
research
10/13/2016

Reset-free Trial-and-Error Learning for Robot Damage Recovery

The high probability of hardware failures prevents many advanced robots ...
research
10/05/2016

Towards semi-episodic learning for robot damage recovery

The recently introduced Intelligent Trial and Error algorithm (IT&E) ena...
research
12/10/2020

Multi-expert learning of adaptive legged locomotion

Achieving versatile robot locomotion requires motor skills which can ada...
research
04/24/2023

Quality-Diversity Optimisation on a Physical Robot Through Dynamics-Aware and Reset-Free Learning

Learning algorithms, like Quality-Diversity (QD), can be used to acquire...
research
02/02/2013

Fast Damage Recovery in Robotics with the T-Resilience Algorithm

Damage recovery is critical for autonomous robots that need to operate f...

Please sign up or login with your details

Forgot password? Click here to reset