Multi-expert learning of adaptive legged locomotion

12/10/2020
by   Chuanyu Yang, et al.
1

Achieving versatile robot locomotion requires motor skills which can adapt to previously unseen situations. We propose a Multi-Expert Learning Architecture (MELA) that learns to generate adaptive skills from a group of representative expert skills. During training, MELA is first initialised by a distinct set of pre-trained experts, each in a separate deep neural network (DNN). Then by learning the combination of these DNNs using a Gating Neural Network (GNN), MELA can acquire more specialised experts and transitional skills across various locomotion modes. During runtime, MELA constantly blends multiple DNNs and dynamically synthesises a new DNN to produce adaptive behaviours in response to changing situations. This approach leverages the advantages of trained expert skills and the fast online synthesis of adaptive policies to generate responsive motor skills during the changing tasks. Using a unified MELA framework, we demonstrated successful multi-skill locomotion on a real quadruped robot that performed coherent trotting, steering, and fall recovery autonomously, and showed the merit of multi-expert learning generating behaviours which can adapt to unseen scenarios.

READ FULL TEXT

page 2

page 6

page 9

page 14

page 20

page 36

page 38

page 42

research
02/14/2023

Agile and Versatile Robot Locomotion via Kernel-based Residual Learning

This work developed a kernel-based residual learning framework for quadr...
research
06/29/2023

Identifying Important Sensory Feedback for Learning Locomotion Skills

Robot motor skills can be learned through deep reinforcement learning (D...
research
09/23/2017

Multi-task Learning with Gradient Guided Policy Specialization

We present a method for efficient learning of control policies for multi...
research
04/12/2022

Hierarchical Quality-Diversity for Online Damage Recovery

Adaptation capabilities, like damage recovery, are crucial for the deplo...
research
06/14/2023

Expanding Versatility of Agile Locomotion through Policy Transitions Using Latent State Representation

This paper proposes the transition-net, a robust transition strategy tha...
research
10/18/2022

Online Damage Recovery for Physical Robots with Hierarchical Quality-Diversity

In real-world environments, robots need to be resilient to damages and r...
research
10/22/2019

Learning Humanoid Robot Running Skills through Proximal Policy Optimization

In the current level of evolution of Soccer 3D, motion control is a key ...

Please sign up or login with your details

Forgot password? Click here to reset