Hierarchical Reinforcement Learning for Quadruped Locomotion

05/22/2019
by   Deepali Jain, et al.
0

Legged locomotion is a challenging task for learning algorithms, especially when the task requires a diverse set of primitive behaviors. To solve these problems, we introduce a hierarchical framework to automatically decompose complex locomotion tasks. A high-level policy issues commands in a latent space and also selects for how long the low-level policy will execute the latent command. Concurrently, the low-level policy uses the latent command and only the robot's on-board sensors to control the robot's actuators. Our approach allows the high-level policy to run at a lower frequency than the low-level one. We test our framework on a path-following task for a dynamic quadruped robot and we show that steering behaviors automatically emerge in the latent command space as low-level skills are needed for this task. We then show efficient adaptation of the trained policy to a different task by transfer of the trained low-level policy. Finally, we validate the policies on a real quadruped robot. To the best of our knowledge, this is the first application of end-to-end hierarchical learning to a real robotic locomotion task.

READ FULL TEXT

page 1

page 4

page 5

page 6

research
11/23/2020

From Pixels to Legs: Hierarchical Learning of Quadruped Locomotion

Legged robots navigating crowded scenes and complex terrains in the real...
research
09/26/2022

Advanced Skills by Learning Locomotion and Local Navigation End-to-End

The common approach for local navigation on challenging environments wit...
research
08/27/2020

Planning in Learned Latent Action Spaces for Generalizable Legged Locomotion

Hierarchical learning has been successful at learning generalizable loco...
research
10/17/2016

Learning and Transfer of Modulated Locomotor Controllers

We study a novel architecture and training procedure for locomotion task...
research
10/19/2012

Policy-contingent abstraction for robust robot control

This paper presents a scalable control algorithm that enables a deployed...
research
12/01/2022

Slack-based tunable damping leads to a trade-off between robustness and efficiency in legged locomotion

Animals run robustly in diverse terrain. This locomotion robustness is p...
research
04/09/2018

Latent Space Policies for Hierarchical Reinforcement Learning

We address the problem of learning hierarchical deep neural network poli...

Please sign up or login with your details

Forgot password? Click here to reset