Robust High-speed Running for Quadruped Robots via Deep Reinforcement Learning

03/11/2021
by   Guillaume Bellegarda, et al.
0

Deep reinforcement learning has emerged as a popular and powerful way to develop locomotion controllers for quadruped robots. Common approaches have largely focused on learning actions directly in joint space, or learning to modify and offset foot positions produced by trajectory generators. Both approaches typically require careful reward shaping and training for millions of time steps, and with trajectory generators introduce human bias into the resulting control policies. In this paper, we instead explore learning foot positions in Cartesian space, which we track with impedance control, for a task of running as fast as possible subject to environmental disturbances. Compared with other action spaces, we observe less needed reward shaping, much improved sample efficiency, the emergence of natural gaits such as galloping and bounding, and ease of sim-to-sim transfer. Policies can be learned in only a few million time steps, even for challenging tasks of running over rough terrain with loads of over 100 in PyBullet, and we perform a sim-to-sim transfer to Gazebo, where our quadruped is able to run at over 4 m/s without a load, and 3.5 m/s with a 10 kg load, which is over 83 found at https://youtu.be/roE1vxpEWfw.

READ FULL TEXT

page 1

page 4

research
09/26/2021

Finite State Machine Policies Modulating Trajectory Generator

Deep reinforcement learning (deep RL) has emerged as an effective tool f...
research
11/09/2020

Learning Task Space Actions for Bipedal Locomotion

Recent work has demonstrated the success of reinforcement learning (RL) ...
research
10/07/2019

Policies Modulating Trajectory Generators

We propose an architecture for learning complex controllable behaviors b...
research
06/19/2023

Sim-to-real transfer of active suspension control using deep reinforcement learning

We explore sim-to-real transfer of deep reinforcement learning controlle...
research
11/01/2022

CPG-RL: Learning Central Pattern Generators for Quadruped Locomotion

In this letter, we present a method for integrating central pattern gene...
research
05/20/2020

Learning natural locomotion behaviors for humanoid robots using human knowledge

This paper presents a new learning framework that leverages the knowledg...
research
08/29/2020

How does the structure embedded in learning policy affect learning quadruped locomotion?

Reinforcement learning (RL) is a popular data-driven method that has dem...

Please sign up or login with your details

Forgot password? Click here to reset