Reinforcement Learning with Evolutionary Trajectory Generator: A General Approach for Quadrupedal Locomotion

09/14/2021
by   Haojie Shi, et al.
0

Recently reinforcement learning (RL) has emerged as a promising approach for quadrupedal locomotion, which can save the manual effort in conventional approaches such as designing skill-specific controllers. However, due to the complex nonlinear dynamics in quadrupedal robots and reward sparsity, it is still difficult for RL to learn effective gaits from scratch, especially in challenging tasks such as walking over the balance beam. To alleviate such difficulty, we propose a novel RL-based approach that contains an evolutionary foot trajectory generator. Unlike prior methods that use a fixed trajectory generator, the generator continually optimizes the shape of the output trajectory for the given task, providing diversified motion priors to guide the policy learning. The policy is trained with reinforcement learning to output residual control signals that fit different gaits. We then optimize the trajectory generator and policy network alternatively to stabilize the training and share the exploratory data to improve sample efficiency. As a result, our approach can solve a range of challenging tasks in simulation by learning from scratch, including walking on a balance beam and crawling through the cave. To further verify the effectiveness of our approach, we deploy the controller learned in the simulation on a 12-DoF quadrupedal robot, and it can successfully traverse challenging scenarios with efficient gaits.

READ FULL TEXT

page 1

page 4

research
10/21/2020

Learning Spring Mass Locomotion: Guiding Policies with a Reduced-Order Model

In this paper, we describe an approach to achieve dynamic legged locomot...
research
07/08/2021

Adaptation of Quadruped Robot Locomotion with Meta-Learning

Animals have remarkable abilities to adapt locomotion to different terra...
research
10/07/2019

Policies Modulating Trajectory Generators

We propose an architecture for learning complex controllable behaviors b...
research
03/23/2022

Advanced Skills through Multiple Adversarial Motion Priors in Reinforcement Learning

In recent years, reinforcement learning (RL) has shown outstanding perfo...
research
10/03/2019

Hybrid Zero Dynamics Inspired Feedback Control Policy Design for 3D Bipedal Locomotion using Reinforcement Learning

This paper presents a novel model-free reinforcement learning (RL) frame...
research
10/10/2022

Efficient Learning of Locomotion Skills through the Discovery of Diverse Environmental Trajectory Generator Priors

Data-driven learning based methods have recently been particularly succe...

Please sign up or login with your details

Forgot password? Click here to reset