Combining model-predictive control and predictive reinforcement learning for stable quadrupedal robot locomotion

07/15/2023
by Vyacheslav Kovalev, et al.

Stable gait generation is a crucial problem for legged robot locomotion, as it affects other critical performance factors such as mobility over uneven terrain and power consumption. Gait stability results from efficient control of the interaction between the legged robot's body and the environment in which it moves. Here, we study how this can be achieved by combining model-predictive and predictive reinforcement learning controllers. Model-predictive control (MPC) is a well-established method that does not use any online learning (except in some adaptive variants) but provides a convenient interface for managing state constraints. Reinforcement learning (RL), in contrast, relies on adaptation based purely on experience. In its bare-bones variants, RL is not always suitable for robots because of their high complexity and the cost of simulation and experimentation. In this work, we combine both control methods to address the stable gait generation problem for quadrupedal robots. The hybrid approach that we develop and apply uses a cost roll-out algorithm with a tail cost in the form of a Q-function modeled by a neural network; this alleviates the computational complexity, which grows exponentially with the prediction horizon in a purely MPC approach. We demonstrate that our RL gait controller achieves stable locomotion at short horizons, where a nominal MPC controller fails. Further, our controller is capable of live operation, meaning that it does not require prior training. Our results suggest that the hybridization of MPC with RL, as presented here, helps achieve a good balance between online control capabilities and computational complexity.
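The roll-out idea in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the scalar plant, the three-element action set, the cost weights, and in particular `q_tail`, a hand-written quadratic that stands in for the neural-network Q-function. The sketch sums stage costs over the first steps of a candidate action sequence and scores the last state-action pair with the tail cost. Exhaustive enumeration makes the exponential growth of the search with the horizon explicit.

```python
import itertools

# Hypothetical toy plant (NOT the quadruped model from the paper):
# an unstable scalar linear system x_next = A * x + B * u.
A, B = 1.2, 1.0
ACTIONS = (-1.0, 0.0, 1.0)  # assumed finite set of candidate actions


def step(x, u):
    """Advance the toy plant by one time step."""
    return A * x + B * u


def stage_cost(x, u):
    """Quadratic running cost on state and action (assumed weights)."""
    return x * x + 0.1 * u * u


def q_tail(x, u):
    """Stand-in for a learned Q-function; the paper models it with a neural network."""
    x_next = step(x, u)
    return stage_cost(x, u) + 5.0 * x_next * x_next


def rollout_cost(x0, actions, tail=None):
    """Sum stage costs along the roll-out; score the last pair with `tail` if given."""
    x, total = x0, 0.0
    *head, last = actions
    for u in head:
        total += stage_cost(x, u)
        x = step(x, u)
    total += tail(x, last) if tail else stage_cost(x, last)
    return total


def best_first_action(x0, horizon, tail=None):
    """Enumerate all len(ACTIONS)**horizon sequences; return the first action of the cheapest."""
    best = min(
        itertools.product(ACTIONS, repeat=horizon),
        key=lambda seq: rollout_cost(x0, seq, tail),
    )
    return best[0]


def simulate(x0, horizon, tail=None, steps=20):
    """Run the receding-horizon controller in closed loop and return the final state."""
    x = x0
    for _ in range(steps):
        x = step(x, best_first_action(x, horizon, tail))
    return x


if __name__ == "__main__":
    print("horizon 1, no tail cost:  ", simulate(0.5, 1))
    print("horizon 1, with tail cost:", simulate(0.5, 1, q_tail))
```

In this toy setting, the horizon-1 controller without a tail cost always picks the zero action and the state diverges, while the same horizon with the tail cost keeps the state bounded. This only illustrates the mechanism and says nothing about the quadruped results reported in the paper.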


research
06/07/2021

Terrain Adaptive Gait Transitioning for a Quadruped Robot using Model Predictive Control

Legged robots can traverse challenging terrain, use perception to plan t...
research
08/29/2023

On the improvement of model-predictive controllers

This article investigates synthetic model-predictive control (MPC) probl...
research
02/22/2021

Reinforcement Learning of the Prediction Horizon in Model Predictive Control

Model predictive control (MPC) is a powerful trajectory optimization con...
research
02/08/2021

Fast Online Planning for Bipedal Locomotion via Centroidal Model Predictive Gait Synthesis

The planning of whole-body motion and step time for bipedal locomotion i...
research
11/26/2020

Optimization of the Model Predictive Control Update Interval Using Reinforcement Learning

In control applications there is often a compromise that needs to be mad...
research
03/18/2023

Hybrid Systems Neural Control with Region-of-Attraction Planner

Hybrid systems are prevalent in robotics. However, ensuring the stabilit...
research
07/27/2023

Fast Convex Visual Foothold Adaptation for Quadrupedal Locomotion

This extended abstract provides a short introduction on our recently dev...
