Learning Humanoid Locomotion with Transformers

03/06/2023
by   Ilija Radosavovic, et al.
0

We present a sim-to-real learning-based approach for real-world humanoid locomotion. Our controller is a causal Transformer trained by autoregressive prediction of future actions from the history of observations and actions. We hypothesize that the observation-action history contains useful information about the world that a powerful Transformer model can use to adapt its behavior in-context, without updating its weights. We do not use state estimation, dynamics models, trajectory optimization, reference trajectories, or pre-computed gait libraries. Our controller is trained with large-scale model-free reinforcement learning on an ensemble of randomized environments in simulation and deployed to the real world in a zero-shot fashion. We evaluate our approach in high-fidelity simulation and successfully deploy it to the real robot as well. To the best of our knowledge, this is the first demonstration of a fully learning-based method for real-world full-sized humanoid locomotion.

READ FULL TEXT

page 1

page 6

page 7

page 8

research
07/02/2022

Learning fast and agile quadrupedal locomotion over complex terrain

In this paper, we propose a robust controller that achieves natural and ...
research
09/15/2022

Learning to Exploit Elastic Actuators for Quadruped Locomotion

Spring-based actuators in legged locomotion provide energy-efficiency an...
research
12/15/2022

Sim-to-Real Transfer for Quadrupedal Locomotion via Terrain Transformer

Deep reinforcement learning has recently emerged as an appealing alterna...
research
03/04/2022

Bayesian Optimization Meets Hybrid Zero Dynamics: Safe Parameter Learning for Bipedal Locomotion Control

In this paper, we propose a multi-domain control parameter learning fram...
research
08/16/2022

A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning

Deep reinforcement learning is a promising approach to learning policies...
research
10/02/2022

Saving the Limping: Fault-tolerant Quadruped Locomotion via Reinforcement Learning

Quadruped locomotion now has acquired the skill to traverse or even spri...
research
10/20/2022

Weighted Maximum Likelihood for Controller Tuning

Recently, Model Predictive Contouring Control (MPCC) has arisen as the s...

Please sign up or login with your details

Forgot password? Click here to reset