OPT-Mimic: Imitation of Optimized Trajectories for Dynamic Quadruped Behaviors

10/03/2022
by   Yuni Fuchioka, et al.
0

Reinforcement Learning (RL) has seen many recent successes for quadruped robot control. The imitation of reference motions provides a simple and powerful prior for guiding solutions towards desired solutions without the need for meticulous reward design. While much work uses motion capture data or hand-crafted trajectories as the reference motion, relatively little work has explored the use of reference motions coming from model-based trajectory optimization. In this work, we investigate several design considerations that arise with such a framework, as demonstrated through four dynamic behaviours: trot, front hop, 180 backflip, and biped stepping. These are trained in simulation and transferred to a physical Solo 8 quadruped robot without further adaptation. In particular, we explore the space of feed-forward designs afforded by the trajectory optimizer to understand its impact on RL learning efficiency and sim-to-real transfer. These findings contribute to the long standing goal of producing robot controllers that combine the interpretability and precision of model-based optimization with the robustness that model-free RL-based controllers offer.

READ FULL TEXT

page 1

page 3

research
05/18/2023

Reinforcement Learning for Legged Robots: Motion Imitation from Model-Based Optimal Control

We propose MIMOC: Motion Imitation from Model-Based Optimal Control. MIM...
research
09/27/2021

Model-based Motion Imitation for Agile, Diverse and Generalizable Quadupedal Locomotion

Robots operating in human environments need a variety of skills, like sl...
research
10/30/2021

Learning Coordinated Terrain-Adaptive Locomotion by Imitating a Centroidal Dynamics Planner

Dynamic quadruped locomotion over challenging terrains with precise foot...
research
10/09/2022

Skeleton2Humanoid: Animating Simulated Characters for Physically-plausible Motion In-betweening

Human motion synthesis is a long-standing problem with various applicati...
research
11/02/2020

Sim-to-Real Learning of All Common Bipedal Gaits via Periodic Reward Composition

We study the problem of realizing the full spectrum of bipedal locomotio...
research
07/09/2022

Optimizing Bipedal Maneuvers of Single Rigid-Body Models for Reinforcement Learning

In this work, we propose a method to generate reduced-order model refere...
research
05/15/2023

AcroMonk: A Minimalist Underactuated Brachiating Robot

Brachiation is a dynamic, coordinated swinging maneuver of body and arms...

Please sign up or login with your details

Forgot password? Click here to reset