Dynamics Randomization Revisited:A Case Study for Quadrupedal Locomotion

11/04/2020
by   Zhaoming Xie, et al.
0

Understanding the gap between simulation andreality is critical for reinforcement learning with legged robots,which are largely trained in simulation. However, recent workhas resulted in sometimes conflicting conclusions with regardto which factors are important for success, including therole of dynamics randomization. In this paper, we aim toprovide clarity and understanding on the role of dynamicsrandomization in learning robust locomotion policies for theLaikago quadruped robot. Surprisingly, in contrast to priorwork with the same robot model, we find that direct sim-to-real transfer is possible without dynamics randomizationor on-robot adaptation schemes. We conduct extensive abla-tion studies in a sim-to-sim setting to understand the keyissues underlying successful policy transfer, including otherdesign decisions that can impact policy robustness. We furtherground our conclusions via sim-to-real experiments with variousgaits, speeds, and stepping frequencies. Additional Details: https://www.pair.toronto.edu/understanding-dr/.

READ FULL TEXT

page 1

page 2

page 4

research
06/02/2019

Learning Domain Randomization Distributions for Transfer of Locomotion Policies

Domain randomization (DR) is a successful technique for learning robust ...
research
09/26/2022

Learning and Deploying Robust Locomotion Policies with Minimal Dynamics Randomization

Training deep reinforcement learning (DRL) locomotion policies often req...
research
02/19/2020

Sim2Real Transfer for Reinforcement Learning without Dynamics Randomization

In this work we show how to use the Operational Space Control framework ...
research
10/22/2020

Dynamics and Domain Randomized Gait Modulation with Bezier Curves for Sim-to-Real Legged Locomotion

We present a sim-to-real framework that uses dynamics and domain randomi...
research
09/29/2022

Learning Low-Frequency Motion Control for Robust and Dynamic Robot Locomotion

Robotic locomotion is often approached with the goal of maximizing robus...
research
11/19/2021

Reinforcement Learning with Adaptive Curriculum Dynamics Randomization for Fault-Tolerant Robot Control

This study is aimed at addressing the problem of fault tolerance of quad...
research
06/03/2020

Learning Memory-Based Control for Human-Scale Bipedal Locomotion

Controlling a non-statically stable biped is a difficult problem largely...

Please sign up or login with your details

Forgot password? Click here to reset