Learning Domain Randomization Distributions for Transfer of Locomotion Policies

06/02/2019
by   Melissa Mozifian, et al.
0

Domain randomization (DR) is a successful technique for learning robust policies for robot systems, when the dynamics of the target robot system are unknown. The success of policies trained with domain randomization however, is highly dependent on the correct selection of the randomization distribution. The majority of success stories typically use real world data in order to carefully select the DR distribution, or incorporate real world trajectories to better estimate appropriate randomization distributions. In this paper, we consider the problem of finding good domain randomization parameters for simulation, without prior access to data from the target system. We explore the use of gradient-based search methods to learn a domain randomization with the following properties: 1) The trained policy should be successful in environments sampled from the domain randomization distribution 2) The domain randomization distribution should be wide enough so that the experience similar to the target robot system is observed during training, while addressing the practicality of training finite capacity models. These two properties aim to ensure the trajectories encountered in the target system are close to those observed during training, as existing methods in machine learning are better suited for interpolation than extrapolation. We show how adapting the domain randomization distribution while training context-conditioned policies results in improvements on jump-start and asymptotic performance when transferring a learned policy to the target environment.

READ FULL TEXT

page 4

page 5

page 6

page 7

research
03/05/2020

Bayesian Domain Randomization for Sim-to-Real Transfer

When learning policies for robot control, the real-world data required i...
research
10/12/2018

Closing the Sim-to-Real Loop: Adapting Simulation Randomization with Real World Experience

We consider the problem of transferring policies to the real world by tr...
research
03/28/2019

How to pick the domain randomization parameters for sim-to-real transfer of reinforcement learning policies?

Recently, reinforcement learning (RL) algorithms have demonstrated remar...
research
11/04/2020

Dynamics Randomization Revisited:A Case Study for Quadrupedal Locomotion

Understanding the gap between simulation andreality is critical for rein...
research
09/28/2021

Not Only Domain Randomization: Universal Policy with Embedding System Identification

Domain randomization (DR) cannot provide optimal policies for adapting t...
research
09/18/2017

Sim-to-real Transfer of Visuo-motor Policies for Reaching in Clutter: Domain Randomization and Adaptation with Modular Networks

A modular method is proposed to learn and transfer visuo-motor policies ...
research
11/01/2021

Validate on Sim, Detect on Real – Model Selection for Domain Randomization

A practical approach to learning robot skills, often termed sim2real, is...

Please sign up or login with your details

Forgot password? Click here to reset