Active Domain Randomization

04/09/2019
by   Bhairav Mehta, et al.
0

Domain randomization is a popular technique for improving domain transfer, often used in a zero-shot setting when the target domain is unknown or cannot easily be used for training. In this work, we empirically examine the effects of domain randomization on agent generalization. Our experiments show that domain randomization may lead to suboptimal, high-variance policies, which we attribute to the uniform sampling of environment parameters. We propose Active Domain Randomization, a novel algorithm that learns a parameter sampling strategy. Our method looks for the most informative environment variations within the given randomization ranges by leveraging the discrepancies of policy rollouts in randomized and reference environment instances. We find that training more frequently on these instances leads to better overall agent generalization. In addition, when domain randomization and policy transfer fail, Active Domain Randomization offers more insight into the deficiencies of both the chosen parameter ranges and the learned policy, allowing for more focused debugging. Our experiments across various physics-based simulated and a real-robot task show that this enhancement leads to more robust, consistent policies.

READ FULL TEXT

page 2

page 6

research
10/23/2019

Robust Domain Randomization for Reinforcement Learning

Producing agents that can generalize to a wide range of environments is ...
research
04/29/2021

Pre-training of Deep RL Agents for Improved Learning under Domain Randomization

Visual domain randomization in simulated environments is a widely used m...
research
12/03/2020

Intervention Design for Effective Sim2Real Transfer

The goal of this work is to address the recent success of domain randomi...
research
11/03/2020

Policy Transfer via Kinematic Domain Randomization and Adaptation

Transferring reinforcement learning policies trained in physics simulati...
research
12/03/2018

VADRA: Visual Adversarial Domain Randomization and Augmentation

We address the issue of learning from synthetic domain randomized data e...
research
05/19/2022

Concurrent Policy Blending and System Identification for Generalized Assistive Control

In this work, we address the problem of solving complex collaborative ro...
research
07/29/2022

Cyclic Policy Distillation: Sample-Efficient Sim-to-Real Reinforcement Learning with Domain Randomization

Deep reinforcement learning with domain randomization learns a control p...

Please sign up or login with your details

Forgot password? Click here to reset