Robust Domain Randomization for Reinforcement Learning

10/23/2019
by   Reda Bahi Slaoui, et al.
21

Producing agents that can generalize to a wide range of environments is a significant challenge in reinforcement learning. One method for overcoming this issue is domain randomization, whereby at the start of each training episode some parameters of the environment are randomized so that the agent is exposed to many possible variations. However, domain randomization is highly inefficient and may lead to policies with high variance across domains. In this work, we formalize the domain randomization problem, and show that minimizing the policy's Lipschitz constant with respect to the randomization parameters leads to low variance in the learned policies. We propose a method where the agent only needs to be trained on one variation of the environment, and its learned state representations are regularized during training to minimize this constant. We conduct experiments that demonstrate that our technique leads to more efficient and robust learning than standard domain randomization, while achieving equal generalization scores.

READ FULL TEXT

page 7

page 8

page 18

research
04/09/2019

Active Domain Randomization

Domain randomization is a popular technique for improving domain transfe...
research
04/29/2021

Pre-training of Deep RL Agents for Improved Learning under Domain Randomization

Visual domain randomization in simulated environments is a widely used m...
research
09/28/2021

Not Only Domain Randomization: Universal Policy with Embedding System Identification

Domain randomization (DR) cannot provide optimal policies for adapting t...
research
07/29/2022

Cyclic Policy Distillation: Sample-Efficient Sim-to-Real Reinforcement Learning with Domain Randomization

Deep reinforcement learning with domain randomization learns a control p...
research
03/07/2023

Domain Randomization for Robust, Affordable and Effective Closed-loop Control of Soft Robots

Soft robots are becoming extremely popular thanks to their intrinsic saf...
research
09/15/2019

Wield: Systematic Reinforcement Learning With Progressive Randomization

Reinforcement learning frameworks have introduced abstractions to implem...
research
10/11/2019

A Simple Randomization Technique for Generalization in Deep Reinforcement Learning

Deep reinforcement learning (RL) agents often fail to generalize to unse...

Please sign up or login with your details

Forgot password? Click here to reset