Auto-Tuned Sim-to-Real Transfer

04/15/2021
by   Yuqing Du, et al.
0

Policies trained in simulation often fail when transferred to the real world due to the `reality gap' where the simulator is unable to accurately capture the dynamics and visual properties of the real world. Current approaches to tackle this problem, such as domain randomization, require prior knowledge and engineering to determine how much to randomize system parameters in order to learn a policy that is robust to sim-to-real transfer while also not being too conservative. We propose a method for automatically tuning simulator system parameters to match the real world using only raw RGB images of the real world without the need to define rewards or estimate state. Our key insight is to reframe the auto-tuning of parameters as a search problem where we iteratively shift the simulation system parameters to approach the real-world system parameters. We propose a Search Param Model (SPM) that, given a sequence of observations and actions and a set of system parameters, predicts whether the given parameters are higher or lower than the true parameters used to generate the observations. We evaluate our method on multiple robotic control tasks in both sim-to-sim and sim-to-real transfer, demonstrating significant improvement over naive domain randomization. Project videos and code at https://yuqingd.github.io/autotuned-sim2real/

READ FULL TEXT

page 1

page 6

research
03/05/2020

Bayesian Domain Randomization for Sim-to-Real Transfer

When learning policies for robot control, the real-world data required i...
research
03/03/2020

Traversing the Reality Gap via Simulator Tuning

The large demand for simulated data has made the reality gap a problem o...
research
10/07/2021

Understanding Domain Randomization for Sim-to-real Transfer

Reinforcement learning encounters many challenges when applied directly ...
research
07/25/2019

TuneNet: One-Shot Residual Tuning for System Identification and Sim-to-Real Robot Task Transfer

As researchers teach robots to perform more and more complex tasks, the ...
research
10/20/2022

Weighted Maximum Likelihood for Controller Tuning

Recently, Model Predictive Contouring Control (MPCC) has arisen as the s...
research
09/17/2019

Learning to Manipulate Object Collections Using Grounded State Representations

We propose a method for sim-to-real robot learning which exploits simula...
research
06/02/2020

Learning Active Task-Oriented Exploration Policies for Bridging the Sim-to-Real Gap

Training robotic policies in simulation suffers from the sim-to-real gap...

Please sign up or login with your details

Forgot password? Click here to reset