AutoSimulate: (Quickly) Learning Synthetic Data Generation

08/16/2020
by   Harkirat Singh Behl, et al.
16

Simulation is increasingly being used for generating large labelled datasets in many machine learning problems. Recent methods have focused on adjusting simulator parameters with the goal of maximising accuracy on a validation task, usually relying on REINFORCE-like gradient estimators. However these approaches are very expensive as they treat the entire data generation, model training, and validation pipeline as a black-box and require multiple costly objective evaluations at each iteration. We propose an efficient alternative for optimal synthetic data generation, based on a novel differentiable approximation of the objective. This allows us to optimize the simulator, which may be non-differentiable, requiring only one objective evaluation at each iteration with a little overhead. We demonstrate on a state-of-the-art photorealistic renderer that the proposed method finds the optimal data distribution faster (up to 50×), with significantly reduced training data generation (up to 30×) and better accuracy (+8.7%) on real-world test datasets than previous methods.

READ FULL TEXT

page 11

page 12

research
10/05/2018

Learning To Simulate

Simulation is a useful tool in situations where training data for machin...
research
11/13/2021

HydraGAN A Multi-head, Multi-objective Approach to Synthetic Data Generation

Synthetic data generation overcomes limitations of real-world machine le...
research
05/03/2021

Synthetic Data for Model Selection

Recent improvements in synthetic data generation make it possible to pro...
research
06/29/2021

SinGAN-Seg: Synthetic Training Data Generation for Medical Image Segmentation

Processing medical data to find abnormalities is a time-consuming and co...
research
08/10/2023

Zero Grads Ever Given: Learning Local Surrogate Losses for Non-Differentiable Graphics

Gradient-based optimization is now ubiquitous across graphics, but unfor...
research
07/22/2022

Neural-Sim: Learning to Generate Training Data with NeRF

Training computer vision models usually requires collecting and labeling...
research
11/09/2020

Spring-Rod System Identification via Differentiable Physics Engine

We propose a novel differentiable physics engine for system identificati...

Please sign up or login with your details

Forgot password? Click here to reset