Using Synthetic Data to Train Neural Networks is Model-Based Reasoning

03/02/2017
by Tuan Anh Le, et al.

We draw a formal connection between using synthetic training data to optimize neural network parameters and approximate, Bayesian, model-based reasoning. In particular, training a neural network using synthetic data can be viewed as learning a proposal distribution generator for approximate inference in the synthetic-data generative model. We illustrate this connection on a recognition task: we develop a novel Captcha-breaking architecture and train it using synthetic data, obtaining both state-of-the-art performance and a way of computing task-specific posterior uncertainty. Using a neural network trained this way, we also break real-world Captchas currently used by Facebook and Wikipedia. Reasoning from these empirical results and drawing connections with Bayesian modeling, we discuss the robustness of synthetic data results and suggest important considerations for ensuring good neural network generalization when training with synthetic data.
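The central claim, that fitting a network to synthetic (latent, observation) pairs amounts to learning a proposal distribution for inference in the generative model, can be illustrated with a small sketch. Everything below is an assumption made for illustration and is not the paper's Captcha architecture: a hypothetical toy model with a three-valued latent `x`, a Gaussian observation `y`, and a softmax-regression "network" `q(x | y)`. Minimizing cross-entropy on samples from the model is a Monte Carlo estimate of the expected negative log-probability of the latent under `q`, and the fitted `q` is then used as an importance-sampling proposal.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy generative model (not from the paper):
# x ~ Uniform{0, 1, 2},  y | x ~ Normal(MU[x], SIGMA^2)
MU = np.array([-2.0, 0.0, 2.0])
SIGMA = 1.0
K = len(MU)

def sample_synthetic(n):
    """Draw n (latent, observation) pairs from the generative model."""
    x = rng.integers(0, K, size=n)
    y = rng.normal(MU[x], SIGMA)
    return x, y

def features(y):
    """Map observations to [y, 1] so the softmax logits are affine in y."""
    y = np.atleast_1d(np.asarray(y, dtype=float))
    return np.stack([y, np.ones_like(y)], axis=1)

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def train_proposal(n=10_000, steps=3_000, lr=0.3):
    """Fit q(x | y) by gradient descent on cross-entropy over synthetic data."""
    x, y = sample_synthetic(n)
    f = features(y)
    onehot = np.eye(K)[x]
    w = np.zeros((2, K))
    for _ in range(steps):
        q = softmax(f @ w)
        w -= lr * f.T @ (q - onehot) / n  # gradient of the mean cross-entropy
    return w

def proposal(w, y):
    """Learned proposal q(. | y) for a single observation y."""
    return softmax(features(y) @ w)[0]

def exact_posterior(y):
    """Exact p(x | y); tractable here only because the toy model is tiny."""
    log_joint = -0.5 * ((y - MU) / SIGMA) ** 2  # uniform prior cancels
    p = np.exp(log_joint - log_joint.max())
    return p / p.sum()

def importance_posterior(w, y, n=5_000):
    """Self-normalized importance sampling using q(. | y) as the proposal."""
    q = proposal(w, y)
    xs = rng.choice(K, size=n, p=q)
    likelihood = np.exp(-0.5 * ((y - MU[xs]) / SIGMA) ** 2)
    weights = (likelihood / K) / q[xs]  # p(x) p(y | x) / q(x | y), up to a constant
    estimate = np.bincount(xs, weights=weights, minlength=K)
    return estimate / estimate.sum()

w = train_proposal()
y_obs = 0.5
print("learned proposal  :", np.round(proposal(w, y_obs), 3))
print("importance sampled:", np.round(importance_posterior(w, y_obs), 3))
print("exact posterior   :", np.round(exact_posterior(y_obs), 3))
```

In this toy setting the exact posterior is itself a softmax of an affine function of `y`, so the learned proposal can match it closely. In a realistic model the proposal would only approximate the posterior, and the importance weights would correct for the mismatch.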

