Data Augmentation via Levy Processes

03/21/2016
by   Stefan Wager, et al.
0

If a document is about travel, we may expect that short snippets of the document should also be about travel. We introduce a general framework for incorporating these types of invariances into a discriminative classifier. The framework imagines data as being drawn from a slice of a Levy process. If we slice the Levy process at an earlier point in time, we obtain additional pseudo-examples, which can be used to train the classifier. We show that this scheme has two desirable properties: it preserves the Bayes decision boundary, and it is equivalent to fitting a generative model in the limit where we rewind time back to 0. Our construction captures popular schemes such as Gaussian feature noising and dropout training, as well as admitting new generalizations.

READ FULL TEXT
research
07/12/2022

Logistics, Graphs, and Transformers: Towards improving Travel Time Estimation

The problem of travel time estimation is widely considered as the fundam...
research
07/11/2014

Altitude Training: Strong Bounds for Single-Layer Dropout

Dropout training, originally designed for deep neural networks, has been...
research
07/16/2019

Information processing constraints in travel behaviour modelling: A generative learning approach

Travel decisions tend to exhibit sensitivity to uncertainty and informat...
research
08/30/2022

Augraphy: A Data Augmentation Library for Document Images

This paper introduces Augraphy, a Python package geared toward realistic...
research
12/15/2021

DG2: Data Augmentation Through Document Grounded Dialogue Generation

Collecting data for training dialog systems can be extremely expensive d...
research
04/30/2018

Hybrid Forests for Left Ventricle Segmentation using only the first slice label

Machine learning models produce state-of-the-art results in many MRI ima...

Please sign up or login with your details

Forgot password? Click here to reset