Efficient Augmentation via Data Subsampling

10/11/2018
by   Michael Kuchnik, et al.
0

Data augmentation is commonly used to encode invariances in learning methods. However, this process is often performed in an inefficient manner, as artificial examples are created by applying a number of transformations to all points in the training set. The resulting explosion of the dataset size can be an issue in terms of storage and training costs, as well as in selecting and tuning the optimal set of transformations to apply. In this work, we demonstrate that it is possible to significantly reduce the number of data points included in data augmentation while realizing the same accuracy and invariance benefits of augmenting the entire dataset. We propose a novel set of subsampling policies, based on model influence and loss, that can achieve a 90 reduction in augmentation set size while maintaining the accuracy gains of standard data augmentation.

READ FULL TEXT
research
06/22/2021

Data Augmentation for Opcode Sequence Based Malware Detection

Data augmentation has been successfully used in many areas of deep-learn...
research
04/07/2020

Probabilistic Spatial Transformers for Bayesian Data Augmentation

High-capacity models require vast amounts of data, and data augmentation...
research
05/29/2022

Saliency Map Based Data Augmentation

Data augmentation is a commonly applied technique with two seemingly rel...
research
12/28/2020

Data augmentation and image understanding

Interdisciplinary research is often at the core of scientific progress. ...
research
07/21/2022

Order Determination for Tensor-valued Observations Using Data Augmentation

Tensor-valued data benefits greatly from dimension reduction as the redu...
research
02/17/2017

Dataset Augmentation in Feature Space

Dataset augmentation, the practice of applying a wide array of domain-sp...
research
10/19/2020

Introducing and Applying Newtonian Blurring: An Augmented Dataset of 126,000 Human Connectomes at braingraph.org

Gaussian blurring is a well-established method for image data augmentati...

Please sign up or login with your details

Forgot password? Click here to reset