Understanding data augmentation for classification: when to warp?

09/28/2016
by   Sebastien C. Wong, et al.
0

In this paper we investigate the benefit of augmenting data with synthetically created samples when training a machine learning classifier. Two approaches for creating additional training samples are data warping, which generates additional samples through transformations applied in the data-space, and synthetic over-sampling, which creates additional samples in feature-space. We experimentally evaluate the benefits of data augmentation for a convolutional backpropagation-trained neural network, a convolutional support vector machine and a convolutional extreme learning machine classifier, using the standard MNIST handwritten digit dataset. We found that while it is possible to perform generic augmentation in feature-space, if plausible transforms for the data are known then augmentation in data-space provides a greater benefit for improving performance and reducing overfitting.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/18/2020

Image-based Automated Species Identification: Can Virtual Data Augmentation Overcome Problems of Insufficient Sampling?

Automated species identification and delimitation is challenging, partic...
research
01/12/2023

Diffusion-based Data Augmentation for Skin Disease Classification: Impact Across Original Medical Datasets to Fully Synthetic Images

Despite continued advancement in recent years, deep neural networks stil...
research
08/03/2022

A Feature-space Multimodal Data Augmentation Technique for Text-video Retrieval

Every hour, huge amounts of visual contents are posted on social media a...
research
10/29/2017

A Bayesian Data Augmentation Approach for Learning Deep Models

Data augmentation is an essential part of the training process applied t...
research
10/24/2019

Superposition as Data Augmentation using LSTM and HMM in Small Training Sets

Considering audio and image data as having quantum nature (data are repr...
research
02/17/2017

Dataset Augmentation in Feature Space

Dataset augmentation, the practice of applying a wide array of domain-sp...
research
12/18/2013

Unsupervised feature learning by augmenting single images

When deep learning is applied to visual object recognition, data augment...

Please sign up or login with your details

Forgot password? Click here to reset