Data Augmentation for Deep Transfer Learning

11/28/2019
by   Cameron R. Wolfe, et al.
0

Current approaches to deep learning are beginning to rely heavily on transfer learning as an effective method for reducing overfitting, improving model performance, and quickly learning new tasks. Similarly, such pre-trained models are often used to create embedding representations for various types of data, such as text and images, which can then be fed as input into separate, downstream models. However, in cases where such transfer learning models perform poorly (i.e., for data outside of the training distribution), one must resort to fine-tuning such models, or even retraining them completely. Currently, no form of data augmentation has been proposed that can be applied directly to embedding inputs to improve downstream model performance. In this work, we introduce four new types of data augmentation that are generally applicable to embedding inputs, thus making them useful in both Natural Language Processing (NLP) and Computer Vision (CV) applications. For models trained on downstream tasks with such embedding inputs, these augmentation methods are shown to improve the AUC score of the models from a score of 0.9582 to 0.9812 and significantly increase the model's ability to identify classes of data that are not seen during training.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/05/2023

Exploring Data Augmentation for Code Generation Tasks

Advances in natural language processing, such as transfer learning from ...
research
01/10/2021

Transfer Learning and Augmentation for Word Sense Disambiguation

Many downstream NLP tasks have shown significant improvement through con...
research
02/12/2020

Efficient Training of Deep Convolutional Neural Networks by Augmentation in Embedding Space

Recent advances in the field of artificial intelligence have been made p...
research
10/06/2022

Data-driven Approaches to Surrogate Machine Learning Model Development

We demonstrate the adaption of three established methods to the field of...
research
08/24/2022

Improving Natural-Language-based Audio Retrieval with Transfer Learning and Audio Text Augmentations

The absence of large labeled datasets remains a significant challenge in...
research
02/22/2021

MixUp Training Leads to Reduced Overfitting and Improved Calibration for the Transformer Architecture

MixUp is a computer vision data augmentation technique that uses convex ...
research
03/21/2018

Learning the Localization Function: Machine Learning Approach to Fingerprinting Localization

Considered as a data-driven approach, Fingerprinting Localization Soluti...

Please sign up or login with your details

Forgot password? Click here to reset