Text Augmentation in a Multi-Task View

01/14/2021
by Jason Wei, et al.

Traditional data augmentation aims to increase the coverage of the input distribution by generating augmented examples that strongly resemble original samples in an online fashion where augmented examples dominate training. In this paper, we propose an alternative perspective – a multi-task view (MTV) of data augmentation – in which the primary task trains on original examples and the auxiliary task trains on augmented examples. In MTV data augmentation, both original and augmented samples are weighted substantively during training, relaxing the constraint that augmented examples must resemble original data and thereby allowing us to apply stronger levels of augmentation. In empirical experiments using four common data augmentation techniques on three benchmark text classification datasets, we find that the MTV leads to higher and more robust performance improvements than traditional augmentation.
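To make the multi-task framing concrete, the following is a minimal sketch, not the authors' implementation. It assumes the two tasks are combined as a weighted sum of a primary loss on original examples and an auxiliary loss on augmented examples. The abstract does not state the exact combination rule or name the four augmentation techniques, so the `aug_weight` parameter, the function names, and the choice of random token deletion as the augmentation are all illustrative assumptions.

```python
import random


def random_deletion(tokens, p, rng):
    """Illustrative augmentation (assumed, not named in the abstract):
    drop each token independently with probability p."""
    kept = [t for t in tokens if rng.random() >= p]
    # Never return an empty example; fall back to one random token.
    return kept if kept else [rng.choice(tokens)]


def mtv_loss(orig_losses, aug_losses, aug_weight=0.5):
    """Combine per-example losses from the two tasks.

    Assumption: the primary task (original data) and the auxiliary task
    (augmented data) are averaged separately, then mixed with a fixed
    weight so that neither set of examples dominates training.
    """
    primary = sum(orig_losses) / len(orig_losses)
    auxiliary = sum(aug_losses) / len(aug_losses)
    return (1.0 - aug_weight) * primary + aug_weight * auxiliary


if __name__ == "__main__":
    rng = random.Random(0)
    sentence = "the movie was surprisingly good".split()
    # A stronger augmentation level (larger p) is tolerable here because
    # the original examples keep their own substantive share of the loss.
    print(random_deletion(sentence, p=0.5, rng=rng))
    print(mtv_loss([1.0, 3.0], [4.0], aug_weight=0.25))
```

Under this sketch, traditional augmentation corresponds roughly to an `aug_weight` close to 1, where augmented examples dominate; the multi-task view instead keeps both terms substantive.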

research
03/12/2021

Few-Shot Text Classification with Triplet Networks, Data Augmentation, and Curriculum Learning

Few-shot text classification is a fundamental NLP task in which a model ...
research
07/09/2020

Untapped Potential of Data Augmentation: A Domain Generalization Viewpoint

Data augmentation is a popular pre-processing trick to improve generaliz...
research
04/26/2022

Reprint: a randomized extrapolation based on principal components for data augmentation

Data scarcity and data imbalance have attracted a lot of attention in ma...
research
11/23/2020

KeepAugment: A Simple Information-Preserving Data Augmentation Approach

Data augmentation (DA) is an essential technique for training state-of-t...
research
06/02/2023

Exploring semantic information in disease: Simple Data Augmentation Techniques for Chinese Disease Normalization

The disease is a core concept in the medical field, and the task of norm...
research
02/15/2022

A Theory of PAC Learnability under Transformation Invariances

Transformation invariances are present in many real-world problems. For ...
research
09/17/2023

Enhancing Knee Osteoarthritis severity level classification using diffusion augmented images

This research paper explores the classification of knee osteoarthritis (...
