Augmenting Data with Mixup for Sentence Classification: An Empirical Study

05/22/2019
by   Hongyu Guo, et al.
0

Mixup, a recent proposed data augmentation method through linearly interpolating inputs and modeling targets of random samples, has demonstrated its capability of significantly improving the predictive accuracy of the state-of-the-art networks for image classification. However, how this technique can be applied to and what is its effectiveness on natural language processing (NLP) tasks have not been investigated. In this paper, we propose two strategies for the adaption of Mixup on sentence classification: one performs interpolation on word embeddings and another on sentence embeddings. We conduct experiments to evaluate our methods using several benchmark datasets. Our studies show that such interpolation strategies serve as an effective, domain independent data augmentation approach for sentence classification, and can result in significant accuracy improvement for both CNN and LSTM models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/28/2021

LINDA: Unsupervised Learning to Interpolate in Natural Language Processing

Despite the success of mixup in data augmentation, its applicability to ...
research
03/15/2022

Generalized but not Robust? Comparing the Effects of Data Modification Methods on Out-of-Domain Generalization and Adversarial Robustness

Data modification, either via additional training datasets, data augment...
research
10/05/2020

Mixup-Transfomer: Dynamic Data Augmentation for NLP Tasks

Mixup is the latest data augmentation technique that linearly interpolat...
research
06/29/2022

Teach me how to Interpolate a Myriad of Embeddings

Mixup refers to interpolation-based data augmentation, originally motiva...
research
06/15/2022

BaIT: Barometer for Information Trustworthiness

This paper presents a new approach to the FNC-1 fake news classification...
research
04/19/2023

MixPro: Simple yet Effective Data Augmentation for Prompt-based Learning

Prompt-based learning reformulates downstream tasks as cloze problems by...
research
12/27/2022

MixupE: Understanding and Improving Mixup from Directional Derivative Perspective

Mixup is a popular data augmentation technique for training deep neural ...

Please sign up or login with your details

Forgot password? Click here to reset