Good-Enough Example Extrapolation

09/12/2021
by Jason Wei, et al.

This paper asks whether extrapolating the hidden space distribution of text examples from one class onto another is a valid inductive bias for data augmentation. To operationalize this question, I propose a simple data augmentation protocol called "good-enough example extrapolation" (GE3). GE3 is lightweight and has no hyperparameters. Applied to three text classification datasets for various data imbalance scenarios, GE3 improves performance more than upsampling and other hidden-space data augmentation methods.
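The abstract does not spell out the extrapolation step, so the following is only a minimal sketch under one stated assumption: that "extrapolating the hidden space distribution" from a source class onto a target class means shifting each source example's hidden vector by the difference between the two class means. The function names (`class_mean`, `extrapolate_examples`) and the toy 2-D vectors are hypothetical and chosen for illustration; in practice the vectors would come from a text encoder.

```python
from typing import Dict, List, Sequence

Vector = List[float]


def class_mean(vectors: Sequence[Vector]) -> Vector:
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def extrapolate_examples(
    hidden_by_class: Dict[str, List[Vector]], target: str
) -> List[Vector]:
    """Create synthetic hidden vectors for `target` from every other class.

    Assumption (not stated in the abstract): each source example is shifted
    by (target mean - source mean), so the spread of the source class around
    its mean is copied onto the target class.
    """
    target_mean = class_mean(hidden_by_class[target])
    augmented: List[Vector] = []
    for label, vectors in hidden_by_class.items():
        if label == target:
            continue  # only other classes act as sources
        source_mean = class_mean(vectors)
        for vec in vectors:
            augmented.append(
                [v - s + t for v, s, t in zip(vec, source_mean, target_mean)]
            )
    return augmented


# Toy imbalanced setting: four majority examples, one minority example.
hidden = {
    "majority": [[0.0, 0.0], [2.0, 0.0], [0.0, 2.0], [2.0, 2.0]],
    "minority": [[10.0, 10.0]],
}
synthetic = extrapolate_examples(hidden, "minority")
```

In this sketch the synthetic minority vectors are centered on the minority mean but inherit the majority class's spread. The translation has no tunable parameters, which is at least consistent with the abstract's claim that GE3 has no hyperparameters.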


