FlipDA: Effective and Robust Data Augmentation for Few-Shot Learning

08/13/2021
by   Jing Zhou, et al.
12

Most previous methods for text data augmentation are limited to simple tasks and weak baselines. We explore data augmentation on hard tasks (i.e., few-shot natural language understanding) and strong baselines (i.e., pretrained models with over one billion parameters). Under this setting, we reproduced a large number of previous augmentation methods and found that these methods bring marginal gains at best and sometimes degrade the performance much. To address this challenge, we propose a novel data augmentation method FlipDA that jointly uses a generative model and a classifier to generate label-flipped data. Central to the idea of FlipDA is the discovery that generating label-flipped data is more crucial to the performance than generating label-preserved data. Experiments show that FlipDA achieves a good tradeoff between effectiveness and robustness—it substantially improves many tasks while not negatively affecting the others.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/18/2022

PromptDA: Label-guided Data Augmentation for Prompt-based Few Shot Learners

Recent advances on large pre-trained language models (PLMs) lead impress...
research
09/21/2020

SSMBA: Self-Supervised Manifold Based Data Augmentation for Improving Out-of-Domain Robustness

Models that perform well on a training domain often fail to generalize t...
research
04/19/2023

MixPro: Simple yet Effective Data Augmentation for Prompt-based Learning

Prompt-based learning reformulates downstream tasks as cloze problems by...
research
03/02/2023

Mixture of Soft Prompts for Controllable Data Generation

Large language models (LLMs) effectively generate fluent text when the t...
research
10/04/2022

Nuisances via Negativa: Adjusting for Spurious Correlations via Data Augmentation

There exist features that are related to the label in the same way acros...
research
01/18/2022

Boosting Robustness of Image Matting with Context Assembling and Strong Data Augmentation

Deep image matting methods have achieved increasingly better results on ...
research
03/18/2021

TrivialAugment: Tuning-free Yet State-of-the-Art Data Augmentation

Automatic augmentation methods have recently become a crucial pillar for...

Please sign up or login with your details

Forgot password? Click here to reset