EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks

01/31/2019 ∙ by Jason W. Wei, et al.

We present EDA: easy data augmentation techniques for boosting performance on text classification tasks. EDA consists of four simple but powerful operations: synonym replacement, random insertion, random swap, and random deletion. On five text classification tasks, we show that EDA improves performance for both convolutional and recurrent neural networks. EDA demonstrates particularly strong results for smaller datasets; on average, across five datasets, training with EDA while using only 50% of the available training set achieved the same accuracy as normal training with all available data. We also performed extensive ablation studies and suggest parameters for practical use.
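The four operations named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy `SYNONYMS` table, and the parameters `n` (number of edits) and `p` (per-word deletion probability) are assumptions made for illustration. A practical setup would draw synonyms from a thesaurus such as WordNet and would likely skip stop words.

```python
import random

# Hypothetical toy synonym table, used only to keep the sketch self-contained.
SYNONYMS = {
    "quick": ["fast", "speedy"],
    "happy": ["glad", "joyful"],
    "movie": ["film"],
}


def synonym_replacement(words, n, rng):
    """Replace up to n words that have an entry in SYNONYMS with a random synonym."""
    out = list(words)
    candidates = [i for i, w in enumerate(out) if w in SYNONYMS]
    rng.shuffle(candidates)
    for i in candidates[:n]:
        out[i] = rng.choice(SYNONYMS[out[i]])
    return out


def random_insertion(words, n, rng):
    """n times: pick a random word with a known synonym and insert that synonym at a random position."""
    out = list(words)
    for _ in range(n):
        candidates = [w for w in out if w in SYNONYMS]
        if not candidates:
            break
        synonym = rng.choice(SYNONYMS[rng.choice(candidates)])
        out.insert(rng.randint(0, len(out)), synonym)
    return out


def random_swap(words, n, rng):
    """n times: swap the words at two distinct random positions."""
    out = list(words)
    if len(out) < 2:
        return out
    for _ in range(n):
        i, j = rng.sample(range(len(out)), 2)
        out[i], out[j] = out[j], out[i]
    return out


def random_deletion(words, p, rng):
    """Drop each word independently with probability p, always keeping at least one word."""
    if len(words) <= 1:
        return list(words)
    kept = [w for w in words if rng.random() >= p]
    return kept if kept else [rng.choice(words)]


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs give the same augmentations
    sentence = "the quick dog watched a happy movie".split()
    print(" ".join(synonym_replacement(sentence, 2, rng)))
    print(" ".join(random_insertion(sentence, 1, rng)))
    print(" ".join(random_swap(sentence, 1, rng)))
    print(" ".join(random_deletion(sentence, 0.1, rng)))
```

Each function returns a new token list and leaves its input unchanged, so several augmented variants can be generated from one training sentence while keeping its original label.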
