An Analysis of Simple Data Augmentation for Named Entity Recognition

10/22/2020
by   Xiang Dai, et al.
0

Simple yet effective data augmentation techniques have been proposed for sentence-level and sentence-pair natural language processing tasks. Inspired by these efforts, we design and compare data augmentation for named entity recognition, which is usually modeled as a token-level sequence labeling problem. Through experiments on two data sets from the biomedical and materials science domains (i2b2-2010 and MaSciP), we show that simple augmentation can boost performance for both recurrent and transformer-based models, especially for small training sets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/15/2022

Syntax-driven Data Augmentation for Named Entity Recognition

In low resource settings, data augmentation strategies are commonly leve...
research
03/28/2022

Hierarchical Transformer Model for Scientific Named Entity Recognition

The task of Named Entity Recognition (NER) is an important component of ...
research
10/05/2020

SeqMix: Augmenting Active Sequence Labeling via Sequence Mixup

Active learning is an important technique for low-resource sequence labe...
research
10/09/2020

iobes: A Library for Span-Level Processing

Many tasks in natural language processing, such as named entity recognit...
research
06/25/2022

ConcreteGraph: A Data Augmentation Method Leveraging the Properties of Concept Relatedness Estimation

The concept relatedness estimation (CRE) task is to determine whether tw...
research
12/04/2020

Delexicalized Paraphrase Generation

We present a neural model for paraphrasing and train it to generate dele...
research
10/20/2019

A Semi-Automated Approach for Information Extraction, Classification and Analysis of Unstructured Data

In this paper, we show how Quantitative Narrative Analysis and simple Na...

Please sign up or login with your details

Forgot password? Click here to reset