Investigation on Data Adaptation Techniques for Neural Named Entity Recognition

10/12/2021
by Evgeniia Tokarchuk, et al.

Data processing is an important step in various natural language processing tasks. As the commonly used datasets in named entity recognition contain only a limited number of samples, it is important to obtain additional labeled data in an efficient and reliable manner. A common practice is to utilize large monolingual unlabeled corpora. Another popular technique is to create synthetic data from the original labeled data (data augmentation). In this work, we investigate the impact of these two methods on the performance of three different named entity recognition tasks.

research
07/02/2022

ANEC: An Amharic Named Entity Corpus and Transformer Based Recognizer

Named Entity Recognition is an information extraction task that serves a...
research
04/09/2020

Calibrating Structured Output Predictors for Natural Language Processing

We address the problem of calibrating prediction confidence for output e...
research
11/17/2018

Unnamed Entity Recognition of Sense Mentions

We consider the problem of recognizing mentions of human senses in text....
research
05/10/2023

Korean Named Entity Recognition Based on Language-Specific Features

In the paper, we propose a novel way of improving named entity recogniti...
research
08/30/2017

TANKER: Distributed Architecture for Named Entity Recognition and Disambiguation

Named Entity Recognition and Disambiguation (NERD) systems have recently...
research
10/20/2022

Unsupervised Text Deidentification

Deidentification seeks to anonymize textual data prior to distribution. ...
research
12/04/2020

Delexicalized Paraphrase Generation

We present a neural model for paraphrasing and train it to generate dele...
