Data Augmentation for Robust Character Detection in Fantasy Novels

02/09/2023
by   Arthur Amalvy, et al.
0

Named Entity Recognition (NER) is a low-level task often used as a foundation for solving higher level NLP problems. In the context of character detection in novels, NER false negatives can be an issue as they possibly imply missing certain characters or relationships completely. In this article, we demonstrate that applying a straightforward data augmentation technique allows training a model achieving higher recall, at the cost of a certain amount of precision regarding ambiguous entities. We show that this decrease in precision can be mitigated by giving the model more local context, which resolves some of the ambiguities.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/19/2022

EnTDA: Entity-to-Text based Data Augmentation Approach for Named Entity Recognition Tasks

Data augmentation techniques have been used to improve the generalizatio...
research
06/01/2023

ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER

Complex Named Entity Recognition (NER) is the task of detecting linguist...
research
06/01/2019

Biomedical Named Entity Recognition via Reference-Set Augmented Bootstrapping

We present a weakly-supervised data augmentation approach to improve Nam...
research
01/11/2017

Generalisation in Named Entity Recognition: A Quantitative Analysis

Named Entity Recognition (NER) is a key NLP task, which is all the more ...
research
06/14/2023

Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation

Recently, end-to-end (E2E) automatic speech recognition (ASR) models hav...
research
10/04/2020

Local Additivity Based Data Augmentation for Semi-supervised NER

Named Entity Recognition (NER) is one of the first stages in deep langua...

Please sign up or login with your details

Forgot password? Click here to reset