Partially Supervised Named Entity Recognition via the Expected Entity Ratio Loss

08/16/2021
by   Thomas Effland, et al.
4

We study learning named entity recognizers in the presence of missing entity annotations. We approach this setting as tagging with latent variables and propose a novel loss, the Expected Entity Ratio, to learn models in the presence of systematically missing tags. We show that our approach is both theoretically sound and empirically useful. Experimentally, we find that it meets or exceeds performance of strong and state-of-the-art baselines across a variety of languages, annotation scenarios, and amounts of labeled data. In particular, we find that it significantly outperforms the previous state-of-the-art methods from Mayhew et al. (2019) and Li et al. (2021) by +12.7 and +2.3 F1 score in a challenging setting with only 1,000 biased annotations, averaged across 7 datasets. We also show that, when combined with our approach, a novel sparse annotation scheme outperforms exhaustive annotation for modest annotation budgets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/22/2023

Partial Annotation Learning for Biomedical Entity Recognition

Motivation: Named Entity Recognition (NER) is a key task to support biom...
research
08/26/2019

Partially-supervised Mention Detection

Learning to detect entity mentions without using syntactic information c...
research
11/02/2022

Improving Named Entity Recognition in Telephone Conversations via Effective Active Learning with Human in the Loop

Telephone transcription data can be very noisy due to speech recognition...
research
03/09/2022

Nested Named Entity Recognition as Latent Lexicalized Constituency Parsing

Nested named entity recognition (NER) has been receiving increasing atte...
research
05/06/2023

SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recognition

Distantly-Supervised Named Entity Recognition effectively alleviates the...
research
12/30/2021

KIND: an Italian Multi-Domain Dataset for Named Entity Recognition

In this paper we present KIND, an Italian dataset for Named-Entity Recog...
research
10/03/2022

Unsilencing Colonial Archives via Automated Entity Recognition

Colonial archives are at the center of increased interest from a variety...

Please sign up or login with your details

Forgot password? Click here to reset