Named Entity Recognition – Is there a glass ceiling?

10/06/2019
by   Tomasz Stanislawek, et al.
0

Recent developments in Named Entity Recognition (NER) have resulted in better and better models. However, is there a glass ceiling? Do we know which types of errors are still hard or even impossible to correct? In this paper, we present a detailed analysis of the types of errors in state-of-the-art machine learning (ML) methods. Our study reveals the weak and strong points of the Stanford, CMU, FLAIR, ELMO and BERT models, as well as their shared limitations. We also introduce new techniques for improving annotation, for training processes and for checking a model's quality and stability. Presented results are based on the CoNLL 2003 data set for the English language. A new enriched semantic annotation of errors for this data set and new diagnostic data sets are attached in the supplementary materials.

READ FULL TEXT
research
06/04/2019

NNE: A Dataset for Nested Named Entity Recognition in English Newswire

Named entity recognition (NER) is widely used in natural language proces...
research
11/02/2022

Improving Named Entity Recognition in Telephone Conversations via Effective Active Learning with Human in the Loop

Telephone transcription data can be very noisy due to speech recognition...
research
12/19/2022

E-NER – An Annotated Named Entity Recognition Corpus of Legal Text

Identifying named entities such as a person, location or organization, i...
research
05/22/2023

Partial Annotation Learning for Biomedical Entity Recognition

Motivation: Named Entity Recognition (NER) is a key task to support biom...
research
05/06/2023

SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recognition

Distantly-Supervised Named Entity Recognition effectively alleviates the...
research
06/04/2020

The SOFC-Exp Corpus and Neural Approaches to Information Extraction in the Materials Science Domain

This paper presents a new challenging information extraction task in the...
research
01/19/2021

Single versus Multiple Annotation for Named Entity Recognition of Mutations

The focus of this paper is to address the knowledge acquisition bottlene...

Please sign up or login with your details

Forgot password? Click here to reset