Single versus Multiple Annotation for Named Entity Recognition of Mutations

01/19/2021
by   David Martinez Iraola, et al.
0

The focus of this paper is to address the knowledge acquisition bottleneck for Named Entity Recognition (NER) of mutations, by analysing different approaches to build manually-annotated data. We address first the impact of using a single annotator vs two annotators, in order to measure whether multiple annotators are required. Once we evaluate the performance loss when using a single annotator, we apply different methods to sample the training data for second annotation, aiming at improving the quality of the dataset without requiring a full pass. We use held-out double-annotated data to build two scenarios with different types of rankings: similarity-based and confidence based. We evaluate both approaches on: (i) their ability to identify training instances that are erroneous (cases where single-annotator labels differ from double-annotation after discussion), and (ii) on Mutation NER performance for state-of-the-art classifiers after integrating the fixes at different thresholds.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/22/2023

Partial Annotation Learning for Biomedical Entity Recognition

Motivation: Named Entity Recognition (NER) is a key task to support biom...
research
07/08/2017

Improving Multilingual Named Entity Recognition with Wikipedia Entity Type Mapping

The state-of-the-art named entity recognition (NER) systems are statisti...
research
04/26/2022

Boundary Smoothing for Named Entity Recognition

Neural named entity recognition (NER) models may easily encounter the ov...
research
09/20/2019

Named Entity Recognition with Partially Annotated Training Data

Supervised machine learning assumes the availability of fully-labeled da...
research
10/06/2019

Named Entity Recognition – Is there a glass ceiling?

Recent developments in Named Entity Recognition (NER) have resulted in b...
research
04/19/2022

Named Entity Recognition for Partially Annotated Datasets

The most common Named Entity Recognizers are usually sequence taggers tr...
research
04/20/2021

Mitigating Temporal-Drift: A Simple Approach to Keep NER Models Crisp

Performance of neural models for named entity recognition degrades over ...

Please sign up or login with your details

Forgot password? Click here to reset