Effects of Annotations' Density on Named Entity Recognition Models' Performance in the Context of African Languages

08/09/2022
by   Manuel A. Fokam, et al.
0

African languages have recently been the subject of several studies in Natural Language Processing (NLP) and, this has caused a significant increase in their representation in the field. However, most studies tend to focus more on the models than the quality of the datasets when assessing the models' performance in tasks such as Named Entity Recognition (NER). While this works well in most cases, it does not account for the limitations of doing NLP with low-resource languages, that is, the quality and the quantity of the dataset at our disposal. This paper provides an analysis of the performance of various models based on the quality of the dataset. We evaluate different pre-trained models with respect to the entity density per sentence of some African NER datasets. We hope with this study to improve the way NLP studies are done in the context of low-resourced languages.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/19/2022

MANER: Mask Augmented Named Entity Recognition for Extreme Low-Resource Languages

This paper investigates the problem of Named Entity Recognition (NER) fo...
research
05/03/2021

Switching Contexts: Transportability Measures for NLP

This paper explores the topic of transportability, as a sub-area of gene...
research
10/23/2020

A Caption Is Worth A Thousand Images: Investigating Image Captions for Multimodal Named Entity Recognition

Multimodal named entity recognition (MNER) requires to bridge the gap be...
research
06/02/2020

Exploring Cross-sentence Contexts for Named Entity Recognition with BERT

Named entity recognition (NER) is frequently addressed as a sequence cla...
research
11/03/2020

Exhaustive Entity Recognition for Coptic: Challenges and Solutions

Entity recognition provides semantic access to ancient materials in the ...
research
04/11/2023

Exploring the Use of Foundation Models for Named Entity Recognition and Lemmatization Tasks in Slavic Languages

This paper describes Adam Mickiewicz University's (AMU) solution for the...
research
07/14/2020

What's in a Name? Are BERT Named Entity Representations just as Good for any other Name?

We evaluate named entity representations of BERT-based NLP models by inv...

Please sign up or login with your details

Forgot password? Click here to reset