Exhaustive Entity Recognition for Coptic: Challenges and Solutions

11/03/2020
by   Amir Zeldes, et al.
0

Entity recognition provides semantic access to ancient materials in the Digital Humanities: itexposes people and places of interest in texts that cannot be read exhaustively, facilitates linkingresources and can provide a window into text contents, even for texts with no translations. Inthis paper we present entity recognition for Coptic, the language of Hellenistic era Egypt. Weevaluate NLP approaches to the task and lay out difficulties in applying them to a low-resource,morphologically complex language. We present solutions for named and non-named nested en-tity recognition and semi-automatic entity linking to Wikipedia, relying on robust dependencyparsing, feature-based CRF models, and hand-crafted knowledge base resources, enabling highaccuracy NER with orders of magnitude less data than those used for high resource languages.The results suggest avenues for research on other languages in similar settings.

READ FULL TEXT
research
05/04/2020

Soft Gazetteers for Low-Resource Named Entity Recognition

Traditional named entity recognition models use gazetteers (lists of ent...
research
08/09/2022

Effects of Annotations' Density on Named Entity Recognition Models' Performance in the Context of African Languages

African languages have recently been the subject of several studies in N...
research
03/22/2021

MasakhaNER: Named Entity Recognition for African Languages

We take a step towards addressing the under-representation of the Africa...
research
04/20/2023

IXA/Cogcomp at SemEval-2023 Task 2: Context-enriched Multilingual Named Entity Recognition using Knowledge Bases

Named Entity Recognition (NER) is a core natural language processing tas...
research
10/24/2017

Automatic Generation of Benchmarks for Entity Recognition and Linking

The velocity dimension of Big Data plays an increasingly important role ...
research
07/01/2016

Sharing Network Parameters for Crosslingual Named Entity Recognition

Most state of the art approaches for Named Entity Recognition rely on ha...
research
07/02/2020

NLNDE: Enhancing Neural Sequence Taggers with Attention and Noisy Channel for Robust Pharmacological Entity Detection

Named entity recognition has been extensively studied on English news te...

Please sign up or login with your details

Forgot password? Click here to reset