Evaluating end-to-end entity linking on domain-specific knowledge bases: Learning about ancient technologies from museum collections

To study social, economic, and historical questions, researchers in the social sciences and humanities have started to use increasingly large unstructured textual datasets. While recent advances in NLP provide many tools to efficiently process such data, most existing approaches rely on generic solutions whose performance and suitability for domain-specific tasks is not well understood. This work presents an attempt to bridge this domain gap by exploring the use of modern Entity Linking approaches for the enrichment of museum collection data. We collect a dataset comprising of more than 1700 texts annotated with 7,510 mention-entity pairs, evaluate some off-the-shelf solutions in detail using this dataset and finally fine-tune a recent end-to-end EL model on this data. We show that our fine-tuned model significantly outperforms other approaches currently available in this domain and present a proof-of-concept use case of this model. We release our dataset and our best model.

READ FULL TEXT

page 7

page 11

research
06/15/2023

Multilingual End to End Entity Linking

Entity Linking is one of the most common Natural Language Processing tas...
research
09/28/2022

Cross-Domain Neural Entity Linking

Entity Linking is the task of matching a mention to an entity in a given...
research
12/20/2022

Transformers Go for the LOLs: Generating (Humourous) Titles from Scientific Abstracts End-to-End

We consider the end-to-end abstract-to-title generation problem, explori...
research
10/06/2020

Efficient One-Pass End-to-End Entity Linking for Questions

We present ELQ, a fast end-to-end entity linking model for questions, wh...
research
09/06/2015

A Hybrid Approach to Domain-Specific Entity Linking

The current state-of-the-art Entity Linking (EL) systems are geared towa...
research
10/03/2019

Extracting UMLS Concepts from Medical Text Using General and Domain-Specific Deep Learning Models

Entity recognition is a critical first step to a number of clinical NLP ...
research
10/14/2022

Robust Candidate Generation for Entity Linking on Short Social Media Texts

Entity Linking (EL) is the gateway into Knowledge Bases. Recent advances...

Please sign up or login with your details

Forgot password? Click here to reset