Strong Heuristics for Named Entity Linking

07/06/2022
by   Marko Čuljak, et al.
0

Named entity linking (NEL) in news is a challenging endeavour due to the frequency of unseen and emerging entities, which necessitates the use of unsupervised or zero-shot methods. However, such methods tend to come with caveats, such as no integration of suitable knowledge bases (like Wikidata) for emerging entities, a lack of scalability, and poor interpretability. Here, we consider person disambiguation in Quotebank, a massive corpus of speaker-attributed quotations from the news, and investigate the suitability of intuitive, lightweight, and scalable heuristics for NEL in web-scale corpora. Our best performing heuristic disambiguates 94 Quotebank and the AIDA-CoNLL benchmark, respectively. Additionally, the proposed heuristics compare favourably to the state-of-the-art unsupervised and zero-shot methods, Eigenthemes and mGENRE, respectively, thereby serving as strong baselines for unsupervised and zero-shot entity linking.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/21/2020

Clustering-based Inference for Zero-Shot Biomedical Entity Linking

Due to large number of entities in biomedical knowledge bases, only a sm...
research
07/26/2022

Hansel: A Chinese Few-Shot and Zero-Shot Entity Linking Benchmark

Modern Entity Linking (EL) systems entrench a popularity bias, yet there...
research
04/11/2022

Entities, Dates, and Languages: Zero-Shot on Historical Texts with T0

In this work, we explore whether the recently demonstrated zero-shot abi...
research
08/07/2023

Improving Few-shot and Zero-shot Entity Linking with Coarse-to-Fine Lexicon-based Retriever

Few-shot and zero-shot entity linking focus on the tail and emerging ent...
research
09/01/2022

Find the Funding: Entity Linking with Incomplete Funding Knowledge Bases

Automatic extraction of funding information from academic articles adds ...
research
10/21/2020

Linking Entities to Unseen Knowledge Bases with Arbitrary Schemas

In entity linking, mentions of named entities in raw text are disambigua...
research
12/15/2021

Knowledge-Rich Self-Supervised Entity Linking

Entity linking faces significant challenges, such as prolific variations...

Please sign up or login with your details

Forgot password? Click here to reset