Hansel: A Chinese Few-Shot and Zero-Shot Entity Linking Benchmark

07/26/2022
by   Zhenran Xu, et al.

Modern Entity Linking (EL) systems entrench a popularity bias, yet there is no dataset focusing on tail and emerging entities in languages other than English. We present Hansel, a new benchmark in Chinese that fills the gap in non-English few-shot and zero-shot EL challenges. The test set of Hansel is human annotated and reviewed, and was created with a novel method for collecting zero-shot EL datasets. It covers 10K diverse documents in news, social media posts and other web articles, with Wikidata as its target Knowledge Base. We demonstrate that the existing state-of-the-art EL system performs poorly on Hansel (R@1 of 36.6). Our baseline scores a R@1 of 46.2, and we also show that it achieves competitive results on the TAC-KBP2015 Chinese Entity Linking task.

research
08/07/2023

Improving Few-shot and Zero-shot Entity Linking with Coarse-to-Fine Lexicon-based Retriever

Few-shot and zero-shot entity linking focus on the tail and emerging ent...
research
07/06/2022

Strong Heuristics for Named Entity Linking

Named entity linking (NEL) in news is a challenging endeavour due to the...
research
12/06/2022

ZeroKBC: A Comprehensive Benchmark for Zero-Shot Knowledge Base Completion

Knowledge base completion (KBC) aims to predict the missing links in kno...
research
04/16/2021

Improving Zero-Shot Multi-Lingual Entity Linking

Entity linking – the task of identifying references in free text to rele...
research
10/21/2020

Linking Entities to Unseen Knowledge Bases with Arbitrary Schemas

In entity linking, mentions of named entities in raw text are disambigua...
research
06/24/2020

XREF: Entity Linking for Chinese News Comments with Supplementary Article Reference

Automatic identification of mentioned entities in social media posts fac...
research
08/23/2023

Knowledge-injected Prompt Learning for Chinese Biomedical Entity Normalization

The Biomedical Entity Normalization (BEN) task aims to align raw, unstru...
