Namesakes: Ambiguously Named Entities from Wikipedia and News

11/22/2021
by Oleg Vasilyev, et al.

We present Namesakes, a dataset of ambiguously named entities obtained from English-language Wikipedia and news articles. It consists of 58862 mentions of 4148 unique entities and their namesakes: 1000 mentions from news, 28843 from Wikipedia articles about the entity, and 29019 Wikipedia backlink mentions. Namesakes should be helpful in establishing challenging benchmarks for the task of named entity linking (NEL).


