EDIN: An End-to-end Benchmark and Pipeline for Unknown Entity Discovery and Indexing

05/25/2022
by   Nora Kassner, et al.

Existing work on Entity Linking mostly assumes that the reference knowledge base is complete, and therefore that all mentions can be linked. In practice this is hardly ever the case, as knowledge bases are incomplete and novel concepts arise constantly. This paper creates the Unknown Entity Discovery and Indexing (EDIN) benchmark, in which unknown entities, that is, entities with neither a description in the knowledge base nor labeled mentions, have to be integrated into an existing entity linking system. By contrasting EDIN with zero-shot entity linking, we provide insight into the additional challenges it poses. Building on dense-retrieval-based entity linking, we introduce the end-to-end EDIN pipeline, which detects, clusters, and indexes mentions of unknown entities in context. Experiments show that indexing a single embedding per entity, unifying the information of multiple mentions, works better than indexing mentions independently.
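
The last finding, indexing one embedding per entity rather than one per mention, can be illustrated with a minimal sketch. The abstract does not say how the mention information is unified, so the mean pooling used here is an assumption, as are the toy 2-dimensional vectors, the precomputed cluster labels, and the hypothetical helper names `pool_clusters` and `nearest`. It is not the authors' implementation.

```python
import math
from collections import defaultdict


def normalize(vec):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def pool_clusters(mention_vecs, cluster_ids):
    """Build one index entry per cluster by averaging its mention embeddings.

    Assumption: mean pooling stands in for whatever unification the paper uses.
    """
    groups = defaultdict(list)
    for vec, cid in zip(mention_vecs, cluster_ids):
        groups[cid].append(vec)
    index = {}
    for cid, vecs in groups.items():
        dim = len(vecs[0])
        mean = [sum(v[i] for v in vecs) / len(vecs) for i in range(dim)]
        index[cid] = normalize(mean)
    return index


def nearest(index, query):
    """Return the id of the entity whose embedding has the highest dot product."""
    q = normalize(query)
    return max(index, key=lambda cid: sum(a * b for a, b in zip(index[cid], q)))


# Toy data: four mention embeddings, already grouped into two unknown entities.
mentions = [[1.0, 0.1], [0.9, -0.1], [0.0, 1.0], [0.1, 0.9]]
clusters = ["unk_0", "unk_0", "unk_1", "unk_1"]

entity_index = pool_clusters(mentions, clusters)  # two entries, not four
best = nearest(entity_index, [0.8, 0.05])
```

Indexing mentions independently would instead store all four vectors and link a new mention to the entity of its single nearest stored mention. Pooling keeps the index at one entry per discovered entity.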

research
03/08/2023

NASTyLinker: NIL-Aware Scalable Transformer-based Entity Linker

Entity Linking (EL) is the task of detecting mentions of entities in tex...
research
08/13/2019

Linking Graph Entities with Multiplicity and Provenance

Entity linking is a fundamental database problem with applications in dat...
research
06/21/2023

Bidirectional End-to-End Learning of Retriever-Reader Paradigm for Entity Linking

Entity Linking (EL) is a fundamental task for Information Extraction and...
research
09/02/2021

Entity Linking and Discovery via Arborescence-based Supervised Clustering

Previous work has shown promising results in performing entity linking b...
research
09/01/2022

Find the Funding: Entity Linking with Incomplete Funding Knowledge Bases

Automatic extraction of funding information from academic articles adds ...
research
10/21/2020

Linking Entities to Unseen Knowledge Bases with Arbitrary Schemas

In entity linking, mentions of named entities in raw text are disambigua...
research
02/05/2023

TempEL: Linking Dynamically Evolving and Newly Emerging Entities

In our continuously evolving world, entities change over time and new, p...
