DeepAI AI Chat
Log In Sign Up

Knowledge-Rich Self-Supervised Entity Linking

by   Sheng Zhang, et al.

Entity linking faces significant challenges, such as prolific variations and prevalent ambiguities, especially in high-value domains with myriad entities. Standard classification approaches suffer from the annotation bottleneck and cannot effectively handle unseen entities. Zero-shot entity linking has emerged as a promising direction for generalizing to new entities, but it still requires example gold entity mentions during training and canonical descriptions for all entities, both of which are rarely available outside of Wikipedia. In this paper, we explore Knowledge-RIch Self-Supervision (KRISS) for entity linking, by leveraging readily available domain knowledge. In training, it generates self-supervised mention examples on unlabeled text using a domain ontology and trains a contextual encoder using contrastive learning. For inference, it samples self-supervised mentions as prototypes for each entity and conducts linking by mapping the test mention to the most similar prototype. Our approach subsumes zero-shot and few-shot methods, and can easily incorporate entity descriptions and gold mention labels if available. Using biomedicine as a case study, we conducted extensive experiments on seven standard datasets spanning biomedical literature and clinical notes. Without using any labeled information, our method produces KRISSBERT, a universal entity linker for four million UMLS entities, which attains new state of the art, outperforming prior self-supervised methods by as much as over 20 absolute points in accuracy.


Clustering-based Inference for Zero-Shot Biomedical Entity Linking

Due to large number of entities in biomedical knowledge bases, only a sm...

Entity Linking and Discovery via Arborescence-based Supervised Clustering

Previous work has shown promising results in performing entity linking b...

Zero-Shot Entity Linking by Reading Entity Descriptions

We present the zero-shot entity linking task, where mentions must be lin...

ReFinED: An Efficient Zero-shot-capable Approach to End-to-End Entity Linking

We introduce ReFinED, an efficient end-to-end entity linking model which...

Find the Funding: Entity Linking with Incomplete Funding Knowledge Bases

Automatic extraction of funding information from academic articles adds ...

Bootleg: Chasing the Tail with Self-Supervised Named Entity Disambiguation

A challenge for named entity disambiguation (NED), the task of mapping t...

Knowledge Enhanced Contextual Word Representations

Contextual word representations, typically trained on unstructured, unla...