Low-rank Subspaces for Unsupervised Entity Linking

04/18/2021
by   Akhil Arora, et al.
0

Entity linking is an important problem with many applications. Most previous solutions were designed for settings where annotated training data is available, which is, however, not the case in numerous domains. We propose a light-weight and scalable entity linking method, Eigenthemes, that relies solely on the availability of entity names and a referent knowledge base. Eigenthemes exploits the fact that the entities that are truly mentioned in a document (the "gold entities") tend to form a semantically dense subset of the set of all candidate entities in the document. Geometrically speaking, when representing entities as vectors via some given embedding, the gold entities tend to lie in a low-rank subspace of the full embedding space. Eigenthemes identifies this subspace using the singular value decomposition and scores candidate entities according to their proximity to the subspace. On the empirical front, we introduce multiple strong baselines that compare favorably to the existing state of the art. Extensive experiments on benchmark datasets from a variety of real-world domains showcase the effectiveness of our approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/04/2018

Pair-Linking for Collective Entity Disambiguation: Two Could Be Better Than All

Collective entity disambiguation, or collective entity linking aims to j...
research
02/12/2020

Joint Embedding in Named Entity Linking on Sentence Level

Named entity linking is to map an ambiguous mention in documents to an e...
research
08/13/2019

Linking Graph Entities with Multiplicity and Provenance

Entity linking is a fundamental database problem with applicationsin dat...
research
10/05/2021

EntQA: Entity Linking as Question Answering

A conventional approach to entity linking is to first find mentions in a...
research
11/30/2017

Graph Centrality Measures for Boosting Popularity-Based Entity Linking

Many Entity Linking systems use collective graph-based methods to disamb...
research
12/15/2021

Knowledge-Rich Self-Supervised Entity Linking

Entity linking faces significant challenges, such as prolific variations...
research
09/08/2015

Probabilistic Bag-Of-Hyperlinks Model for Entity Linking

Many fundamental problems in natural language processing rely on determi...

Please sign up or login with your details

Forgot password? Click here to reset