TABi: Type-Aware Bi-Encoders for Open-Domain Entity Retrieval

04/18/2022
by Megan Leszczynski, et al.

Entity retrieval (retrieving information about entity mentions in a query) is a key step in open-domain tasks, such as question answering or fact checking. However, state-of-the-art entity retrievers struggle to retrieve rare entities for ambiguous mentions due to biases towards popular entities. Incorporating knowledge graph types during training could help overcome popularity biases, but there are several challenges: (1) existing type-based retrieval methods require mention boundaries as input, but open-domain tasks run on unstructured text, (2) type-based methods should not compromise overall performance, and (3) type-based methods should be robust to noisy and missing types. In this work, we introduce TABi, a method to jointly train bi-encoders on knowledge graph types and unstructured text for entity retrieval in open-domain tasks. TABi leverages a type-enforced contrastive loss to encourage entities and queries of similar types to be close in the embedding space. TABi improves retrieval of rare entities on the Ambiguous Entity Retrieval (AmbER) sets, while maintaining strong overall retrieval performance on open-domain tasks in the KILT benchmark compared to state-of-the-art retrievers. TABi is also robust to incomplete type systems, improving rare entity retrieval over baselines with only 5% coverage of the training dataset. We make our code publicly available at https://github.com/HazyResearch/tabi.
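
To make the idea of a type-enforced contrastive loss more concrete, the following is a minimal sketch in plain Python, not the implementation from the TABi repository. It assumes a supervised-contrastive-style formulation in which items in a batch that share a type label are treated as positives for one another. The function name `type_contrastive_loss`, the temperature of 0.1, and the toy embeddings and type labels are all hypothetical choices made for illustration; the abstract does not specify the exact form of the loss.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    """Scale a vector to unit length so dot products are cosine similarities."""
    norm = math.sqrt(dot(v, v))
    return [a / norm for a in v]


def type_contrastive_loss(embeddings, types, temperature=0.1):
    """Illustrative loss (an assumption, not the paper's exact formula).

    For each anchor, every other batch item with the same type label is a
    positive. The loss is the mean negative log-probability of the positives
    under a softmax over similarities to all other batch items.
    """
    embs = [normalize(e) for e in embeddings]
    n = len(embs)
    total, counted = 0.0, 0
    for i in range(n):
        positives = [j for j in range(n) if j != i and types[j] == types[i]]
        if not positives:
            continue  # no same-type partner in the batch; skip this anchor
        logits = {j: dot(embs[i], embs[j]) / temperature
                  for j in range(n) if j != i}
        # Log-sum-exp with max subtraction for numerical stability.
        m = max(logits.values())
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits.values()))
        total += -sum(logits[j] - log_denom for j in positives) / len(positives)
        counted += 1
    return total / counted if counted else 0.0


# Toy batch: two embeddings near the x-axis, two near the y-axis.
toy_embeddings = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
aligned_loss = type_contrastive_loss(
    toy_embeddings, ["person", "person", "place", "place"])
mixed_loss = type_contrastive_loss(
    toy_embeddings, ["person", "place", "person", "place"])
print(aligned_loss < mixed_loss)
```

In this toy batch the loss is lower when type labels agree with the geometry of the embeddings, which is the behavior such a loss would be expected to reward. In a joint training setup a term like this would presumably be combined with a standard entity-retrieval loss; how the two are weighted is not stated in the abstract.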

research
06/12/2021

Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP

Retrieval is a core component for open-domain NLP tasks. In open-domain ...
research
08/28/2019

Explore Entity Embedding Effectiveness in Entity Retrieval

This paper explores entity embedding effectiveness in ad-hoc entity retr...
research
08/28/2017

On Type-Aware Entity Retrieval

Today, the practice of returning entities from a knowledge base in respo...
research
07/21/2020

Connecting Embeddings for Knowledge Graph Entity Typing

Knowledge graph (KG) entity typing aims at inferring possible missing en...
research
04/30/2021

GeoWINE: Geolocation based Wiki, Image, News and Event Retrieval

In the context of social media, geolocation inference on news or events ...
research
12/19/2021

CORE: A Knowledge Graph Entity Type Prediction Method via Complex Space Regression and Embedding

Entity type prediction is an important problem in knowledge graph (KG) r...
research
05/19/2023

Polar Ducks and Where to Find Them: Enhancing Entity Linking with Duck Typing and Polar Box Embeddings

Entity linking methods based on dense retrieval are an efficient and wid...
