SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble

10/17/2019
by   Jiaming Shen, et al.
0

Corpus-based set expansion (i.e., finding the "complete" set of entities belonging to the same semantic class, based on a given corpus and a tiny set of seeds) is a critical task in knowledge discovery. It may facilitate numerous downstream applications, such as information extraction, taxonomy induction, question answering, and web search. To discover new entities in an expanded set, previous approaches either make one-time entity ranking based on distributional similarity, or resort to iterative pattern-based bootstrapping. The core challenge for these methods is how to deal with noisy context features derived from free-text corpora, which may lead to entity intrusion and semantic drifting. In this study, we propose a novel framework, SetExpan, which tackles this problem, with two techniques: (1) a context feature selection method that selects clean context features for calculating entity-entity distributional similarity, and (2) a ranking-based unsupervised ensemble method for expanding entity set based on denoised context features. Experiments on three datasets show that SetExpan is robust and outperforms previous state-of-the-art methods in terms of mean average precision.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/07/2023

From Retrieval to Generation: Efficient and Effective Entity Set Expansion

Entity Set Expansion (ESE) is a critical task aiming to expand entities ...
research
01/12/2015

Tri-Subject Kinship Verification: Understanding the Core of A Family

One major challenge in computer vision is to go beyond the modeling of i...
research
12/05/2022

Entity Set Co-Expansion in StackOverflow

Given a few seed entities of a certain type (e.g., Software or Programmi...
research
12/31/2018

SynonymNet: Multi-context Bilateral Matching for Entity Synonyms

Being able to automatically discover synonymous entities from a large fr...
research
04/16/2022

Contrastive Learning with Hard Negative Entities for Entity Set Expansion

Entity Set Expansion (ESE) is a promising task which aims to expand enti...
research
07/17/2022

Automatic Context Pattern Generation for Entity Set Expansion

Entity Set Expansion (ESE) is a valuable task that aims to find entities...
research
04/04/2019

Multi-Context Term Embeddings: the Use Case of Corpus-based Term Set Expansion

In this paper, we present a novel algorithm that combines multi-context ...

Please sign up or login with your details

Forgot password? Click here to reset