Automatic Synonym Discovery with Knowledge Bases

06/25/2017
by Meng Qu, et al.

Recognizing entity synonyms from text has become a crucial task in many entity-leveraging applications. However, discovering entity synonyms from domain-specific text corpora (e.g., news articles, scientific papers) is rather challenging. Current systems take an entity name string as input and find other names that are synonymous with it, ignoring the fact that a name string can often refer to multiple entities (e.g., "apple" could refer to both Apple Inc. and the fruit apple). Moreover, most existing methods require training data manually created by domain experts to construct supervised-learning systems. In this paper, we study the problem of automatic synonym discovery with knowledge bases, that is, identifying synonyms for knowledge base entities in a given domain-specific corpus. The manually curated synonyms stored for each entity in a knowledge base not only form a set of name strings that disambiguate one another's meaning, but can also serve as "distant" supervision to help determine important features for the task. We propose a novel framework, called DPE, to integrate two kinds of mutually complementary signals for synonym discovery, i.e., distributional features based on corpus-level statistics and textual patterns based on local contexts. In particular, DPE jointly optimizes the two kinds of signals in conjunction with distant supervision, so that they can mutually enhance each other in the training stage. At the inference stage, both signals are used to discover synonyms for the given entities. Experimental results demonstrate the effectiveness of the proposed framework.
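
The idea of using knowledge base synonyms as "distant" supervision can be illustrated with a minimal sketch. This is not DPE's training procedure, which the abstract does not specify. It assumes a toy knowledge base given as a hypothetical `{entity_id: set_of_names}` mapping, treats two names of the same entity as a positive pair, and treats names that never share an entity as a negative pair (the negative-pair rule is an assumption made for illustration).

```python
from itertools import combinations


def distant_pairs(kb):
    """Build labeled name pairs from a {entity_id: set_of_names} mapping.

    Positive pairs: two names listed under the same entity.
    Negative pairs (an assumption of this sketch): two names that
    never appear together under any entity.
    """
    positives = set()
    for names in kb.values():
        # Sort so each pair has one canonical ordering.
        for a, b in combinations(sorted(names), 2):
            positives.add((a, b))

    all_names = sorted({name for names in kb.values() for name in names})
    negatives = {
        (a, b)
        for a, b in combinations(all_names, 2)
        if (a, b) not in positives
    }
    return positives, negatives


# Hypothetical toy knowledge base; "apple" is ambiguous across two entities.
kb = {
    "Apple_Inc": {"apple", "apple inc"},
    "Apple_fruit": {"apple", "malus domestica"},
}
positives, negatives = distant_pairs(kb)
print(sorted(positives))
print(sorted(negatives))
```

In this toy case the ambiguous string "apple" is paired positively with names of both entities, while "apple inc" and "malus domestica" form a negative pair. That is why the abstract frames the task around knowledge base entities rather than bare name strings.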

research
04/19/2019

Relation Discovery with Out-of-Relation Knowledge Base as Supervision

Unsupervised relation discovery aims to discover new relations from a gi...
research
12/31/2018

SynonymNet: Multi-context Bilateral Matching for Entity Synonyms

Being able to automatically discover synonymous entities from a large fr...
research
10/27/2016

CoType: Joint Extraction of Typed Entities and Relations with Knowledge Bases

Extracting entities and relations for types of interest from text is imp...
research
12/13/2018

Same but Different: Distant Supervision for Predicting and Understanding Entity Linking Difficulty

Entity Linking (EL) is the task of automatically identifying entity ment...
research
03/13/2017

MetaPAD: Meta Pattern Discovery from Massive Text Corpora

Mining textual patterns in news, tweets, papers, and many other kinds of...
research
04/26/2018

Open Information Extraction with Global Structure Constraints

Extracting entities and their relations from text is an important task f...
research
07/22/2017

Identifying civilians killed by police with distantly supervised entity-event extraction

We propose a new, socially-impactful task for natural language processin...
