Retrieving Multi-Entity Associations: An Evaluation of Combination Modes for Word Embeddings

05/22/2019
by   Gloria Feher, et al.
0

Word embeddings have gained significant attention as learnable representations of semantic relations between words, and have been shown to improve upon the results of traditional word representations. However, little effort has been devoted to using embeddings for the retrieval of entity associations beyond pairwise relations. In this paper, we use popular embedding methods to train vector representations of an entity-annotated news corpus, and evaluate their performance for the task of predicting entity participation in news events versus a traditional word cooccurrence network as a baseline. To support queries for events with multiple participating entities, we test a number of combination modes for the embedding vectors. While we find that even the best combination modes for word embeddings do not quite reach the performance of the full cooccurrence network, especially for rare entities, we observe that different embedding methods model different types of relations, thereby indicating the potential for ensemble methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/06/2019

Word Embeddings for Entity-annotated Texts

Many information retrieval and natural language processing tasks due to ...
research
10/11/2021

A Comprehensive Comparison of Word Embeddings in Event Entity Coreference Resolution

Coreference Resolution is an important NLP task and most state-of-the-ar...
research
03/01/2016

Characterizing Diseases from Unstructured Text: A Vocabulary Driven Word2vec Approach

Traditional disease surveillance can be augmented with a wide variety of...
research
06/28/2021

Modelling Monotonic and Non-Monotonic Attribute Dependencies with Embeddings: A Theoretical Analysis

During the last decade, entity embeddings have become ubiquitous in Arti...
research
07/29/2019

One-to-X analogical reasoning on word embeddings: a case for diachronic armed conflict prediction from news texts

We extend the well-known word analogy task to a one-to-X formulation, in...
research
03/14/2018

LSH Microbatches for Stochastic Gradients: Value in Rearrangement

Metric embeddings are immensely useful representation of interacting ent...
research
08/05/2018

Instantiation

In computational linguistics, a large body of work exists on distributed...

Please sign up or login with your details

Forgot password? Click here to reset