Understanding Composition of Word Embeddings via Tensor Decomposition

02/02/2019
by   Abraham Frandsen, et al.
0

Word embedding is a powerful tool in natural language processing. In this paper we consider the problem of word embedding composition -- given vector representations of two words, compute a vector for the entire phrase. We give a generative model that can capture specific syntactic relations between words. Under our model, we prove that the correlations between three words (measured by their PMI) form a tensor that has an approximate low rank Tucker decomposition. The result of the Tucker decomposition gives the word embeddings as well as a core tensor, which can be used to produce better compositions of the word embeddings. We also complement our theoretical results with experiments that verify our assumptions, and demonstrate the effectiveness of the new composition method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/10/2016

PSDVec: a Toolbox for Incremental and Scalable Word Embedding

PSDVec is a Python/Perl toolbox that learns word embeddings, i.e. the ma...
research
04/10/2017

Word Embeddings via Tensor Factorization

Most popular word embedding techniques involve implicit or explicit fact...
research
01/09/2020

Multiplex Word Embeddings for Selectional Preference Acquisition

Conventional word embeddings represent words with fixed vectors, which a...
research
09/19/2017

An Optimality Proof for the PairDiff operator for Representing Relations between Words

Representing the semantic relations that exist between two given words (...
research
05/23/2018

Embedding Syntax and Semantics of Prepositions via Tensor Decomposition

Prepositions are among the most frequent words in English and play compl...
research
02/21/2018

CoVeR: Learning Covariate-Specific Vector Representations with Tensor Decompositions

Word embedding is a useful approach to capture co-occurrence structures ...
research
08/21/2017

Probabilistic Relation Induction in Vector Space Embeddings

Word embeddings have been found to capture a surprisingly rich amount of...

Please sign up or login with your details

Forgot password? Click here to reset