Low-Rank Approximation of Matrices for PMI-based Word Embeddings

09/21/2019
by   Alena Sorokina, et al.

We empirically evaluate several low-rank approximation methods for obtaining PMI-based word embeddings. All word vectors were trained on parts of a large corpus extracted from English Wikipedia (enwik9), which was divided into two equal-sized datasets; a PMI matrix was obtained from each. A repeated-measures design was used to assign a low-rank approximation method (SVD, NMF, QR) and a vector dimensionality (250, 500) to each of the PMI matrix replicates. Our experiments show that word vectors obtained from the truncated SVD achieve the best performance on two downstream tasks, similarity and analogy, compared to the other two low-rank approximation methods.
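
To make the pipeline in the abstract concrete, the following is a minimal sketch of one of its branches: building a PMI matrix from co-occurrence counts and reducing it with a truncated SVD. It is an illustration under stated assumptions, not the authors' implementation. The function names `ppmi_matrix` and `svd_embeddings`, the toy count matrix, the clipping of negative PMI values to zero (positive PMI), and the scaling of the left singular vectors by the square root of the singular values are all assumptions chosen for illustration; the abstract does not specify these details.

```python
import numpy as np


def ppmi_matrix(counts):
    """Positive PMI from a word-by-context co-occurrence count matrix.

    Assumption: undefined entries (zero counts) and negative PMI values
    are set to zero, a common convention that the abstract does not confirm.
    """
    counts = np.asarray(counts, dtype=float)
    total = counts.sum()
    row = counts.sum(axis=1, keepdims=True)   # word marginals
    col = counts.sum(axis=0, keepdims=True)   # context marginals
    expected = row * col / total              # counts expected under independence
    with np.errstate(divide="ignore", invalid="ignore"):
        pmi = np.log(counts / expected)
    pmi[~np.isfinite(pmi)] = 0.0              # log(0) and 0/0 entries
    return np.maximum(pmi, 0.0)


def svd_embeddings(matrix, dim):
    """Rank-`dim` word vectors from the truncated SVD of `matrix`.

    Assumption: vectors are U_d * sqrt(S_d); other weightings are possible.
    """
    u, s, _ = np.linalg.svd(matrix, full_matrices=False)
    return u[:, :dim] * np.sqrt(s[:dim])


# Hypothetical 4-word toy example (the paper uses dimensions 250 and 500).
toy_counts = np.array([
    [0, 4, 1, 0],
    [4, 0, 0, 1],
    [1, 0, 0, 5],
    [0, 1, 5, 0],
])
toy_vectors = svd_embeddings(ppmi_matrix(toy_counts), dim=2)
```

Each row of `toy_vectors` is a 2-dimensional vector for one word. NMF or QR would be substituted at the `svd_embeddings` step; they are omitted here to keep the sketch short.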


research
10/08/2017

A Hierarchical Singular Value Decomposition Algorithm for Low Rank Matrices

Singular value decomposition (SVD) is a widely used technique for dimens...
research
01/30/2022

Low Rank Approximation of Dual Complex Matrices

Dual complex numbers can represent rigid body motion in 2D spaces. Dual ...
research
11/22/2015

On the Linear Algebraic Structure of Distributed Word Representations

In this work, we leverage the linear algebraic structure of distributed ...
research
06/11/2020

Attention improves concentration when learning node embeddings

We consider the problem of predicting edges in a graph from node attribu...
research
04/18/2017

Representing Sentences as Low-Rank Subspaces

Sentences are important semantic units of natural language. A generic, d...
research
01/18/2022

A hybrid DEIM and leverage scores based method for CUR index selection

The discrete empirical interpolation method (DEIM) may be used as an ind...
research
08/16/2015

A Generative Word Embedding Model and its Low Rank Positive Semidefinite Solution

Most existing word embedding methods can be categorized into Neural Embe...
