RAND-WALK: A Latent Variable Model Approach to Word Embeddings

02/12/2015
by Sanjeev Arora, et al.

Semantic word embeddings represent the meaning of a word via a vector, and are created by diverse methods. Many use nonlinear operations on co-occurrence statistics, and have hand-tuned hyperparameters and reweighting methods. This paper proposes a new generative model, a dynamic version of the log-linear topic model of Mnih and Hinton (2007). The methodological novelty is to use the prior to compute closed-form expressions for word statistics. This provides a theoretical justification for nonlinear models like PMI, word2vec, and GloVe, as well as some hyperparameter choices. It also helps explain why low-dimensional semantic embeddings contain linear algebraic structure that allows solution of word analogies, as shown by Mikolov et al. (2013) and many subsequent papers. Experimental support is provided for the generative model assumptions, the most important of which is that latent word vectors are fairly uniformly dispersed in space.
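To make the "linear algebraic structure" concrete, here is a minimal sketch of the vector-offset approach to word analogies ("a is to b as c is to ?"). It is an illustration only, not the paper's code: the function names (`cosine`, `solve_analogy`) and the three-dimensional vectors are hypothetical, and the vectors are hand-picked so that the offsets line up exactly, which real embeddings would only do approximately.

```python
import math

# Hypothetical toy embeddings, hand-picked for illustration (not trained).
TOY_VECTORS = {
    "man": [1.0, 0.0, 0.0],
    "woman": [1.0, 1.0, 0.0],
    "king": [1.0, 0.0, 1.0],
    "queen": [1.0, 1.0, 1.0],
    "apple": [0.0, 0.2, -1.0],
}


def cosine(u, v):
    """Cosine similarity between two equal-length, nonzero vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v)


def solve_analogy(a, b, c, vectors):
    """Return the word whose vector is closest (by cosine) to v_b - v_a + v_c.

    The three query words are excluded from the candidates, as is common
    practice when evaluating analogies.
    """
    target = [
        vb - va + vc
        for va, vb, vc in zip(vectors[a], vectors[b], vectors[c])
    ]
    candidates = [w for w in vectors if w not in (a, b, c)]
    return max(candidates, key=lambda w: cosine(vectors[w], target))


print(solve_analogy("man", "woman", "king", TOY_VECTORS))
```

With these toy vectors the offset `woman - man + king` coincides with the vector for `queen`, so that word is returned. With trained embeddings the same procedure would pick the nearest neighbour of the offset rather than an exact match.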


