On SkipGram Word Embedding Models with Negative Sampling: Unified Framework and Impact of Noise Distributions

09/02/2020
by Ziqiao Wang, et al.

SkipGram word embedding models with negative sampling, or SGN for short, form an elegant family of word embedding models. In this paper, we formulate a framework for word embedding, referred to as Word-Context Classification (WCC), that generalizes SGN to a wide family of models. The framework, which makes use of "noise examples", is justified through a theoretical analysis. The impact of the noise distribution on the learning of WCC embedding models is studied experimentally; the results suggest that the best noise distribution is in fact the data distribution, in terms of both embedding performance and speed of convergence during training. Along the way, we discover several novel embedding models that outperform the existing WCC models.
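
To make the role of the noise distribution concrete, here is a minimal sketch of the standard SkipGram negative-sampling loss for a single word-context pair. It is not the paper's WCC framework, only the familiar SGN objective that the framework is said to generalize. The helper names (`make_noise_distribution`, `sgn_loss`), the toy counts, and the vector sizes are assumptions made for illustration. The exponent 0.75 is the smoothing commonly used in word2vec. The exponent 1.0 gives the plain unigram distribution, which may or may not match what the abstract calls the data distribution.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def make_noise_distribution(counts, power):
    """Normalize counts**power into a probability distribution over word ids.

    Hypothetical helper: power=1.0 gives the unigram distribution,
    power=0.75 gives the smoothed variant commonly used in word2vec.
    """
    weights = [c ** power for c in counts]
    total = sum(weights)
    return [w / total for w in weights]


def sgn_loss(word_vec, context_vec, noise_vecs):
    """Negative-sampling loss for one observed pair and its noise contexts.

    The observed pair is pushed toward label 1, each noise pair toward 0.
    """
    loss = -math.log(sigmoid(dot(word_vec, context_vec)))
    for noise_vec in noise_vecs:
        loss -= math.log(sigmoid(-dot(word_vec, noise_vec)))
    return loss


# Toy vocabulary of three words with made-up corpus counts (assumption).
counts = [8, 1, 1]
unigram = make_noise_distribution(counts, 1.0)
smoothed = make_noise_distribution(counts, 0.75)

# Small random embeddings, seeded so the sketch is reproducible.
rng = random.Random(0)
dim = 4
word_vecs = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in counts]
context_vecs = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in counts]

# Draw two noise contexts from the chosen noise distribution.
negatives = rng.choices(range(len(counts)), weights=smoothed, k=2)
loss = sgn_loss(word_vecs[0], context_vecs[1],
                [context_vecs[i] for i in negatives])
```

Swapping `smoothed` for `unigram` in the `rng.choices` call changes only which noise examples are drawn; the loss itself is unchanged, which is what makes the noise distribution an independent design choice to study.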

research
08/20/2019

CatE: Category-Name Guided Word Embedding

Unsupervised word embedding has benefited a wide spectrum of NLP tasks d...
research
12/05/2017

EmTaggeR: A Word Embedding Based Novel Method for Hashtag Recommendation on Twitter

The hashtag recommendation problem addresses recommending (suggesting) o...
research
04/22/2018

Word Embedding Perturbation for Sentence Classification

In this technique report, we aim to mitigate the overfitting problem of ...
research
04/01/2019

Syntactic Interchangeability in Word Embedding Models

Nearest neighbors in word embedding models are commonly observed to be s...
research
01/10/2017

Implicitly Incorporating Morphological Information into Word Embedding

In this paper, we propose three novel models to enhance word embedding b...
research
04/05/2017

Linear Ensembles of Word Embedding Models

This paper explores linear methods for combining several word embedding ...
research
08/01/2021

Realised Volatility Forecasting: Machine Learning via Financial Word Embedding

We develop FinText, a novel, state-of-the-art, financial word embedding ...
