Low-dimensional Semantic Space: from Text to Word Embedding

11/03/2019
by Xiaolei Lu, et al.

This article studies Word Embedding, a feature-learning technique in Natural Language Processing that maps words or phrases to low-dimensional vectors. Starting from two linguistic theories of contextual similarity, the "Distributional Hypothesis" and "Context of Situation", it introduces two numerical representations of text: One-hot and Distributed Representation. It then presents statistics-based Language Models (such as Co-occurrence Matrix and Singular Value Decomposition) as well as Neural Network Language Models (NNLM, such as Continuous Bag-of-Words and Skip-Gram). Finally, it analyzes how Word Embedding can be applied to word-sense disambiguation and diachronic linguistics.
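
To make the contrast between One-hot and Distributed Representation concrete, the following is a minimal sketch, not the authors' implementation. It assumes a toy two-sentence corpus, a symmetric context window of one word, and NumPy's `np.linalg.svd`; the corpus and the helper names (`build_vocab`, `one_hot`, `cooccurrence_matrix`, `svd_embeddings`) are hypothetical and chosen only for illustration.

```python
import numpy as np

# Toy corpus (hypothetical, for illustration only).
corpus = [
    "the cat sat on the mat",
    "the dog sat on the rug",
]

def build_vocab(sentences):
    """Map each distinct word to an integer index, in sorted order."""
    words = sorted({w for s in sentences for w in s.split()})
    return {w: i for i, w in enumerate(words)}

def one_hot(word, vocab):
    """One-hot vector: as long as the vocabulary, with a single 1 entry."""
    vec = np.zeros(len(vocab))
    vec[vocab[word]] = 1.0
    return vec

def cooccurrence_matrix(sentences, vocab, window=1):
    """Count how often each word pair occurs within `window` positions."""
    counts = np.zeros((len(vocab), len(vocab)))
    for s in sentences:
        tokens = s.split()
        for i, w in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    counts[vocab[w], vocab[tokens[j]]] += 1
    return counts

def svd_embeddings(counts, dim=2):
    """Truncated SVD: keep the top `dim` singular directions as word vectors."""
    u, s, _ = np.linalg.svd(counts)
    return u[:, :dim] * s[:dim]

vocab = build_vocab(corpus)
counts = cooccurrence_matrix(corpus, vocab)
embeddings = svd_embeddings(counts, dim=2)
```

In this toy setup the one-hot vectors for "cat" and "dog" are orthogonal, so they carry no similarity information, whereas the two words occur in identical contexts ("the", "sat") and therefore receive the same low-dimensional vector from the SVD step. This is the Distributional Hypothesis in miniature; a realistic corpus would need a larger window, weighting, and far more data.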

research
12/18/2017

word representation or word embedding in Persian text

Text processing is one of the sub-branches of natural language processin...
research
10/26/2020

Robust and Consistent Estimation of Word Embedding for Bangla Language by fine-tuning Word2Vec Model

Word embedding or vector representation of word holds syntactical and se...
research
03/05/2018

Calculated attributes of synonym sets

The goal of formalization, proposed in this paper, is to bring together,...
research
05/18/2020

Reconstructing Maps from Text

Previous research has demonstrated that Distributional Semantic Models (...
research
04/30/2022

To Know by the Company Words Keep and What Else Lies in the Vicinity

The development of state-of-the-art (SOTA) Natural Language Processing (...
research
09/14/2020

A Comparison of Two Fluctuation Analyses for Natural Language Clustering Phenomena: Taylor and Ebeling Neiman Methods

This article considers the fluctuation analysis methods of Taylor and Eb...
research
02/24/2017

Consistent Alignment of Word Embedding Models

Word embedding models offer continuous vector representations that can c...
