Word Embedding based on Low-Rank Doubly Stochastic Matrix Decomposition

12/12/2018
by   Denis Sedov, et al.
0

Word embedding, which encodes words into vectors, is an important starting point in natural language processing and commonly used in many text-based machine learning tasks. However, in most current word embedding approaches, the similarity in embedding space is not optimized in the learning. In this paper we propose a novel neighbor embedding method which directly learns an embedding simplex where the similarities between the mapped words are optimal in terms of minimal discrepancy to the input neighborhoods. Our method is built upon two-step random walks between words via topics and thus able to better reveal the topics among the words. Experiment results indicate that our method, compared with another existing word embedding approach, is more favorable for various queries.

READ FULL TEXT
research
10/24/2022

Subspace-based Set Operations on a Pre-trained Word Embedding Space

Word embedding is a fundamental technology in natural language processin...
research
10/06/2021

A Fast Randomized Algorithm for Massive Text Normalization

Many popular machine learning techniques in natural language processing ...
research
10/28/2019

Cross-Domain Ambiguity Detection using Linear Transformation of Word Embedding Spaces

The requirements engineering process is a crucial stage of the software ...
research
06/30/2022

Using Person Embedding to Enrich Features and Data Augmentation for Classification

Today, machine learning is applied in almost any field. In machine learn...
research
10/23/2018

Bridging Semantic Gaps between Natural Languages and APIs with Word Embedding

Developers increasingly rely on text matching tools to analyze the relat...
research
08/16/2015

A Generative Word Embedding Model and its Low Rank Positive Semidefinite Solution

Most existing word embedding methods can be categorized into Neural Embe...
research
11/05/2015

Comparing Writing Styles using Word Embedding and Dynamic Time Warping

The development of plot or story in novels is reflected in the content a...

Please sign up or login with your details

Forgot password? Click here to reset