Real Multi-Sense or Pseudo Multi-Sense: An Approach to Improve Word Representation

01/06/2017
by Haoyue Shi, et al.

Previous research has shown that learning multiple representations for polysemous words can improve the performance of word embeddings on many tasks. However, this leads to another problem: several vectors of a word may actually point to the same meaning, a phenomenon we call pseudo multi-sense. In this paper, we introduce the concept of pseudo multi-sense and propose an algorithm to detect such cases. Taking the detected cases into account, we refine existing word embeddings to eliminate the influence of pseudo multi-sense. We apply our algorithm to previously released multi-sense word embeddings and test it on artificial word similarity tasks and the analogy task. The experimental results show that diminishing pseudo multi-sense can improve the quality of word representations. Our method is thus an efficient way to reduce linguistic complexity.


research
03/03/2018

Understanding and Improving Multi-Sense Word Embeddings via Extended Robust Principal Component Analysis

Unsupervised learned representations of polysemous words generate a larg...
research
06/15/2017

A Mixture Model for Learning Multi-Sense Word Embeddings

Word embeddings are now a standard technique for inducing meaning repres...
research
08/26/2019

Semi-supervised Learning for Word Sense Disambiguation

This work is a study of the impact of multiple aspects in a classic unsu...
research
12/14/2018

Detecting Reliable Novel Word Senses: A Network-Centric Approach

In this era of Big Data, due to expeditious exchange of information on t...
research
06/09/2023

Word sense extension

Humans often make creative use of words to express novel senses. A long-...
research
03/30/2016

Bilingual Learning of Multi-sense Embeddings with Discrete Autoencoders

We present an approach to learning multi-sense word embeddings relying b...
research
05/12/2021

Playing Codenames with Language Graphs and Word Embeddings

Although board games and video games have been studied for decades in ar...
