Distributed representation of multi-sense words: A loss-driven approach

04/14/2019
by   Saurav Manchanda, et al.
0

Word2Vec's Skip Gram model is the current state-of-the-art approach for estimating the distributed representation of words. However, it assumes a single vector per word, which is not well-suited for representing words that have multiple senses. This work presents LDMI, a new model for estimating distributional representations of words. LDMI relies on the idea that, if a word carries multiple senses, then having a different representation for each of its senses should lead to a lower loss associated with predicting its co-occurring words, as opposed to the case when a single vector representation is used for all the senses. After identifying the multi-sense words, LDMI clusters the occurrences of these words to assign a sense to each occurrence. Experiments on the contextual word similarity task show that LDMI leads to better performance than competing approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/17/2019

Multi Sense Embeddings from Topic Models

Distributed word embeddings have yielded state-of-the-art performance in...
research
06/24/2016

A Game-Theoretic Approach to Word Sense Disambiguation

This paper presents a new model for word sense disambiguation formulated...
research
02/17/2021

Contextual Skipgram: Training Word Representation Using Context Information

The skip-gram (SG) model learns word representation by predicting the wo...
research
04/01/2021

HLE-UPC at SemEval-2021 Task 5: Multi-Depth DistilBERT for Toxic Spans Detection

This paper presents our submission to SemEval-2021 Task 5: Toxic Spans D...
research
11/19/2015

Gaussian Mixture Embeddings for Multiple Word Prototypes

Recently, word representation has been increasingly focused on for its e...
research
02/22/2017

One Representation per Word - Does it make Sense for Composition?

In this paper, we investigate whether an a priori disambiguation of word...
research
09/26/2017

Polysemy Detection in Distributed Representation of Word Sense

In this paper, we propose a statistical test to determine whether a give...

Please sign up or login with your details

Forgot password? Click here to reset