Integrating Lexical Knowledge in Word Embeddings using Sprinkling and Retrofitting

12/14/2019
by   Aakash Srinivasan, et al.

Neural-network-based word embeddings, such as Word2Vec and GloVe, are purely data-driven in that they capture the distributional information about words from the training corpus. Past works have attempted to improve these embeddings by incorporating semantic knowledge from lexical resources like WordNet. Some techniques, such as retrofitting, modify word embeddings in a post-processing stage, while others use a joint learning approach that modifies the objective function of the neural network. In this paper, we discuss two novel approaches for incorporating semantic knowledge into word embeddings. In the first approach, we take advantage of Levy et al.'s work, which showed that SVD-based methods applied to the co-occurrence matrix perform similarly to neural-network-based embeddings. We propose a 'sprinkling' technique that adds semantic relations directly to the co-occurrence matrix before factorization. In the second approach, WordNet similarity scores are used to improve the retrofitting method. We evaluate the proposed methods on both intrinsic and extrinsic tasks and observe significant improvements over the baselines on many of the datasets.
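
To make the second approach more concrete, the following is a minimal sketch of retrofitting with similarity-weighted neighbors. It is an illustration under stated assumptions, not the implementation evaluated in the paper: it uses the standard retrofitting update (each vector is pulled toward a weighted average of its lexicon neighbors while staying anchored to its original value) and simply assumes that a WordNet similarity score can serve as the neighbor weight. The function name `retrofit`, the parameter `alpha`, and the toy vectors and scores are all hypothetical.

```python
def retrofit(embeddings, neighbors, alpha=1.0, iterations=10):
    """Similarity-weighted retrofitting (illustrative sketch).

    embeddings: dict mapping word -> list of floats (original vectors).
    neighbors:  dict mapping word -> dict of {neighbor: similarity score}.
                Using the score as the edge weight is an assumption here.
    alpha:      weight that anchors a word to its original vector.
    """
    # Work on copies so the original vectors stay available as anchors.
    fitted = {word: list(vec) for word, vec in embeddings.items()}

    for _ in range(iterations):
        for word, scored in neighbors.items():
            if word not in embeddings:
                continue
            # Keep only neighbors that have a vector and a positive weight.
            usable = {n: s for n, s in scored.items() if n in fitted and s > 0}
            if not usable:
                continue

            # Weighted sum: alpha * original vector + sum of s * neighbor vector.
            total_weight = alpha + sum(usable.values())
            updated = [alpha * x for x in embeddings[word]]
            for neighbor, score in usable.items():
                for k, value in enumerate(fitted[neighbor]):
                    updated[k] += score * value

            fitted[word] = [x / total_weight for x in updated]

    return fitted


# Toy data: the vectors and similarity scores below are made up for illustration.
toy_embeddings = {
    "happy": [1.0, 0.0],
    "glad": [0.0, 1.0],
    "car": [5.0, 5.0],
}
toy_neighbors = {
    "happy": {"glad": 0.9},
    "glad": {"happy": 0.9},
}

toy_fitted = retrofit(toy_embeddings, toy_neighbors)
```

In this sketch, words that share a lexicon edge ("happy" and "glad") move toward each other, and a higher score pulls harder, while a word with no lexicon entry ("car") keeps its original vector. Setting every score to the same constant reduces this to unweighted neighbor averaging.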


