Joint Word Representation Learning using a Corpus and a Semantic Lexicon

11/19/2015
by   Danushka Bollegala, et al.
0

Methods for learning word representations using large text corpora have received much attention lately due to their impressive performance in numerous natural language processing (NLP) tasks such as, semantic similarity measurement, and word analogy detection. Despite their success, these data-driven word representation learning methods do not consider the rich semantic relational structure between words in a co-occurring context. On the other hand, already much manual effort has gone into the construction of semantic lexicons such as the WordNet that represent the meanings of words by defining the various relationships that exist among the words in a language. We consider the question, can we improve the word representations learnt using a corpora by integrating the knowledge from semantic lexicons?. For this purpose, we propose a joint word representation learning method that simultaneously predicts the co-occurrences of two words in a sentence subject to the relational constrains given by the semantic lexicon. We use relations that exist between words in the lexicon to regularize the word representations learnt from the corpus. Our proposed method statistically significantly outperforms previously proposed methods for incorporating semantic lexicons into word representations on several benchmark datasets for semantic similarity and word analogy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/07/2014

Learning Word Representations from Relational Graphs

Attributes of words and relations between two words are central to numer...
research
05/01/2015

Embedding Semantic Relations into Word Representations

Learning representations for semantic relations is important for various...
research
07/07/2016

Representing Verbs with Rich Contexts: an Evaluation on Verb Similarity

Several studies on sentence processing suggest that the mental lexicon k...
research
09/04/2017

Learning Neural Word Salience Scores

Measuring the salience of a word is an essential step in numerous NLP ta...
research
05/19/2023

Contextualized Word Vector-based Methods for Discovering Semantic Differences with No Training nor Word Alignment

In this paper, we propose methods for discovering semantic differences i...
research
04/12/2020

Bayesian Hierarchical Words Representation Learning

This paper presents the Bayesian Hierarchical Words Representation (BHWR...
research
03/02/2018

Hybrid Model For Word Prediction Using Naive Bayes and Latent Information

Historically, the Natural Language Processing area has been given too mu...

Please sign up or login with your details

Forgot password? Click here to reset