Semantic Composition and Decomposition: From Recognition to Generation

05/30/2014
by   Peter D. Turney, et al.
0

Semantic composition is the task of understanding the meaning of text by composing the meanings of the individual words in the text. Semantic decomposition is the task of understanding the meaning of an individual word by decomposing it into various aspects (factors, constituents, components) that are latent in the meaning of the word. We take a distributional approach to semantics, in which a word is represented by a context vector. Much recent work has considered the problem of recognizing compositions and decompositions, but we tackle the more difficult generation problem. For simplicity, we focus on noun-modifier bigrams and noun unigrams. A test for semantic composition is, given context vectors for the noun and modifier in a noun-modifier bigram ("red salmon"), generate a noun unigram that is synonymous with the given bigram ("sockeye"). A test for semantic decomposition is, given a context vector for a noun unigram ("snifter"), generate a noun-modifier bigram that is synonymous with the given unigram ("brandy glass"). With a vocabulary of about 73,000 unigrams from WordNet, there are 73,000 candidate unigram compositions for a bigram and 5,300,000,000 (73,000 squared) candidate bigram decompositions for a unigram. We generate ranked lists of potential solutions in two passes. A fast unsupervised learning algorithm generates an initial list of candidates and then a slower supervised learning algorithm refines the list. We evaluate the candidate solutions by comparing them to WordNet synonym sets. For decomposition (unigram to bigram), the top 100 most highly ranked bigrams include a WordNet synonym of the given unigram 50.7 composition (bigram to unigram), the top 100 most highly ranked unigrams include a WordNet synonym of the given bigram 77.8

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/30/2020

Context based Text-generation using LSTM networks

Long short-term memory(LSTM) units on sequence-based models are being us...
research
01/04/2017

Joint Semantic Synthesis and Morphological Analysis of the Derived Word

Much like sentences are composed of words, words themselves are composed...
research
10/06/2020

Automatic Metaphor Interpretation Using Word Embeddings

We suggest a model for metaphor interpretation using word embeddings tra...
research
11/23/2019

SemEval-2013 Task 4: Free Paraphrases of Noun Compounds

In this paper, we describe SemEval-2013 Task 4: the definition, the data...
research
11/15/2021

An Outcome Test of Discrimination for Ranked Lists

This paper extends Becker (1957)'s outcome test of discrimination to set...
research
04/23/2017

Deep Keyphrase Generation

Keyphrase provides highly-summative information that can be effectively ...
research
05/29/2020

Automatic Generation of Topic Labels

Topic modelling is a popular unsupervised method for identifying the und...

Please sign up or login with your details

Forgot password? Click here to reset