Improving Sparse Word Representations with Distributional Inference for Semantic Composition

08/24/2016
by   Thomas Kober, et al.
0

Distributional models are derived from co-occurrences in a corpus, where only a small proportion of all possible plausible co-occurrences will be observed. This results in a very sparse vector space, requiring a mechanism for inferring missing knowledge. Most methods face this challenge in ways that render the resulting word representations uninterpretable, with the consequence that semantic composition becomes hard to model. In this paper we explore an alternative which involves explicitly inferring unobserved co-occurrences using the distributional neighbourhood. We show that distributional inference improves sparse word representations on several word similarity benchmarks and demonstrate that our model is competitive with the state-of-the-art for adjective-noun, noun-noun and verb-object compositions while being fully interpretable.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/21/2017

Improving Semantic Composition with Offset Inference

Count-based distributional semantic models suffer from sparsity due to u...
research
04/11/2019

Cross-topic distributional semantic representations via unsupervised mappings

In traditional Distributional Semantic Models (DSMs) the multiple senses...
research
03/28/2017

Semi-Supervised Affective Meaning Lexicon Expansion Using Semantic and Distributed Word Representations

In this paper, we propose an extension to graph-based sentiment lexicon ...
research
06/11/2019

A Systematic Comparison of English Noun Compound Representations

Building meaningful representations of noun compounds is not trivial sin...
research
06/29/2016

A Distributional Semantics Approach to Implicit Language Learning

In the present paper we show that distributional information is particul...
research
09/01/2017

Semantic Composition via Probabilistic Model Theory

Semantic composition remains an open problem for vector space models of ...
research
09/07/2018

Using Sparse Semantic Embeddings Learned from Multimodal Text and Image Data to Model Human Conceptual Knowledge

Distributional models provide a convenient way to model semantics using ...

Please sign up or login with your details

Forgot password? Click here to reset