Semantic Relatedness and Taxonomic Word Embeddings

02/14/2020
by   Magdalena Kacmajor, et al.
0

This paper connects a series of papers dealing with taxonomic word embeddings. It begins by noting that there are different types of semantic relatedness and that different lexical representations encode different forms of relatedness. A particularly important distinction within semantic relatedness is that of thematic versus taxonomic relatedness. Next, we present a number of experiments that analyse taxonomic embeddings that have been trained on a synthetic corpus that has been generated via a random walk over a taxonomy. These experiments demonstrate how the properties of the synthetic corpus, such as the percentage of rare words, are affected by the shape of the knowledge graph the corpus is generated from. Finally, we explore the interactions between the relative sizes of natural and synthetic corpora on the performance of embeddings when taxonomic and thematic embeddings are combined.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/14/2018

Effects of Word Embeddings on Neural Network-based Pitch Accent Detection

Pitch accent detection often makes use of both acoustic and lexical feat...
research
11/28/2019

A New Corpus for Low-Resourced Sindhi Language with Word Embeddings

Representing words and phrases into dense vectors of real numbers which ...
research
04/06/2017

The Interplay of Semantics and Morphology in Word Embeddings

We explore the ability of word embeddings to capture both semantic and m...
research
08/05/2020

An exploration of the encoding of grammatical gender in word embeddings

The vector representation of words, known as word embeddings, has opened...
research
10/05/2016

Comparative study of LSA vs Word2vec embeddings in small corpora: a case study in dreams database

Word embeddings have been extensively studied in large text datasets. Ho...
research
11/13/2020

Learning language variations in news corpora through differential embeddings

There is an increasing interest in the NLP community in capturing variat...
research
12/25/2017

Generative Adversarial Nets for Multiple Text Corpora

Generative adversarial nets (GANs) have been successfully applied to the...

Please sign up or login with your details

Forgot password? Click here to reset