Derivational Morphological Relations in Word Embeddings

06/06/2019
by Tomáš Musil, et al.

Derivation is a type of word-formation process that creates new words from existing ones by adding, changing, or deleting affixes. In this paper, we explore the potential of word embeddings to identify properties of word derivations in the morphologically rich Czech language. We extract derivational relations between pairs of words from DeriNet, a Czech lexical network that organizes almost one million Czech lemmata into derivational trees. For each such pair, we compute the difference between the embeddings of the two words and perform unsupervised clustering of the resulting vectors. Our results show that these clusters largely match manually annotated semantic categories of the derivational relations (e.g. the relation 'bake–baker' belongs to the category 'actor', and a correct clustering puts it into the same cluster as 'govern–governor').
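The pipeline described in the abstract (difference vectors for derivational pairs, followed by unsupervised clustering) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the 4-dimensional toy embeddings, the English word pairs (the paper works with Czech pairs from DeriNet), the helper names `difference_vectors` and `kmeans`, and the choice of k-means itself, since the abstract only says "unsupervised clustering".

```python
import numpy as np

# Hypothetical toy embeddings; real vectors would come from a trained model.
emb = {
    "bake":     np.array([0.9, 0.1, 0.3, 0.0]),
    "baker":    np.array([0.9, 1.0, 0.3, 0.1]),
    "govern":   np.array([0.1, 0.2, 0.8, 0.0]),
    "governor": np.array([0.2, 1.2, 0.8, 0.0]),
    "quick":    np.array([0.5, 0.0, 0.1, 0.1]),
    "quickly":  np.array([0.5, 0.1, 0.1, 1.1]),
    "slow":     np.array([0.3, 0.3, 0.6, 0.2]),
    "slowly":   np.array([0.3, 0.3, 0.7, 1.1]),
}

# (base, derived) pairs; in the paper these are extracted from DeriNet.
pairs = [("bake", "baker"), ("govern", "governor"),
         ("quick", "quickly"), ("slow", "slowly")]


def difference_vectors(pairs, emb):
    """Return one row per pair: embedding(derived) - embedding(base)."""
    return np.stack([emb[derived] - emb[base] for base, derived in pairs])


def kmeans(X, k, n_iter=10):
    """Plain Lloyd's k-means with deterministic farthest-point initialisation."""
    centers = [X[0]]
    for _ in range(1, k):
        # Distance of every point to its nearest already-chosen center.
        d = np.min([np.linalg.norm(X - c, axis=1) for c in centers], axis=0)
        centers.append(X[d.argmax()])
    centers = np.array(centers, dtype=float)
    for _ in range(n_iter):
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = X[labels == j]
            if len(members):
                centers[j] = members.mean(axis=0)
    return labels


diffs = difference_vectors(pairs, emb)
labels = kmeans(diffs, k=2)
for pair, label in zip(pairs, labels):
    print(pair, "-> cluster", label)
```

In this toy setting the two 'actor'-like pairs fall into one cluster and the two adverb-forming pairs into the other. With real embeddings the separation would be noisier, and the number of clusters would have to be chosen or estimated.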

