Human and Machine Judgements for Russian Semantic Relatedness

08/31/2017
by   Alexander Panchenko, et al.
0

Semantic relatedness of terms represents similarity of meaning by a numerical score. On the one hand, humans easily make judgments about semantic relatedness. On the other hand, this kind of information is useful in language processing systems. While semantic relatedness has been extensively studied for English using numerous language resources, such as associative norms, human judgments, and datasets generated from lexical databases, no evaluation resources of this kind have been available for Russian to date. Our contribution addresses this problem. We present five language resources of different scale and purpose for Russian semantic relatedness, each being a list of triples (word_i, word_j, relatedness_ij). Four of them are designed for evaluation of systems for computing semantic relatedness, complementing each other in terms of the semantic relation type they represent. These benchmarks were used to organize a shared task on Russian semantic relatedness, which attracted 19 teams. We use one of the best approaches identified in this competition to generate the fifth high-coverage resource, the first open distributional thesaurus of Russian. Multiple evaluations of this thesaurus, including a large-scale crowdsourcing study involving native speakers, indicate its high accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/15/2018

RUSSE: The First Workshop on Russian Semantic Similarity

The paper gives an overview of the Russian Semantic Similarity Evaluatio...
research
08/06/2016

HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment

We introduce HyperLex - a dataset and evaluation resource that quantifie...
research
12/23/2017

A Framework for Enriching Lexical Semantic Resources with Distributional Semantics

We present an approach to combining distributional semantic representati...
research
07/06/2020

A Broad-Coverage Deep Semantic Lexicon for Verbs

Progress on deep language understanding is inhibited by the lack of a br...
research
10/05/2020

Exploring Semantic Capacity of Terms

We introduce and study semantic capacity of terms. For example, the sema...
research
11/08/2017

Improving Hypernymy Extraction with Distributional Semantic Classes

In this paper, we show for the first time how distributionally-induced s...
research
11/27/2019

Large-Scale Noun Compound Interpretation Using Bootstrapping and the Web as a Corpus

Responding to the need for semantic lexical resources in natural languag...

Please sign up or login with your details

Forgot password? Click here to reset