WSD algorithm based on a new method of vector-word contexts proximity calculation via epsilon-filtration

05/24/2018
by Alexander Kirillov, et al.

This article addresses the problem of word sense disambiguation (WSD). Given sets of synonyms (synsets) and sentences containing these synonyms, the task is to select the meaning of a word in a sentence automatically. Experts tagged 1285 sentences, selecting one of the dictionary meanings for each target word. To solve the WSD problem, an algorithm based on a new method of calculating the proximity of vector-word contexts is proposed. To achieve higher accuracy, a preliminary epsilon-filtering of words is performed, both in the sentence and in the set of synonyms. An extensive program of experiments was carried out. Four algorithms were implemented, including the new one. The experiments show that in a number of cases the new algorithm gives better results. The developed software and the tagged corpus are released under an open license and are available online. Wiktionary and Wikisource are used. A brief description of this work is available as slides (https://goo.gl/9ak6Gt). A video lecture in Russian on this research is available online (https://youtu.be/-DLmRkepf58).
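
The abstract does not spell out the filtering rule, so the following is only a minimal sketch of one plausible reading, not the authors' algorithm. It assumes that epsilon-filtering keeps a word only if its cosine similarity to at least one word on the other side (sentence versus synset) reaches a threshold `eps`, and that the proximity of the two filtered contexts is the cosine similarity of their mean vectors. The names `epsilon_filter` and `disambiguate`, the toy 2-D vectors, and the sense labels are all hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def mean_vector(vecs):
    """Component-wise mean of a non-empty list of vectors."""
    return [sum(col) / len(vecs) for col in zip(*vecs)]

def epsilon_filter(words, others, vectors, eps):
    """Assumed rule: keep a word if it is eps-close to some word in `others`."""
    return [w for w in words
            if any(cosine(vectors[w], vectors[o]) >= eps for o in others)]

def disambiguate(sentence, synsets, vectors, eps):
    """Return (sense, score) for the synset whose filtered context is closest
    to the filtered sentence, or (None, -1.0) if filtering empties every pair."""
    best, best_score = None, -1.0
    for sense, syn_words in synsets.items():
        s_kept = epsilon_filter(sentence, syn_words, vectors, eps)
        y_kept = epsilon_filter(syn_words, sentence, vectors, eps)
        if not s_kept or not y_kept:
            continue  # nothing survives filtering for this sense
        score = cosine(mean_vector([vectors[w] for w in s_kept]),
                       mean_vector([vectors[w] for w in y_kept]))
        if score > best_score:
            best, best_score = sense, score
    return best, best_score

# Toy data (hypothetical): 2-D "embeddings" and two senses of "bank".
vectors = {
    "river": [1.0, 0.0], "water": [0.9, 0.1], "shore": [0.8, 0.2],
    "money": [0.0, 1.0], "loan": [0.1, 0.9], "the": [0.5, 0.5],
}
sentence = ["the", "river", "water"]
synsets = {"bank_river": ["shore", "river"], "bank_finance": ["money", "loan"]}

sense, score = disambiguate(sentence, synsets, vectors, eps=0.9)
print(sense)
```

In this toy setup the function word "the" is not close enough to any synset word and is dropped before the contexts are compared, which is the kind of noise reduction the filtering step is meant to provide. In practice the vectors would come from a pretrained word-embedding model, and the threshold would be tuned experimentally.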

research
03/05/2018

Calculated attributes of synonym sets

The goal of formalization, proposed in this paper, is to bring together,...
research
10/27/2021

Connect-the-Dots: Bridging Semantics between Words and Definitions via Aligning Word Sense Inventories

Word Sense Disambiguation (WSD) aims to automatically identify the exact...
research
08/25/2019

A Method for Estimating the Proximity of Vector Representation Groups in Multidimensional Space. On the Example of the Paraphrase Task

The following paper presents a method of comparing two sets of vectors. ...
research
06/06/2019

From Receptive to Productive: Learning to Use Confusing Words through Automatically Selected Example Sentences

Knowing how to use words appropriately has been a key to improving langu...
research
06/21/2013

Discriminative Training: Learning to Describe Video with Sentences, from Video Described with Sentences

We present a method for learning word meanings from complex and realisti...
research
10/25/2018

The Logoscope: a Semi-Automatic Tool for Detecting and Documenting French New Words

In this article we present the design and implementation of the Logoscop...
