RUSSE'2018: A Shared Task on Word Sense Induction for the Russian Language

03/15/2018
by   Alexander Panchenko, et al.
0

The paper describes the results of the first shared task on word sense induction (WSI) for the Russian language. While similar shared tasks were conducted in the past for some Romance and Germanic languages, we explore the performance of sense induction and disambiguation methods for a Slavic language that shares many features with other Slavic languages, such as rich morphology and free word order. The participants were asked to group contexts with a given word in accordance with its senses that were not provided beforehand. For instance, given a word "bank" and a set of contexts with this word, e.g. "bank is a financial institution that accepts deposits" and "river bank is a slope beside a body of water", a participant was asked to cluster such contexts in the unknown in advance number of clusters corresponding to, in this case, the "company" and the "area" senses of the word "bank". For the purpose of this evaluation campaign, we developed three new evaluation datasets based on sense inventories that have different sense granularity. The contexts in these datasets were sampled from texts of Wikipedia, the academic corpus of Russian, and an explanatory dictionary of Russian. Overall 18 teams participated in the competition submitting 383 models. Multiple teams managed to substantially outperform competitive state-of-the-art baselines from the previous years based on sense embeddings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/23/2018

How much does a word weigh? Weighting word embeddings for word sense induction

The paper describes our participation in the first shared task on word s...
research
05/22/2020

RUSSE'2020: Findings of the First Taxonomy Enrichment Task for the Russian language

This paper describes the results of the first shared task on taxonomy en...
research
11/22/2018

AutoSense Model for Word Sense Induction

Word sense induction (WSI), or the task of automatically discovering mul...
research
06/15/2022

The SIGMORPHON 2022 Shared Task on Morpheme Segmentation

The SIGMORPHON 2022 shared task on morpheme segmentation challenged syst...
research
06/17/2016

Sense Embedding Learning for Word Sense Induction

Conventional word sense induction (WSI) methods usually represent each i...
research
09/28/2022

RuDSI: graph-based word sense induction dataset for Russian

We present RuDSI, a new benchmark for word sense induction (WSI) in Russ...
research
06/24/2023

UAlberta at SemEval-2023 Task 1: Context Augmentation and Translation for Multilingual Visual Word Sense Disambiguation

We describe the systems of the University of Alberta team for the SemEva...

Please sign up or login with your details

Forgot password? Click here to reset