Watset: Automatic Induction of Synsets from a Graph of Synonyms

04/24/2017
by   Dmitry Ustalov, et al.
0

This paper presents a new graph-based approach that induces synsets using synonymy dictionaries and word embeddings. First, we build a weighted graph of synonyms extracted from commonly available resources, such as Wiktionary. Second, we apply word sense induction to deal with ambiguous words. Finally, we cluster the disambiguated version of the ambiguous input graph into synsets. Our meta-clustering approach lets us use an efficient hard clustering algorithm to perform a fuzzy clustering of the graph. Despite its simplicity, our approach shows excellent results, outperforming five competitive state-of-the-art methods in terms of F-score on three gold standard datasets for English and Russian derived from large-scale manually constructed lexical resources.

READ FULL TEXT
research
08/20/2018

Local-Global Graph Clustering with Applications in Sense and Frame Induction

We present Watset, a new meta-algorithm for fuzzy graph clustering. This...
research
08/30/2017

Fighting with the Sparsity of Synonymy Dictionaries

Graph-based synset induction methods, such as MaxMax and Watset, induce ...
research
05/12/2018

Unsupervised Semantic Frame Induction using Triclustering

We use dependency triples automatically extracted from a Web-scale corpu...
research
09/28/2022

RuDSI: graph-based word sense induction dataset for Russian

We present RuDSI, a new benchmark for word sense induction (WSI) in Russ...
research
05/23/2018

How much does a word weigh? Weighting word embeddings for word sense induction

The paper describes our participation in the first shared task on word s...
research
05/27/2021

Semantic Frame Induction using Masked Word Embeddings and Two-Step Clustering

Recent studies on semantic frame induction show that relatively high per...
research
09/26/2021

An Analysis of Euclidean vs. Graph-Based Framing for Bilingual Lexicon Induction from Word Embedding Spaces

Much recent work in bilingual lexicon induction (BLI) views word embeddi...

Please sign up or login with your details

Forgot password? Click here to reset