An Algorithm for Fuzzification of WordNets, Supported by a Mathematical Proof

06/07/2020
by Sayyed-Ali Hossayni, et al.
WordNet-like Lexical Databases (WLDs) group words into sets of synonyms called "synsets." Although standard WLDs are used in many successful text-mining applications, they have a limitation: every word-sense is assumed to represent the meaning of its synset to the same degree, which is not generally true. To overcome this limitation, several fuzzy versions of synsets have been proposed. To the best of our knowledge, a common trait of these studies is that they do not aim to produce fuzzified versions of existing WLDs but instead build new WLDs from scratch. This has limited the attention they receive from the text-mining community, many of whose resources and applications are based on the existing WLDs. In this study, we present an algorithm for constructing fuzzy versions of WLDs of any language, given a corpus of documents and a word-sense disambiguation (WSD) system for that language. Then, using the Open American National Corpus and the UKB WSD system as inputs to the algorithm, we construct and publish online a fuzzified version of the English WordNet (FWN). We also give a mathematical proof of the validity of the algorithm's results.

research
01/23/2019

Context based Analysis of Lexical Semantics for Hindi Language

A word having multiple senses in a text introduces the lexical semantic ...
research
07/09/2020

DISCO PAL: Diachronic Spanish Sonnet Corpus with Psychological and Affective Labels

Nowadays, there are many applications of text mining over corpus from di...
research
03/03/2023

Mapping Wordnets on the Fly with Permanent Sense Keys

Most of the major databases on the semantic web have links to Princeton ...
research
06/30/2021

A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers

We present ASDiv (Academia Sinica Diverse MWP Dataset), a diverse (in te...
research
03/11/2022

When classifying grammatical role, BERT doesn't care about word order... except when it matters

Because meaning can often be inferred from lexical semantics alone, word...
research
08/29/2022

Extracting Mathematical Concepts from Text

We investigate different systems for extracting mathematical entities fr...
research
04/07/2022

Automatic WordNet Construction using Word Sense Induction through Sentence Embeddings

Language resources such as wordnets remain indispensable tools for diffe...
