Moving Down the Long Tail of Word Sense Disambiguation with Gloss-Informed Biencoders

05/06/2020
by   Terra Blevins, et al.
0

A major obstacle in Word Sense Disambiguation (WSD) is that word senses are not uniformly distributed, causing existing models to generally perform poorly on senses that are either rare or unseen during training. We propose a bi-encoder model that independently embeds (1) the target word with its surrounding context and (2) the dictionary definition, or gloss, of each sense. The encoders are jointly optimized in the same representation space, so that sense disambiguation can be performed by finding the nearest sense embedding for each target word embedding. Our system outperforms previous state-of-the-art models on English all-words WSD; these gains predominantly come from improved performance on rare senses, leading to a 31.1 reduction on less frequent senses over prior work. This demonstrates that rare senses can be more effectively disambiguated by modeling their definitions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/20/2021

BERT Has Uncommon Sense: Similarity Ranking for Word Sense BERTology

An important question concerning contextualized word embedding (CWE) mod...
research
03/12/2023

MWE as WSD: Solving Multiword Expression Identification with Word Sense Disambiguation

Recent work in word sense disambiguation (WSD) utilizes encodings of the...
research
10/27/2021

Connect-the-Dots: Bridging Semantics between Words and Definitions via Aligning Word Sense Inventories

Word Sense Disambiguation (WSD) aims to automatically identify the exact...
research
12/14/2022

SMSMix: Sense-Maintained Sentence Mixup for Word Sense Disambiguation

Word Sense Disambiguation (WSD) is an NLP task aimed at determining the ...
research
06/12/2020

Evaluating a Multi-sense Definition Generation Model for Multiple Languages

Most prior work on definition modeling has not accounted for polysemy, o...
research
07/24/2017

Learning Rare Word Representations using Semantic Bridging

We propose a methodology that adapts graph embedding techniques (DeepWal...
research
02/16/2021

FEWS: Large-Scale, Low-Shot Word Sense Disambiguation with the Dictionary

Current models for Word Sense Disambiguation (WSD) struggle to disambigu...

Please sign up or login with your details

Forgot password? Click here to reset