What do Language Models know about word senses? Zero-Shot WSD with Language Models and Domain Inventories

02/07/2023
by   Oscar Sainz, et al.

Language Models are at the core of almost every Natural Language Processing system nowadays. One of their key properties is their contextualized representations, a game-changing feature when disambiguation between word senses is necessary. In this paper we aim to explore to what extent language models are capable of discerning among senses at inference time. We perform this analysis by prompting commonly used Language Models such as BERT or RoBERTa to perform the task of Word Sense Disambiguation (WSD). We leverage the relation between word senses and domains, and cast WSD as a textual entailment problem, where the different hypotheses refer to the domains of the word senses. Our results show that this approach is indeed effective, approaching the performance of supervised systems.
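The idea of casting WSD as textual entailment over domain hypotheses can be illustrated with a minimal sketch using an off-the-shelf NLI model. This is not the authors' exact pipeline: the model name, the toy sense-to-domain mapping, and the hypothesis template below are illustrative assumptions.

```python
# Minimal sketch: zero-shot WSD via textual entailment, where each
# candidate sense is represented by its domain and the most-entailed
# domain hypothesis selects the sense. Assumes `transformers` is installed.
from transformers import pipeline

# An NLI model exposed through the zero-shot-classification pipeline,
# which scores each hypothesis by its entailment probability.
nli = pipeline("zero-shot-classification", model="roberta-large-mnli")

context = "The bank raised interest rates on all savings accounts."

# Hypothetical inventory mapping candidate senses of "bank" to domains.
sense_domains = {
    "bank%finance": "banking and finance",
    "bank%geography": "geography and rivers",
}

result = nli(
    context,
    candidate_labels=list(sense_domains.values()),
    hypothesis_template="The word 'bank' is related to {}.",
)

# Labels come back sorted by entailment score; the top domain wins.
best_domain = result["labels"][0]
best_sense = next(s for s, d in sense_domains.items() if d == best_domain)
print(best_sense)  # expected: bank%finance
```

Because the entailment model is pre-trained on NLI data rather than sense-annotated corpora, this selection requires no WSD supervision, which is what makes the approach zero-shot.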

Related research

09/18/2019  Subword ELMo
Embedding from Language Models (ELMo) has shown to be effective for impr...

10/15/2022  Temporal Word Meaning Disambiguation using TimeLMs
Meaning of words constantly changes given the events in modern civilizat...

08/26/2020  Language Models and Word Sense Disambiguation: An Overview and Analysis
Transformer-based language models have taken many fields in NLP by storm...

10/25/2021  No News is Good News: A Critique of the One Billion Word Benchmark
The One Billion Word Benchmark is a dataset derived from the WMT 2011 Ne...

08/10/2023  Do Language Models Refer?
What do language models (LMs) do with language? Everyone agrees that the...

06/09/2022  Ancestor-to-Creole Transfer is Not a Walk in the Park
We aim to learn language models for Creole languages for which large vol...

06/04/2021  Modeling the Unigram Distribution
The unigram distribution is the non-contextual probability of finding a ...
