SST-BERT at SemEval-2020 Task 1: Semantic Shift Tracing by Clustering in BERT-based Embedding Spaces

10/02/2020
by   K Vani, et al.
0

Lexical semantic change detection (also known as semantic shift tracing) is a task of identifying words that have changed their meaning over time. Unsupervised semantic shift tracing, focal point of SemEval2020, is particularly challenging. Given the unsupervised setup, in this work, we propose to identify clusters among different occurrences of each target word, considering these as representatives of different word meanings. As such, disagreements in obtained clusters naturally allow to quantify the level of semantic shift per each target word in four target languages. To leverage this idea, clustering is performed on contextualized (BERT-based) embeddings of word occurrences. The obtained results show that our approach performs well both measured separately (per language) and overall, where we surpass all provided SemEval baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/16/2020

Unsupervised Embedding-based Detection of Lexical Semantic Changes

This paper describes EmbLexChange, a system introduced by the "Life-Lang...
research
10/07/2020

ELMo and BERT in semantic change detection for Russian

We study the effectiveness of contextualized embeddings for the task of ...
research
12/02/2019

Leveraging Contextual Embeddings for Detecting Diachronic Semantic Shift

We propose a new method that leverages contextual embeddings for the tas...
research
02/14/2023

A Psycholinguistic Analysis of BERT's Representations of Compounds

This work studies the semantic representations learned by BERT for compo...
research
01/18/2020

Capturing Evolution in Word Usage: Just Add More Clusters?

The way the words are used evolves through time, mirroring cultural or t...
research
04/04/2023

A Survey on Contextualised Semantic Shift Detection

Semantic Shift Detection (SSD) is the task of identifying, interpreting,...
research
05/15/2023

Unsupervised Semantic Variation Prediction using the Distribution of Sibling Embeddings

Languages are dynamic entities, where the meanings associated with words...

Please sign up or login with your details

Forgot password? Click here to reset