GM-CTSC at SemEval-2020 Task 1: Gaussian Mixtures Cross Temporal Similarity Clustering

05/20/2020
by Pierluigi Cassotti, et al.

This paper describes the system proposed for SemEval-2020 Task 1: Unsupervised Lexical Semantic Change Detection. We focused our approach on the detection problem. Given the semantics of words captured by temporal word embeddings in different time periods, we investigate the use of unsupervised methods to detect when a target word has gained or lost senses. To this end, we defined a new algorithm based on Gaussian Mixture Models to cluster the target similarities computed over the two periods. We compared the proposed approach with a number of similarity-based thresholds. We found that, although the performance of the detection methods varies across the word embedding algorithms, the combination of Gaussian Mixture with Temporal Referencing resulted in our best system.
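
The clustering idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that each target word already has one cross-period similarity score (for example, the cosine similarity between its embeddings in the two periods), that a two-component one-dimensional Gaussian mixture is fitted to those scores, and that the component with the lower mean similarity corresponds to changed words. The function names (`cosine`, `fit_gmm_1d`, `detect_change`), the number of components, the variance floor, and the example scores are all hypothetical choices made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def normal_pdf(x, mean, var):
    """Density of a univariate normal distribution at x."""
    return math.exp(-((x - mean) ** 2) / (2 * var)) / math.sqrt(2 * math.pi * var)


def fit_gmm_1d(xs, n_iter=100, min_var=1e-3):
    """Fit a two-component 1-D Gaussian mixture with EM.

    Assumption: components are initialised at the smallest and largest
    score, and variances are floored at min_var to avoid collapse.
    Returns (weights, means, variances).
    """
    overall_mean = sum(xs) / len(xs)
    var0 = max(sum((x - overall_mean) ** 2 for x in xs) / len(xs), min_var)
    weights, means, variances = [0.5, 0.5], [min(xs), max(xs)], [var0, var0]
    for _ in range(n_iter):
        # E-step: responsibility of each component for each score.
        resp = []
        for x in xs:
            p = [weights[k] * normal_pdf(x, means[k], variances[k]) for k in range(2)]
            total = sum(p) or 1e-300
            resp.append([pk / total for pk in p])
        # M-step: re-estimate weight, mean and variance per component.
        for k in range(2):
            nk = sum(r[k] for r in resp)
            if nk < 1e-12:
                continue  # empty component: keep previous parameters
            weights[k] = nk / len(xs)
            means[k] = sum(r[k] * x for r, x in zip(resp, xs)) / nk
            var = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, xs)) / nk
            variances[k] = max(var, min_var)
    return weights, means, variances


def detect_change(similarities):
    """Map each word to 1 (changed) or 0 (stable).

    Assumption: the mixture component with the lower mean similarity
    is interpreted as the "changed" cluster.
    """
    words = list(similarities)
    xs = [similarities[w] for w in words]
    weights, means, variances = fit_gmm_1d(xs)
    low = 0 if means[0] <= means[1] else 1
    labels = {}
    for w, x in zip(words, xs):
        p = [weights[k] * normal_pdf(x, means[k], variances[k]) for k in range(2)]
        labels[w] = int(p[low] > p[1 - low])
    return labels


if __name__ == "__main__":
    # Made-up similarity scores, for illustration only.
    scores = {"plane": 0.21, "tip": 0.25, "gas": 0.30,
              "tree": 0.85, "face": 0.88, "word": 0.90, "chairman": 0.82}
    print(detect_change(scores))
```

Compared with a fixed similarity threshold, a mixture model lets the decision boundary adapt to the score distribution produced by each embedding method, which is one plausible reason to prefer it when scores are not comparable across models.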


research
11/12/2019

Learning Multi-Sense Word Distributions using Approximate Kullback-Leibler Divergence

Learning word representations has garnered greater attention in the rece...
research
11/19/2015

Gaussian Mixture Embeddings for Multiple Word Prototypes

Recently, word representation has been increasingly focused on for its e...
research
07/10/2020

GloVeInit at SemEval-2020 Task 1: Using GloVe Vector Initialization for Unsupervised Lexical Semantic Change Detection

This paper presents a vector initialization approach for the SemEval2020...
research
11/05/2020

QMUL-SDS @ DIACR-Ita: Evaluating Unsupervised Diachronic Lexical Semantics Classification in Italian

In this paper, we present the results and main findings of our system fo...
research
05/16/2020

Unsupervised Embedding-based Detection of Lexical Semantic Changes

This paper describes EmbLexChange, a system introduced by the "Life-Lang...
research
02/10/2017

Using Word Embedding for Cross-Language Plagiarism Detection

This paper proposes to use distributed representation of words (word emb...
research
11/23/2022

Unsupervised User-Based Insider Threat Detection Using Bayesian Gaussian Mixture Models

Insider threats are a growing concern for organizations due to the amoun...
