The Global Anchor Method for Quantifying Linguistic Shifts and Domain Adaptation

12/12/2018
by Zi Yin, et al.

Language is dynamic, constantly evolving and adapting with respect to time, domain, or topic. The adaptability of language is an active research area, in which researchers discover social, cultural, and domain-specific changes in language using distributional tools such as word embeddings. In this paper, we introduce the global anchor method for detecting corpus-level language shifts. We show both theoretically and empirically that the global anchor method is equivalent to the alignment method, a widely used method for comparing word embeddings, in terms of detecting corpus-level language shifts. Despite this equivalence in detection ability, we demonstrate that the global anchor method is superior in applicability, as it can compare embeddings of different dimensionalities. Furthermore, the global anchor method has implementation and parallelization advantages. We show that the global anchor method reveals fine structures in the evolution of language and domain adaptation. When combined with the graph Laplacian technique, the global anchor method recovers the evolution trajectory and domain clustering of disparate text corpora.
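
To make the dimensionality claim concrete, here is a minimal sketch of one way such a comparison can be computed. It assumes, since the abstract does not state the formula, that the global anchor distance between two embedding matrices E and F (one row per word, over a shared vocabulary) is the Frobenius norm of E E^T - F F^T, where each Gram matrix holds the inner products of every word with every other word. The function names and the toy vectors are hypothetical and chosen only for illustration.

```python
import math

def gram(E):
    """Inner products between every pair of word vectors (rows of E)."""
    return [[sum(a * b for a, b in zip(u, v)) for v in E] for u in E]

def global_anchor_distance(E, F):
    """Assumed form of the distance: Frobenius norm of E E^T - F F^T.

    E and F must cover the same vocabulary in the same row order,
    but their vector dimensionalities may differ.
    """
    if len(E) != len(F):
        raise ValueError("embeddings must share the same vocabulary")
    GE, GF = gram(E), gram(F)
    return math.sqrt(sum((x - y) ** 2
                         for row_e, row_f in zip(GE, GF)
                         for x, y in zip(row_e, row_f)))

# Toy embeddings for a three-word vocabulary (hypothetical values).
E = [[1, 0], [0, 1], [1, 1]]
rotated = [[0, 1], [-1, 0], [-1, 1]]        # E rotated by 90 degrees
padded = [[1, 0, 0], [0, 1, 0], [1, 1, 0]]  # E embedded in 3 dimensions
shifted = [[1, 0], [0, 1], [1, -1]]         # third word has moved

print(global_anchor_distance(E, rotated))   # rotation leaves the distance at zero
print(global_anchor_distance(E, padded))    # differing dimensionalities are fine
print(global_anchor_distance(E, shifted))   # a genuine shift gives a positive value
```

Because only word-to-word inner products enter the computation, no rotation has to be solved for and the two embeddings never need to share a dimensionality, which is consistent with the applicability advantage the abstract describes. The Gram matrices are vocabulary-by-vocabulary in size, so a realistic implementation would likely use a numerical library and avoid materializing them in full.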

research
01/13/2020

Visual Storytelling via Predicting Anchor Word Embeddings in the Stories

We propose a learning model for the task of visual storytelling. The mai...
research
12/02/2019

Leveraging Contextual Embeddings for Detecting Diachronic Semantic Shift

We propose a new method that leverages contextual embeddings for the tas...
research
10/06/2022

Domain-Specific Word Embeddings with Structure Prediction

Complementary to finding good general word embeddings, an important ques...
research
02/15/2021

How COVID-19 Is Changing Our Language : Detecting Semantic Shift in Twitter Word Embeddings

Words are malleable objects, influenced by events that are reflected in ...
research
06/09/2018

Diachronic word embeddings and semantic shifts: a survey

Recent years have witnessed a surge of publications aimed at tracing tem...
research
06/09/2016

Cultural Shift or Linguistic Drift? Comparing Two Computational Measures of Semantic Change

Words shift in meaning for many reasons, including cultural factors like...
research
05/16/2019

Tracing cultural diachronic semantic shifts in Russian using word embeddings: test sets and baselines

The paper introduces manually annotated test sets for the task of tracin...
