Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change

05/30/2016
by William L. Hamilton, et al.

Understanding how words change their meanings over time is key to models of language and cultural evolution, but historical data on meaning is scarce, making theories hard to develop and test. Word embeddings show promise as a diachronic tool, but have not been carefully evaluated. We develop a robust methodology for quantifying semantic change by evaluating word embeddings (PPMI, SVD, word2vec) against known historical changes. We then use this methodology to reveal statistical laws of semantic evolution. Using six historical corpora spanning four languages and two centuries, we propose two quantitative laws of semantic change: (i) the law of conformity: the rate of semantic change scales with an inverse power law of word frequency; (ii) the law of innovation: independent of frequency, words that are more polysemous have higher rates of semantic change.
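As a rough illustration of what the law of conformity asserts, the sketch below fits a power law of the form rate ≈ c · frequency^β by linear regression in log-log space; an inverse power law corresponds to β < 0. This is not the authors' procedure: the function name `fit_power_law`, the frequencies, and the exponent -0.5 are all assumptions chosen for illustration, and the data are synthetic rather than drawn from the paper.

```python
import numpy as np


def fit_power_law(freq, rate):
    """Fit rate ~ c * freq**beta by least squares on log-transformed data.

    Hypothetical helper for illustration; returns (beta, c).
    """
    log_f = np.log(np.asarray(freq, dtype=float))
    log_r = np.log(np.asarray(rate, dtype=float))
    # A power law is a straight line in log-log space:
    # log(rate) = beta * log(freq) + log(c)
    beta, log_c = np.polyfit(log_f, log_r, 1)
    return beta, np.exp(log_c)


# Synthetic, noise-free data (assumed values, not results from the paper):
# rarer words are given higher rates of change via an exponent of -0.5.
freq = np.array([1e-6, 1e-5, 1e-4, 1e-3])
rate = 0.05 * freq ** -0.5

beta, c = fit_power_law(freq, rate)
print(f"fitted exponent beta = {beta:.3f}, prefactor c = {c:.3f}")
```

A negative fitted exponent means that more frequent words change more slowly. On real data the rate would first have to be measured, for example as a distance between a word's embeddings in successive time periods, and the fit would be noisy.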

research
06/30/2021

Zipf's laws of meaning in Catalan

In his pioneering research, G. K. Zipf formulated a couple of statistica...
research
05/30/2023

A Tale of Two Laws of Semantic Change: Predicting Synonym Changes with Distributional Semantic Models

Lexical Semantic Change is the study of how the meaning of words evolves...
research
09/21/2019

Generating Timelines by Modeling Semantic Change

Though languages can evolve slowly, they can also react strongly to dram...
research
08/05/2021

Evolution of emotion semantics

Humans possess the unique ability to communicate emotions through langua...
research
06/02/2018

Quantifying the dynamics of topical fluctuations in language

The availability of large diachronic corpora has provided the impetus fo...
research
09/10/2022

Subdiffusive semantic evolution in Indo-European languages

How do words change their meaning? Although semantic evolution is driven...
research
03/13/2019

GASC: Genre-Aware Semantic Change for Ancient Greek

Word meaning changes over time, depending on linguistic and extra-lingui...