Statistical analysis of word flow among five Indo-European languages

01/17/2023
by   Josué Ely Molina, et al.
0

A recent increase in data availability has allowed the possibility to perform different statistical linguistic studies. Here we use the Google Books Ngram dataset to analyze word flow among English, French, German, Italian, and Spanish. We study what we define as “migrant words”, a type of loanwords that do not change their spelling. We quantify migrant words from one language to another for different decades, and notice that most migrant words can be aggregated in semantic fields and associated to historic events. We also study the statistical properties of accumulated migrant words and their rank dynamics. We propose a measure of use of migrant words that could be used as a proxy of cultural influence. Our methodology is not exempt of caveats, but our results are encouraging to promote further studies in this direction.

READ FULL TEXT
research
07/21/2021

A Statistical Model of Word Rank Evolution

The availability of large linguistic data sets enables data-driven appro...
research
05/29/2017

Dynamics of core of language vocabulary

Studies of the overall structure of vocabulary and its dynamics became p...
research
09/21/2019

Generating Timelines by Modeling Semantic Change

Though languages can evolve slowly, they can also react strongly to dram...
research
10/07/2018

Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources

Transliteration converts words in a source language (e.g., English) into...
research
07/02/2022

Language statistics at different spatial, temporal, and grammatical scales

Statistical linguistics has advanced considerably in recent decades as d...
research
03/03/2015

Complexity and universality in the long-range order of words

As is the case of many signals produced by complex systems, language pre...
research
04/04/2016

In narrative texts punctuation marks obey the same statistics as words

From a grammar point of view, the role of punctuation marks in a sentenc...

Please sign up or login with your details

Forgot password? Click here to reset