Zipf's law in 50 languages: its structural pattern, linguistic interpretation, and cognitive motivation

07/05/2018
by Shuiyuan Yu, et al.

Zipf's law has been found in many human-related fields, including language, where the frequency of a word is consistently found to be a power-law function of its frequency rank. However, it remains disputed whether this is a universal law or a statistical artifact, and little is known about the mechanisms that may have shaped it. To answer these questions, this study conducted a large-scale cross-language investigation into Zipf's law. The statistical results show that the word frequency distributions of all 50 languages share a 3-segment structural pattern, with each segment demonstrating distinctive linguistic properties and the lower segment invariably bending downwards to deviate from theoretical expectation. This finding indicates that the deviation is a fundamental and universal feature of word frequency distributions in natural languages, not a statistical error of low-frequency words. A computer simulation based on the dual-process theory yields Zipf's law with the same structural pattern, suggesting that Zipf's law in natural languages is motivated by common cognitive mechanisms. Taken together, these results show that Zipf's law in languages is motivated by cognitive mechanisms, such as dual processing, that govern human verbal behavior.
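
To make the rank-frequency relation concrete, the following is a minimal sketch, not the authors' procedure. It assumes the common form f(r) ∝ r^(-α), builds a synthetic token list whose frequencies are roughly 1000/r, and estimates α as the negated least-squares slope of log frequency against log rank. The function names `rank_frequency` and `zipf_exponent` and the synthetic corpus are illustrative assumptions. A single-slope fit like this one does not capture the 3-segment pattern reported in the abstract.

```python
import math
from collections import Counter


def rank_frequency(tokens):
    """Return word frequencies sorted in descending order (rank 1 first)."""
    return sorted(Counter(tokens).values(), reverse=True)


def zipf_exponent(freqs):
    """Negated least-squares slope of log(frequency) against log(rank)."""
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(freq) for freq in freqs]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return -sxy / sxx


# Synthetic corpus (an assumption for illustration only):
# the word of rank r occurs round(1000 / r) times.
tokens = []
for r in range(1, 51):
    tokens.extend([f"w{r}"] * round(1000 / r))

freqs = rank_frequency(tokens)
alpha = zipf_exponent(freqs)
print(f"estimated exponent: {alpha:.3f}")
```

On real text, `tokens` would come from a tokenized corpus instead of the synthetic loop. Ordinary least squares on log-log data is a simple but rough estimator, so the resulting value is best read as a descriptive summary.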

research
03/27/2019

Polysemy and brevity versus frequency in language

The pioneering research of G. K. Zipf on the relationship between word f...
research
12/01/2020

Statistical patterns of word frequency suggesting the probabilistic nature of human languages

Traditional linguistic theories have largely regarded language as a formal...
research
05/05/2020

Self-organizing Pattern in Multilayer Network for Words and Syllables

One of the ultimate goals for linguists is to find universal properties ...
research
01/09/2020

The empirical structure of word frequency distributions

The frequencies at which individual words occur across languages follow ...
research
08/23/2022

Universality and diversity in word patterns

Words are fundamental linguistic units that connect thoughts and things ...
research
08/22/2018

Deciding the status of controversial phonemes using frequency distributions; an application to semiconsonants in Spanish

Exploiting the fact that natural languages are complex systems, the pres...
research
05/04/2016

Compression and the origins of Zipf's law for word frequencies

Here we sketch a new derivation of Zipf's law for word frequencies based...
