Is space a word, too?

10/20/2017
by   Jake Ryland Williams, et al.
0

For words, rank-frequency distributions have long been heralded for adherence to a potentially-universal phenomenon known as Zipf's law. The hypothetical form of this empirical phenomenon was refined by Benîot Mandelbrot to that which is presently referred to as the Zipf-Mandelbrot law. Parallel to this, Herbet Simon proposed a selection model potentially explaining Zipf's law. However, a significant dispute between Simon and Mandelbrot, notable empirical exceptions, and the lack of a strong empirical connection between Simon's model and the Zipf-Mandelbrot law have left the questions of universality and mechanistic generation open. We offer a resolution to these issues by exhibiting how the dark matter of word segmentation, i.e., space, punctuation, etc., connect the Zipf-Mandelbrot law to Simon's mechanistic process. This explains Mandelbrot's refinement as no more than a fudge factor, accommodating the effects of the exclusion of the rank-frequency dark matter. Thus, integrating these non-word objects resolves a more-generalized rank-frequency law. Since this relies upon the integration of space, etc., we find support for the hypothesis that all are generated by common processes, indicating from a physical perspective that space is a word, too.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/09/2014

Zipf's Law and the Frequency of Characters or Words of Oracles

The article discusses the frequency of characters of Oracle,concluding t...
research
12/30/2017

The origins of Zipf's meaning-frequency law

In his pioneering research, G. K. Zipf observed that more frequent words...
research
12/29/2016

Verifying Heaps' law using Google Books Ngram data

This article is devoted to the verification of the empirical Heaps law i...
research
05/05/2020

Self-organizing Pattern in Multilayer Network for Words and Syllables

One of the ultimate goals for linguists is to find universal properties ...
research
06/29/2021

Damping effect in innovation processes: case studies from Twitter

Understanding the innovation process, that is the underlying mechanisms ...
research
05/24/2021

The advent and fall of a vocabulary learning bias from communicative efficiency

It is well-known that, when sufficiently young children encounter a new ...
research
05/26/2021

A Universal Law of Robustness via Isoperimetry

Classically, data interpolation with a parametrized model class is possi...

Please sign up or login with your details

Forgot password? Click here to reset