Approaches to the classification of complex systems: Words, texts, and more

05/09/2022
by   Andrij Rovenchak, et al.
0

The Chapter starts with introductory information about quantitative linguistics notions, like rank–frequency dependence, Zipf's law, frequency spectra, etc. Similarities in distributions of words in texts with level occupation in quantum ensembles hint at a superficial analogy with statistical physics. This enables one to define various parameters for texts based on this physical analogy, including "temperature", "chemical potential", entropy, and some others. Such parameters provide a set of variables to classify texts serving as an example of complex systems. Moreover, texts are perhaps the easiest complex systems to collect and analyze. Similar approaches can be developed to study, for instance, genomes due to well-known linguistic analogies. We consider a couple of approaches to define nucleotide sequences in mitochondrial DNAs and viral RNAs and demonstrate their possible application as an auxiliary tool for comparative analysis of genomes. Finally, we discuss entropy as one of the parameters, which can be easily computed from rank–frequency dependences. Being a discriminating parameter in some problems of classification of complex systems, entropy can be given a proper interpretation only in a limited class of problems. Its overall role and significance remain an open issue so far.

READ FULL TEXT

page 6

page 12

research
02/07/2021

Word frequency-rank relationship in tagged texts

We analyze the frequency-rank relationship in sub-vocabularies correspon...
research
11/14/2016

Quantitative Entropy Study of Language Complexity

We study the entropy of Chinese and English texts, based on characters i...
research
06/13/2020

Words ranking and Hirsch index for identifying the core of the hapaxes in political texts

This paper deals with a quantitative analysis of the content of official...
research
08/22/2018

Deciding the status of controversial phonemes using frequency distributions; an application to semiconsonants in Spanish

Exploiting the fact that natural languages are complex systems, the pres...
research
02/04/2016

Complex Networks of Words in Fables

In this chapter we give an overview of the application of complex networ...
research
05/02/2022

A Two Parameters Equation for Word Rank-Frequency Relation

Let f (·) be the absolute frequency of words and r be the rank of words ...
research
04/11/2023

Mathematical and Linguistic Characterization of Orhan Pamuk's Nobel Works

In this study, Nobel Laureate Orhan Pamuk's works are chosen as examples...

Please sign up or login with your details

Forgot password? Click here to reset