The explicit formula of the distributions of the nonoverlapping words and its applications to statistical tests for random numbers

05/11/2021
by Hayato Takahashi, et al.

Bassino et al. (2010) and Regnier et al. (1998) expressed the generating functions of the distributions of the number of occurrences of words (distributions of words, for short) in finite strings as rational functions. However, the coefficients in the expansions of these rational functions are complicated, and the rational functions do not yield a simple formula for the exact distributions of words. In this paper, we study the finite-dimensional generating functions of the distributions of nonoverlapping words for each fixed sample size and derive an explicit formula for the distributions of words under the Bernoulli model. We demonstrate that 1) tests based on the distributions of words reject the random number generator in the BSD library with a p-value of almost zero, and 2) the distributions of words can be computed for strings as long as human DNA.
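
To make the idea of an exact distribution for a nonoverlapping word concrete, the following is a minimal sketch and not necessarily the formula as stated in the paper. It assumes a binary Bernoulli source, a word w of length m in which no proper prefix equals a suffix, and a sample of size n. Under those assumptions the k-th binomial moment of the occurrence count N is C(n - mk + k, k) P(w)^k, because k occurrences of such a word must occupy disjoint blocks, and the standard binomial-moment inversion gives P(N = s) as an alternating sum. The function names and the parameter `p_one` are hypothetical and chosen only for illustration.

```python
from fractions import Fraction
from itertools import product
from math import comb


def is_nonoverlapping(word):
    """True if no proper prefix of `word` equals a suffix of `word`."""
    return not any(word[:k] == word[-k:] for k in range(1, len(word)))


def word_count_distribution(n, word, p_one=Fraction(1, 2)):
    """Exact P(N = s), s = 0..n // m, for a nonoverlapping binary word.

    Assumption: i.i.d. bits with P(bit == '1') = p_one.
    Uses binomial-moment inversion:
        P(N = s) = sum_{k >= s} (-1)^(k - s) C(k, s) C(n - m k + k, k) P(w)^k
    """
    if not is_nonoverlapping(word):
        raise ValueError("the word must be nonoverlapping")
    m = len(word)
    p_word = Fraction(1)
    for bit in word:
        p_word *= p_one if bit == "1" else 1 - p_one
    k_max = n // m
    # Binomial moments E[C(N, k)]: choose k disjoint blocks of length m.
    moments = [comb(n - m * k + k, k) * p_word ** k for k in range(k_max + 1)]
    return [
        sum((-1) ** (k - s) * comb(k, s) * moments[k] for k in range(s, k_max + 1))
        for s in range(k_max + 1)
    ]


def brute_force_distribution(n, word, p_one=Fraction(1, 2)):
    """Same distribution by enumerating all 2^n strings (small n only)."""
    m = len(word)
    dist = [Fraction(0)] * (n // m + 1)
    for bits in product("01", repeat=n):
        s = "".join(bits)
        count = sum(1 for i in range(n - m + 1) if s[i:i + m] == word)
        prob = Fraction(1)
        for bit in s:
            prob *= p_one if bit == "1" else 1 - p_one
        dist[count] += prob
    return dist
```

Because the sketch uses exact rational arithmetic, the closed form can be compared directly with brute-force enumeration for small n. For large n the alternating sum would need more careful numerical treatment than is shown here.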


research
11/26/2019

Bounding Zolotarev numbers using Faber rational functions

By closely following a construction by Ganelius, we construct Faber rati...
research
05/19/2020

On repetitiveness measures of Thue-Morse words

We show that the size γ(t_n) of the smallest string attractor of the nth...
research
11/29/2018

The distributions of sliding block patterns in finite samples and the inclusion-exclusion principles for partially ordered sets

In this paper we show the distributions of sliding block patterns for Be...
research
05/16/2022

Expected Frequency Matrices of Elections: Computation, Geometry, and Preference Learning

We use the "map of elections" approach of Szufa et al. (AAMAS 2020) to a...
research
07/03/2022

FPS In Action: An Easy Way To Find Explicit Formulas For Interlaced Hypergeometric Sequences

Linear recurrence equations with constant coefficients define the power ...
research
08/13/2018

Clustering genomic words in human DNA using peaks and trends of distributions

In this work we seek clusters of genomic words in human DNA by studying ...
research
12/15/2015

Towards Evaluation of Cultural-scale Claims in Light of Topic Model Sampling Effects

Cultural-scale models of full text documents are prone to over-interpret...
