Determining the Number of Samples Required to Estimate Entropy in Natural Sequences

05/23/2018
by   Andrew D. Back, et al.
0

Calculating the Shannon entropy for symbolic sequences has been widely considered in many fields. For descriptive statistical problems such as estimating the N-gram entropy of English language text, a common approach is to use as much data as possible to obtain progressively more accurate estimates. However in some instances, only short sequences may be available. This gives rise to the question of how many samples are needed to compute entropy. In this paper, we examine this problem and propose a method for estimating the number of samples required to compute Shannon entropy for a set of ranked symbolic natural events. The result is developed using a modified Zipf-Mandelbrot law and the Dvoretzky-Kiefer-Wolfowitz inequality, and we propose an algorithm which yields an estimate for the minimum number of samples required to obtain an estimate of entropy with a given confidence level and degree of accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/25/2020

Cumulative Tsallis Entropy for Maximum Ranked Set Sampling with Unequal Samples

In this paper, we consider the information content of maximum ranked set...
research
10/27/2018

Estimating Differential Entropy under Gaussian Convolutions

This paper studies the problem of estimating the differential entropy h(...
research
02/07/2020

On the Estimation of Information Measures of Continuous Distributions

The estimation of information measures of continuous distributions based...
research
03/03/2023

Computation of Reliability Statistics for Success-Failure Experiments

Reliability is probability of success in a success-failure experiment. C...
research
07/09/2022

Accurate estimation of dynamical quantities for nonequilibrium nanoscale system

Fluctuations of dynamical quantities are fundamental and inevitable. For...
research
03/21/2002

Entropy estimation of symbol sequences

We discuss algorithms for estimating the Shannon entropy h of finite sym...
research
05/05/2020

An improved estimate of the inverse binary entropy function

Two estimates for the inverse binary entropy function are derived using ...

Please sign up or login with your details

Forgot password? Click here to reset