On a Class of Markov Order Estimators Based on PPM and Other Universal Codes

03/10/2020
by   Łukasz Dębowski, et al.
0

We investigate a class of estimators of the Markov order for stationary ergodic processes which form a slight modification of the construction by Merhav, Gutman, and Ziv (1989). Both kinds of estimators compare the estimate of the entropy rate given by a universal code with the empirical conditional entropy of a string and return the order for which the two quantities are approximately equal. However, our modification, which we call universal Markov orders, satisfies a few attractive properties, not shown by Merhav, Gutman, and Ziv (1989) for their original definition. Firstly, the universal Markov orders are almost surely consistent, without any restrictions. Secondly, they are upper bounded asymptotically by the logarithm of the string length divided by the entropy rate. Thirdly, if we choose the Prediction by Partial Matching (PPM) as the universal code then the number of distinct substrings of the length equal to the universal Markov order constitutes an upper bound for the block mutual information. Thus universal Markov orders can be also used indirectly for quantification of long memory for an ergodic process.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/24/2022

Universal Densities Exist for Every Finite Reference Measure

As it is known, universal codes, which estimate the entropy rate consist...
research
09/18/2021

A Tighter Upper Bound of the Expansion Factor for Universal Coding of Integers and Its Code Constructions

In entropy coding, universal coding of integers (UCI) is a binary univer...
research
09/27/2022

Local Grammar-Based Coding Revisited

We revisit the problem of minimal local grammar-based coding. In this se...
research
09/20/2022

Recurrence times, waiting times and universal entropy production estimators

The universal typical-signal estimators of entropy and cross entropy bas...
research
02/01/2023

Order-Preserving Squares in Strings

An order-preserving square in a string is a fragment of the form uv wher...
research
04/15/2022

Generalized Universal Coding of Integers

Universal coding of integers (UCI) is a class of variable-length code, s...
research
02/01/2018

Redundancy of Markov Family with Unbounded Memory

We study the redundancy of universally compressing strings X_1,..., X_n ...

Please sign up or login with your details

Forgot password? Click here to reset