Truly Bayesian Entropy Estimation

12/13/2022
by   Ioannis Papageorgiou, et al.
0

Estimating the entropy rate of discrete time series is a challenging problem with important applications in numerous areas including neuroscience, genomics, image processing and natural language processing. A number of approaches have been developed for this task, typically based either on universal data compression algorithms, or on statistical estimators of the underlying process distribution. In this work, we propose a fully-Bayesian approach for entropy estimation. Building on the recently introduced Bayesian Context Trees (BCT) framework for modelling discrete time series as variable-memory Markov chains, we show that it is possible to sample directly from the induced posterior on the entropy rate. This can be used to estimate the entire posterior distribution, providing much richer information than point estimates. We develop theoretical results for the posterior distribution of the entropy rate, including proofs of consistency and asymptotic normality. The practical utility of the method is illustrated on both simulated and real-world data, where it is found to outperform state-of-the-art alternatives.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/04/2022

Posterior Representations for Bayesian Context Trees: Sampling, Estimation and Convergence

We revisit the Bayesian Context Trees (BCT) modelling framework for disc...
research
03/08/2022

Change-point Detection and Segmentation of Discrete Data using Bayesian Context Trees

A new Bayesian modelling framework is introduced for piece-wise homogene...
research
11/04/2022

Context-tree weighting and Bayesian Context Trees: Asymptotic and non-asymptotic justifications

The Bayesian Context Trees (BCT) framework is a recently introduced, gen...
research
07/29/2020

Bayesian Context Trees: Modelling and exact inference for discrete time series

We develop a new Bayesian modelling framework for the class of higher-or...
research
10/13/2018

A Geometric Analysis of Time Series Leading to Information Encoding and a New Entropy Measure

A time series is uniquely represented by its geometric shape, which also...
research
09/23/2020

Estimating entropy rate from censored symbolic time series: a test for time-irreversibility

In this work we introduce a method for estimating entropy rate and entro...
research
11/09/2019

Estimation of entropy measures for categorical variables with spatial correlation

Entropy is a measure of heterogeneity widely used in applied sciences, o...

Please sign up or login with your details

Forgot password? Click here to reset