Optimal Partitions for Nonparametric Multivariate Entropy Estimation

12/12/2021
by   Z. Keskin, et al.
0

Efficient and accurate estimation of multivariate empirical probability distributions is fundamental to the calculation of information-theoretic measures such as mutual information and transfer entropy. Common techniques include variations on histogram estimation which, whilst computationally efficient, often fail to closely approximate the probability density functions - particularly for distributions with fat tails or fine substructure, or when sample sizes are small. This paper demonstrates that the application of rotation operations can improve entropy estimates by aligning the geometry of the partition to the sample distribution. A method for generating equiprobable multivariate histograms is presented, using recursive binary partitioning, for which optimal rotations are found. Such optimal partitions were observed to be more accurate than existing techniques in estimating entropies of correlated bivariate Gaussian distributions with known theoretical values, across varying sample sizes (99% CI).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/02/2020

Information Theory in Density Destructors

Density destructors are differentiable and invertible transforms that ma...
research
02/07/2020

On the Estimation of Information Measures of Continuous Distributions

The estimation of information measures of continuous distributions based...
research
02/25/2021

Computing Accurate Probabilistic Estimates of One-D Entropy from Equiprobable Random Samples

We develop a simple Quantile Spacing (QS) method for accurate probabilis...
research
02/24/2017

Nonparanormal Information Estimation

We study the problem of using i.i.d. samples from an unknown multivariat...
research
11/18/2019

Estimating Entropy of Distributions in Constant Space

We consider the task of estimating the entropy of k-ary distributions fr...
research
09/08/2018

Hybrid Statistical Estimation of Mutual Information and its Application to Information Flow

Analysis of a probabilistic system often requires to learn the joint pro...
research
01/26/2021

Tree boosting for learning probability measures

Learning probability measures based on an i.i.d. sample is a fundamental...

Please sign up or login with your details

Forgot password? Click here to reset