Profile Entropy: A Fundamental Measure for the Learnability and Compressibility of Discrete Distributions

02/26/2020
by Yi Hao, et al.

The profile of a sample is the multiset of its symbol frequencies. We show that for samples of discrete distributions, profile entropy is a fundamental measure unifying the concepts of estimation, inference, and compression. Specifically, profile entropy a) determines the speed of estimating the distribution relative to the best natural estimator; b) characterizes the rate of inferring all symmetric properties compared with the best estimator over any label-invariant distribution collection; c) serves as the limit of profile compression, for which we derive optimal near-linear-time block and sequential algorithms. To further our understanding of profile entropy, we investigate its attributes, provide algorithms for approximating its value, and determine its magnitude for numerous structural distribution families.
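
To make the opening definition concrete, the sketch below computes the profile of a sample as a sorted tuple of symbol frequencies. It also computes a brute-force value of profile entropy for very small cases, under the assumption that profile entropy means the Shannon entropy of the profile of n i.i.d. draws; the abstract does not spell out this definition. The function names `profile` and `profile_entropy` are illustrative only and are not taken from the paper, and the enumeration is exponential in n, so it bears no relation to the near-linear-time algorithms the abstract mentions.

```python
from collections import Counter
from itertools import product
from math import log2


def profile(sample):
    """Return the profile of a sample: the multiset of its symbol
    frequencies, represented here as a sorted tuple of counts."""
    return tuple(sorted(Counter(sample).values()))


def profile_entropy(probs, n):
    """Shannon entropy (in bits) of the profile of n i.i.d. draws.

    Assumption: profile entropy is the entropy of the random profile.
    Computed by enumerating all len(probs)**n sequences, so this is
    only usable for tiny alphabets and sample sizes.
    """
    profile_mass = Counter()
    for seq in product(range(len(probs)), repeat=n):
        # Probability of this exact sequence under i.i.d. sampling.
        p = 1.0
        for symbol in seq:
            p *= probs[symbol]
        profile_mass[profile(seq)] += p
    return -sum(q * log2(q) for q in profile_mass.values() if q > 0)


if __name__ == "__main__":
    print(profile("abracadabra"))
    print(profile_entropy([0.5, 0.5], 2))
```

Because the profile discards symbol labels, relabeling the sample leaves it unchanged: `profile("aab")` and `profile("xxy")` are the same tuple. This is the sense in which the profile suits symmetric (label-invariant) properties.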


