Exact Good-Turing characterization of the two-parameter Poisson-Dirichlet superpopulation model

01/28/2019
by Annalisa Cerquetti, et al.

The large-sample equivalence between the celebrated approximate Good-Turing estimator of the probability of discovering a species already observed a certain number of times (Good, 1953) and its modern Bayesian nonparametric counterpart was recently established by means of a particular smoothing rule based on the two-parameter Poisson-Dirichlet model. Here we improve on this result by showing that, for any finite sample size, when the population frequencies are assumed to be drawn from a superpopulation with a two-parameter Poisson-Dirichlet distribution, Bayesian nonparametric estimation of the discovery probabilities corresponds to exact Good-Turing estimation. Moreover, under a general superpopulation hypothesis, the Good-Turing solution admits an interpretation as a modern Bayesian nonparametric estimator under partial information.
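To make the two estimators being compared concrete, the sketch below computes them side by side on a toy sample. It assumes the commonly cited forms: the approximate Good-Turing estimate (l + 1) m_{l+1} / n, and the two-parameter Poisson-Dirichlet (Pitman-Yor) predictive probabilities (l - alpha) m_l / (theta + n) for l >= 1 and (theta + alpha k) / (theta + n) for l = 0, where m_l is the number of species seen exactly l times and k is the number of distinct species. The function names, the toy sample, and the values of alpha and theta are illustrative assumptions; this does not reproduce the paper's exact finite-sample characterization.

```python
from collections import Counter


def frequency_counts(sample):
    """Return m, where m[l] is the number of species observed exactly l times."""
    species_counts = Counter(sample)
    return Counter(species_counts.values())


def good_turing(sample, l):
    """Approximate Good-Turing estimate (Good, 1953): (l + 1) * m_{l+1} / n."""
    n = len(sample)
    m = frequency_counts(sample)
    return (l + 1) * m.get(l + 1, 0) / n


def pitman_yor_discovery(sample, l, alpha, theta):
    """Two-parameter Poisson-Dirichlet predictive probability that the next
    observation belongs to a species seen exactly l times so far.
    Assumes 0 <= alpha < 1 and theta > -alpha."""
    n = len(sample)
    m = frequency_counts(sample)
    if l == 0:
        k = sum(m.values())  # number of distinct species observed
        return (theta + alpha * k) / (theta + n)
    return (l - alpha) * m.get(l, 0) / (theta + n)


# Toy sample (hypothetical): species a, b, c, d, e with counts 3, 2, 1, 2, 1.
sample = list("aaabbcdde")
alpha, theta = 0.5, 1.0  # illustrative parameter values only

for l in range(4):
    gt = good_turing(sample, l)
    py = pitman_yor_discovery(sample, l, alpha, theta)
    print(f"l={l}: Good-Turing={gt:.4f}  Poisson-Dirichlet={py:.4f}")
```

The Poisson-Dirichlet probabilities sum to one over l = 0, 1, ..., n by construction, whereas the approximate Good-Turing estimate for frequency l depends on the count m_{l+1} rather than m_l, which is why a smoothing rule is needed to relate the two.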


research
09/02/2023

Limiting Behaviour of Poisson-Dirichlet and Generalised Poisson-Dirichlet Distributions

We derive large-sample and other limiting distributions of the “frequenc...
research
06/25/2018

On consistent estimation of the missing mass

Given n samples from a population of individuals belonging to different ...
research
07/06/2018

Outperforming Good-Turing: Preliminary Report

Estimating a large alphabet probability distribution from a limited numb...
research
02/27/2019

A Good-Turing estimator for feature allocation models

Feature allocation models generalize species sampling models by allowing...
research
02/02/2018

A reversal phenomenon in estimation based on multiple samples from the Poisson--Dirichlet distribution

Consider two forms of sampling from a population: (i) drawing s samples ...
research
07/19/2023

Nonparametric estimation of the jump-size distribution for a stochastic storage system with periodic observations

This work presents a non-parametric estimator for the cumulative distrib...
research
11/12/2020

Bayesian nonparametric modelling of sequential discoveries

We aim at modelling the appearance of distinct tags in a sequence of lab...
