Generalized Negative Binomial Processes and the Representation of Cluster Structures

10/07/2013
by   Mingyuan Zhou, et al.
0

The paper introduces the concept of a cluster structure to define a joint distribution of the sample size and its exchangeable random partitions. The cluster structure allows the probability distribution of the random partitions of a subset of the sample to be dependent on the sample size, a feature not presented in a partition structure. A generalized negative binomial process count-mixture model is proposed to generate a cluster structure, where in the prior the number of clusters is finite and Poisson distributed and the cluster sizes follow a truncated negative binomial distribution. The number and sizes of clusters can be controlled to exhibit distinct asymptotic behaviors. Unique model properties are illustrated with example clustering results using a generalized Polya urn sampling scheme. The paper provides new methods to generate exchangeable random partitions and to control both the cluster-number and cluster-size distributions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/20/2017

Non-exchangeable random partition models for microclustering

Many popular random partition models, such as the Chinese restaurant pro...
research
10/28/2014

Beta-Negative Binomial Process and Exchangeable Random Partitions for Mixed-Membership Modeling

The beta-negative binomial process (BNBP), an integer-valued stochastic ...
research
09/21/2020

Sample Size Calculation for Cluster Randomized Trials with Zero-inflated Count Outcomes

Cluster randomized trails (CRT) have been widely employed in medical and...
research
05/05/2021

Optimally partitioning signed networks based on generalized balance

Signed networks, which contain both positive and negative edges, are now...
research
08/23/2020

A Prior for Record Linkage Based on Allelic Partitions

In database management, record linkage aims to identify multiple records...
research
12/12/2021

Identifying bias in cluster quality metrics

We study potential biases of popular cluster quality metrics, such as co...
research
07/10/2019

A Range Matching CAM for Hierarchical Defect Tolerance Technique in NRAM Structures

Due to the small size of nanoscale devices, they are highly prone to pro...

Please sign up or login with your details

Forgot password? Click here to reset