Beyond the Chinese Restaurant and Pitman-Yor processes: Statistical Models with Double Power-law Behavior

02/13/2019
by   Fadhel Ayed, et al.
0

Bayesian nonparametric approaches, in particular the Pitman-Yor process and the associated two-parameter Chinese Restaurant process, have been successfully used in applications where the data exhibit a power-law behavior. Examples include natural language processing, natural images or networks. There is also growing empirical evidence that some datasets exhibit a two-regime power-law behavior: one regime for small frequencies, and a second regime, with a different exponent, for high frequencies. In this paper, we introduce a class of completely random measures which are doubly regularly-varying. Contrary to the Pitman-Yor process, we show that when completely random measures in this class are normalized to obtain random probability measures and associated random partitions, such partitions exhibit a double power-law behavior. We discuss in particular three models within this class: the beta prime process (Broderick et al. (2015, 2018), a novel process called generalized BFRY process, and a mixture construction. We derive efficient Markov chain Monte Carlo algorithms to estimate the parameters of these models. Finally, we show that the proposed models provide a better fit than the Pitman-Yor process on various datasets.

READ FULL TEXT
research
11/20/2017

Non-exchangeable random partition models for microclustering

Many popular random partition models, such as the Chinese restaurant pro...
research
10/09/2020

Generalization of the power-law rating curve using hydrodynamic theory and Bayesian hierarchical modeling

The power-law rating curve has been used extensively in hydraulic practi...
research
06/25/2022

Random Processes With Power Law Spectral Density

A statistical model of discrete finite length random processes with nega...
research
02/27/2019

Nonnegative Bayesian nonparametric factor models with completely random measures for community detection

We present a Bayesian nonparametric Poisson factorization model for mode...
research
08/07/2020

From the power law to extreme value mixture distributions

The power law is useful in describing count phenomena such as network de...
research
12/08/2015

Gibbs-type Indian buffet processes

We investigate a class of feature allocation models that generalize the ...
research
06/29/2021

Damping effect in innovation processes: case studies from Twitter

Understanding the innovation process, that is the underlying mechanisms ...

Please sign up or login with your details

Forgot password? Click here to reset