Dirichlet Fragmentation Processes

09/16/2015
by Hong Ge, et al.

Tree structures are ubiquitous in data across many domains, and many datasets are naturally modelled by unobserved tree structures. In this paper, we first review the theory of random fragmentation processes [Bertoin, 2006] and a number of existing methods for modelling trees, including the popular nested Chinese restaurant process (nCRP). We then define a general class of probability distributions over trees, the Dirichlet fragmentation process (DFP), through a novel combination of the theory of Dirichlet processes and random fragmentation processes. The DFP admits a stick-breaking construction and relates to the nCRP in the same way that the Dirichlet process relates to the Chinese restaurant process. Furthermore, we develop a novel hierarchical mixture model based on the DFP and empirically compare it to similar models in machine learning. In these experiments, the DFP mixture model convincingly outperforms existing state-of-the-art approaches for hierarchical clustering and density modelling.
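
For readers unfamiliar with the analogy drawn above, the following minimal sketch illustrates the two standard views of the ordinary Dirichlet process: truncated stick-breaking weights and Chinese restaurant process seating. It is not the DFP or the nCRP from the paper. The function names `stick_breaking_weights` and `crp_table_sizes`, the truncation level, and the concentration value are assumptions chosen purely for illustration.

```python
import random


def stick_breaking_weights(alpha, num_sticks, rng=None):
    """Truncated stick-breaking weights for a Dirichlet process.

    Illustrative helper (hypothetical name): each break takes a
    Beta(1, alpha) fraction of whatever stick length is left.
    Returns the weights and the unbroken remainder.
    """
    rng = rng or random.Random()
    weights = []
    remaining = 1.0  # length of the stick not yet broken off
    for _ in range(num_sticks):
        fraction = rng.betavariate(1.0, alpha)
        weights.append(remaining * fraction)
        remaining *= 1.0 - fraction
    return weights, remaining


def crp_table_sizes(alpha, num_customers, rng=None):
    """Seat customers one at a time by the Chinese restaurant process.

    Illustrative helper (hypothetical name): customer n + 1 joins an
    existing table k with probability sizes[k] / (n + alpha) and opens
    a new table with probability alpha / (n + alpha).
    """
    rng = rng or random.Random()
    sizes = []
    for n in range(num_customers):
        u = rng.random() * (n + alpha)
        cumulative = 0.0
        for k, size in enumerate(sizes):
            cumulative += size
            if u < cumulative:
                sizes[k] += 1
                break
        else:
            # u fell in the final interval of length alpha: open a new table
            sizes.append(1)
    return sizes


if __name__ == "__main__":
    weights, leftover = stick_breaking_weights(alpha=2.0, num_sticks=20)
    print("largest stick weights:", sorted(weights, reverse=True)[:5])
    print("unallocated mass:", leftover)
    print("CRP table sizes:", crp_table_sizes(alpha=2.0, num_customers=100))
```

In both views a small number of components tends to hold most of the mass, and the table proportions under the Chinese restaurant process behave, in the limit, like a size-biased sample of the stick-breaking weights. The abstract states that the DFP plays the stick-breaking role for trees, with the nCRP as its counterpart.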
