The semi-hierarchical Dirichlet Process and its application to clustering homogeneous distributions

05/20/2020
by   Mario Beraha, et al.
0

Assessing homogeneity of distributions is an old problem that has received considerable attention, especially in the nonparametric Bayesian literature. To this effect, we propose the semi-hierarchical Dirichlet process, a novel hierarchical prior that extends the hierarchical Dirichlet process of Teh et al. (2006) and that avoids the degeneracy issues of nested processes recently described by Camerlenghi et al. (2019a). We go beyond the simple yes/no answer to the homogeneity question and embed the proposed prior in a random partition model; this procedure allows us to give a more comprehensive response to the above question and in fact find groups of populations that are internally homogeneous when I greater or equal than 2 such populations are considered. We study theoretical properties of the semi-hierarchical Dirichlet process and of the Bayes factor for the homogeneity test when I = 2. Extensive simulation studies and applications to educational data are also discussed.

READ FULL TEXT

page 22

page 25

research
01/18/2022

Flexible clustering via hidden hierarchical Dirichlet priors

The Bayesian approach to inference stands out for naturally allowing bor...
research
05/13/2019

Bayesian Hierarchical Mixture Clustering using Multilevel Hierarchical Dirichlet Processes

This paper focuses on the problem of hierarchical non-overlapping cluste...
research
07/04/2011

On a Rapid Simulation of the Dirichlet Process

We describe a simple and efficient procedure for approximating the Lévy ...
research
03/25/2021

Margin-free classification and new class detection using finite Dirichlet mixtures

We present a margin-free finite mixture model which allows us to simulta...
research
10/24/2016

Bayesian Nonparametric Modeling of Heterogeneous Groups of Censored Data

Datasets containing large samples of time-to-event data arising from sev...
research
05/06/2019

Spectral density estimation using P-spline priors

This article proposes a Bayesian approach to estimating the spectral den...
research
06/03/2019

Bayesian nonparametric graphical models for time-varying parameters VAR

Over the last decade, big data have poured into econometrics, demanding ...

Please sign up or login with your details

Forgot password? Click here to reset