Exploring Bayesian Models for Multi-level Clustering of Hierarchically Grouped Sequential Data

04/19/2015
by   Adway Mitra, et al.
0

A wide range of Bayesian models have been proposed for data that is divided hierarchically into groups. These models aim to cluster the data at different levels of grouping, by assigning a mixture component to each datapoint, and a mixture distribution to each group. Multi-level clustering is facilitated by the sharing of these components and distributions by the groups. In this paper, we introduce the concept of Degree of Sharing (DoS) for the mixture components and distributions, with an aim to analyze and classify various existing models. Next we introduce a generalized hierarchical Bayesian model, of which the existing models can be shown to be special cases. Unlike most of these models, our model takes into account the sequential nature of the data, and various other temporal structures at different levels while assigning mixture components and distributions. We show one specialization of this model aimed at hierarchical segmentation of news transcripts, and present a Gibbs Sampling based inference algorithm for it. We also show experimentally that the proposed model outperforms existing models for the same task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/26/2015

Nested Hierarchical Dirichlet Processes for Multi-Level Non-Parametric Admixture Modeling

Dirichlet Process(DP) is a Bayesian non-parametric prior for infinite mi...
research
06/27/2018

Quantile-based clustering

A new cluster analysis method, K-quantiles clustering, is introduced. K-...
research
04/29/2012

Dissimilarity Clustering by Hierarchical Multi-Level Refinement

We introduce in this paper a new way of optimizing the natural extension...
research
12/09/2022

Model-based clustering of categorical data based on the Hamming distance

A model-based approach is developed for clustering categorical data with...
research
11/10/2014

Sparse Estimation with Generalized Beta Mixture and the Horseshoe Prior

In this paper, the use of the Generalized Beta Mixture (GBM) and Horsesh...
research
08/12/2017

Bayesian Non-Exhaustive Classification for Active Online Name Disambiguation

The name disambiguation task partitions a collection of records pertaini...
research
03/08/2018

From Social to Individuals: a Parsimonious Path of Multi-level Models for Crowdsourced Preference Aggregation

In crowdsourced preference aggregation, it is often assumed that all the...

Please sign up or login with your details

Forgot password? Click here to reset