Centered Partition Process: Informative Priors for Clustering

01/29/2019
by   Sally Paganin, et al.
0

There is a very rich literature proposing Bayesian approaches for clustering starting with a prior probability distribution on partitions. Most approaches assume exchangeability, leading to simple representations in terms of Exchangeable Partition Probability Functions (EPPF). Gibbs-type priors encompass a broad class of such cases, including Dirichlet and Pitman-Yor processes. Even though there have been some proposals to relax the exchangeability assumption, allowing covariate-dependence and partial exchangeability, limited consideration has been given on how to include concrete prior knowledge on the partition. For example, we are motivated by an epidemiological application, in which we wish to cluster birth defects into groups and we have prior knowledge of an initial clustering provided by experts. As a general approach for including such prior knowledge, we propose a Centered Partition (CP) process that modifies the EPPF to favor partitions close to an initial one. Some properties of the CP prior are described, a general algorithm for posterior computation is developed, and we illustrate the methodology through simulation examples and an application to the motivating epidemiology study of birth defects.

READ FULL TEXT

page 10

page 11

page 24

research
07/04/2019

An enriched mixture model for functional clustering

There is an increasingly rich literature about Bayesian nonparametric mo...
research
08/15/2022

Flexible Bayesian Multiple Comparison Adjustment Using Dirichlet Process and Beta-Binomial Model Priors

Researchers frequently wish to assess the equality or inequality of grou...
research
10/15/2022

Clustering blood donors via mixtures of product partition models with covariates

Motivated by the problem of accurately predicting gap times between succ...
research
06/04/2023

Bayesian nonparametric modeling of latent partitions via Stirling-gamma priors

Dirichlet process mixtures are particularly sensitive to the value of th...
research
03/13/2013

A dependent partition-valued process for multitask clustering and time evolving network modelling

The fundamental aim of clustering algorithms is to partition data points...
research
12/01/2021

Prior knowledge elicitation: The past, present, and future

Specification of the prior distribution for a Bayesian model is a centra...
research
03/30/2023

A review on Bayesian model-based clustering

Clustering is an important task in many areas of knowledge: medicine and...

Please sign up or login with your details

Forgot password? Click here to reset