Similarity-based Random Partition Distribution for Clustering Functional Data

08/03/2023
by   Tomoya Wakayama, et al.
0

Random partitioned distribution is a powerful tool for model-based clustering. However, the implementation in practice can be challenging for functional spatial data such as hourly observed population data observed in each region. The reason is that high dimensionality tends to yield excess clusters, and spatial dependencies are challenging to represent with a simple random partition distribution (e.g., the Dirichlet process). This paper addresses these issues by extending the generalized Dirichlet process to incorporate pairwise similarity information, which we call the similarity-based generalized Dirichlet process (SGDP), and provides theoretical justification for this approach. We apply SGDP to hourly population data observed in 500m meshes in Tokyo, and demonstrate its usefulness for functional clustering by taking account of spatial information.

READ FULL TEXT

page 15

page 18

research
06/04/2023

Bayesian nonparametric modeling of latent partitions via Stirling-gamma priors

Dirichlet process mixtures are particularly sensitive to the value of th...
research
07/19/2022

Clustering constrained on linear networks

An unsupervised classification method for point events occurring on a ne...
research
01/18/2022

Flexible clustering via hidden hierarchical Dirichlet priors

The Bayesian approach to inference stands out for naturally allowing bor...
research
01/04/2010

Inference of global clusters from locally distributed data

We consider the problem of analyzing the heterogeneity of clustering dis...
research
04/23/2019

Model based functional clustering of varved lake sediments

In this paper we propose a model-based method for clustering subjects fo...
research
03/30/2023

A review on Bayesian model-based clustering

Clustering is an important task in many areas of knowledge: medicine and...
research
04/26/2021

Powered Dirichlet Process for Controlling the Importance of "Rich-Get-Richer" Prior Assumptions in Bayesian Clustering

One of the most used priors in Bayesian clustering is the Dirichlet prio...

Please sign up or login with your details

Forgot password? Click here to reset