A dependent partition-valued process for multitask clustering and time evolving network modelling

03/13/2013
by   Konstantina Palla, et al.
0

The fundamental aim of clustering algorithms is to partition data points. We consider tasks where the discovered partition is allowed to vary with some covariate such as space or time. One approach would be to use fragmentation-coagulation processes, but these, being Markov processes, are restricted to linear or tree structured covariate spaces. We define a partition-valued process on an arbitrary covariate space using Gaussian processes. We use the process to construct a multitask clustering model which partitions datapoints in a similar way across multiple data sources, and a time series model of network data which allows cluster assignments to vary over time. We describe sampling algorithms for inference and apply our method to defining cancer subtypes based on different types of cellular characteristics, finding regulatory modules from gene expression data from multiple human populations, and discovering time varying community structure in a social network.

READ FULL TEXT

page 7

page 8

research
08/19/2019

AdaptSPEC-X: Covariate Dependent Spectral Modeling of Multiple Nonstationary Time Series

We present a method for the joint analysis of a panel of possibly nonsta...
research
08/19/2021

Bayesian Semiparametric Hidden Markov Tensor Partition Models for Longitudinal Data with Local Variable Selection

We present a flexible Bayesian semiparametric mixed model for longitudin...
research
10/14/2021

Dynamical non-Gaussian modelling of spatial processes

Spatio-temporal processes in environmental applications are often assume...
research
01/29/2019

Centered Partition Process: Informative Priors for Clustering

There is a very rich literature proposing Bayesian approaches for cluste...
research
08/08/2020

Clustering Network Tree Data From Respondent-driven sampling with application to opioid users in New York City

There is great interest in finding meaningful subgroups of attributed ne...
research
12/23/2018

Detecting British Columbia Coastal Rainfall Patterns by Clustering Gaussian Processes

Functional data analysis is a statistical framework where data are assum...
research
09/04/2020

Adaptive preferential sampling in phylodynamics

Longitudinal molecular data of rapidly evolving viruses and pathogens pr...

Please sign up or login with your details

Forgot password? Click here to reset