A tractable Multi-Partitions Clustering

01/22/2018
by   Matthieu Marbac, et al.

In the framework of model-based clustering, a model allowing several latent class variables is proposed. This model assumes that the distribution of the observed data can be factorized into several independent blocks of variables. Each block is assumed to follow a latent class model (i.e., a mixture with a conditional independence assumption). The proposed model includes variable selection as a special case and is able to cope with the mixed-data setting. The simplicity of the model makes it possible to estimate the partition of the variables into blocks and the mixture parameters simultaneously, thus avoiding the need to run an EM algorithm for each possible partition of the variables into blocks. For the proposed method, a model is defined by the number of blocks, the number of clusters inside each block, and the partition of the variables into blocks. Model selection can be done with two information criteria, the BIC and the MICL, for which an efficient optimization is proposed. The performance of the model is investigated on simulated and real data. It is shown that the proposed method gives a rich interpretation of the dataset at hand (i.e., analysis of the partition of the variables into blocks and analysis of the clusters produced by each block of variables).
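As a rough illustration of the structure described in the abstract, the sketch below evaluates the log-likelihood of a density that factorizes over independent blocks of variables, where each block is a mixture whose variables are conditionally independent given the block's cluster. Everything specific here is an assumption made for the example and is not taken from the paper: the variables are all continuous with univariate Gaussian components, the function names and the `blocks` data layout are hypothetical, and the BIC is written in the "log-likelihood minus half the parameter count times log n" convention (larger is better). The MICL criterion and the paper's estimation procedure are not reproduced.

```python
import math


def gaussian_logpdf(x, mean, sd):
    """Log-density of a univariate Gaussian (assumed component family)."""
    return -0.5 * math.log(2 * math.pi * sd * sd) - (x - mean) ** 2 / (2 * sd * sd)


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a list of floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def block_loglik(row, block):
    """Log-density of one observation restricted to one block of variables.

    Hypothetical layout: block = {"vars": column indices,
    "weights": mixing proportions, "means"/"sds": one list per cluster,
    aligned with "vars"}.
    """
    terms = []
    for k, weight in enumerate(block["weights"]):
        log_term = math.log(weight)
        # Conditional independence: sum the per-variable log-densities.
        for pos, j in enumerate(block["vars"]):
            log_term += gaussian_logpdf(row[j], block["means"][k][pos], block["sds"][k][pos])
        terms.append(log_term)
    return logsumexp(terms)


def loglik(data, blocks):
    """Blocks are independent, so their log-densities simply add up."""
    return sum(block_loglik(row, b) for row in data for b in blocks)


def n_free_params(blocks):
    """Per block: (clusters - 1) proportions plus a mean and an sd per cluster and variable."""
    return sum((len(b["weights"]) - 1) + 2 * len(b["weights"]) * len(b["vars"]) for b in blocks)


def bic(data, blocks):
    """BIC in the 'larger is better' convention assumed for this sketch."""
    return loglik(data, blocks) - 0.5 * n_free_params(blocks) * math.log(len(data))


# Toy example: three variables split into two blocks (made-up parameters).
data = [[0.0, 1.0, 5.0], [1.0, 0.0, -5.0]]
blocks = [
    {"vars": [0, 1], "weights": [1.0], "means": [[0.0, 0.0]], "sds": [[1.0, 1.0]]},
    {"vars": [2], "weights": [0.5, 0.5], "means": [[5.0], [-5.0]], "sds": [[1.0], [1.0]]},
]
print(bic(data, blocks))
```

Because the log-likelihood is a sum over blocks, moving a variable from one block to another only changes the terms of the two blocks involved, which gives some intuition for why the partition of the variables and the mixture parameters can be handled together rather than refitting from scratch for every candidate partition.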


research
01/06/2023

Non-parametric Multi-Partitions Clustering

In the framework of model-based clustering, a model, called multi-partit...
research
08/25/2018

Relaxing the Identically Distributed Assumption in Gaussian Co-Clustering for High Dimensional Data

A co-clustering model for continuous data that relaxes the identically d...
research
02/02/2023

High-dimensional variable clustering based on sub-asymptotic maxima of a weakly dependent random process

We propose a new class of models for variable clustering called Asymptot...
research
09/10/2020

A Family of Mixture Models for Biclustering

Biclustering is used for simultaneous clustering of the observations and...
research
11/08/2021

Adaptive Steganography Based on bargain Game

The capacity and security of the confidential message on the channel are...
research
12/16/2022

Modelling and analysis of rank ordered data with ties via a generalized Plackett-Luce model

A simple generative model for rank ordered data with ties is presented. ...
