A Bayesian Model for Co-clustering of Mixed-Type Data (Un modèle Bayésien de co-clustering de données mixtes)

02/06/2019
by Aichetou Bouchareb, et al.

We propose a maximum a posteriori (MAP) Bayesian approach for performing and evaluating co-clustering of mixed-type data tables. The proposed model infers an optimal segmentation of all variables, then performs co-clustering by minimizing a Bayesian model selection cost function. One advantage of this approach is that it requires no user-set parameters. Another is that the proposed criterion gives an exact measure of model quality, namely the probability of the model fitting the data. Continually optimizing this criterion yields progressively better models while avoiding over-fitting. Experiments on real data demonstrate the value of this co-clustering approach for exploratory analysis of large data sets.
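
To make the idea of a MAP model selection cost concrete, here is a minimal Python sketch. It is not the criterion from the paper: it assumes a binary table only (not mixed-type data), a uniform prior over row and column cluster labels, and a Bernoulli likelihood per block with a uniform prior on the block rate. The function name `map_cost` and all parameters are hypothetical and chosen for illustration. The cost is the negative log prior plus the negative log marginal likelihood, so a lower cost means a more probable model.

```python
import math
from collections import Counter


def map_cost(data, row_labels, col_labels, n_row_clusters, n_col_clusters):
    """Toy cost = -log P(model) - log P(data | model) for a binary table.

    Assumptions (illustrative, not taken from the paper):
    - each row/column label is drawn uniformly and independently;
    - each block has a Bernoulli rate with a uniform Beta(1, 1) prior,
      integrated out analytically.
    """
    n_rows, n_cols = len(data), len(data[0])

    # -log prior: n_rows labels among n_row_clusters, same for columns.
    neg_log_prior = (n_rows * math.log(n_row_clusters)
                     + n_cols * math.log(n_col_clusters))

    # Count cells and ones falling in each (row cluster, column cluster) block.
    cells, ones = Counter(), Counter()
    for i, row in enumerate(data):
        for j, value in enumerate(row):
            block = (row_labels[i], col_labels[j])
            cells[block] += 1
            ones[block] += value

    # -log marginal likelihood of a block with n cells and k ones:
    # P = k! (n - k)! / (n + 1)!  under a uniform prior on the rate.
    neg_log_lik = 0.0
    for block, n in cells.items():
        k = ones[block]
        neg_log_lik += (math.lgamma(n + 2)
                        - math.lgamma(k + 1)
                        - math.lgamma(n - k + 1))

    return neg_log_prior + neg_log_lik


# A 6x6 binary table with two obvious diagonal blocks.
table = [[1, 1, 1, 0, 0, 0]] * 3 + [[0, 0, 0, 1, 1, 1]] * 3

single = map_cost(table, [0] * 6, [0] * 6, 1, 1)
two_by_two = map_cost(table, [0, 0, 0, 1, 1, 1], [0, 0, 0, 1, 1, 1], 2, 2)
one_cluster_per_row = map_cost(table, list(range(6)), [0, 0, 0, 1, 1, 1], 6, 2)
```

In this toy setting the 2x2 co-clustering gets a lower cost than both the single-block model and the model that puts every row in its own cluster, which illustrates how a prior-plus-likelihood criterion can reward structure without rewarding over-fitting.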
