A robust model-based clustering based on the geometric median and the Median Covariation Matrix

Grouping observations into homogeneous groups is a recurrent task in statistical data analysis. We consider Gaussian Mixture Models, which are among the most widely used parametric model-based clustering methods. We propose a new robust approach to model-based clustering that modifies the EM algorithm (more specifically, the M-step) by replacing the estimates of the mean and the variance with robust versions based on the median and the median covariation matrix. All the proposed methods are available in the R package RGMM, accessible on CRAN.
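To make the idea of a robust M-step more concrete, here is a minimal Python sketch of the two robust estimators named in the abstract, computed with Weiszfeld-type iterations. It is an illustration under stated assumptions, not the RGMM implementation: the function names `weighted_geometric_median` and `median_covariation_matrix` are hypothetical, the weights are assumed to be the posterior membership probabilities produced by the E-step for one cluster, and the median covariation matrix is taken to be the geometric median (in Frobenius norm) of the rank-one matrices (x - m)(x - m)^T. The further step of turning the median covariation matrix into a variance estimate is not shown.

```python
import math


def weighted_geometric_median(points, weights, n_iter=200, tol=1e-9):
    """Weiszfeld iterations for argmin_m sum_i w_i * ||x_i - m||."""
    dim = len(points[0])
    total = sum(weights)
    # Start from the weighted mean (the non-robust M-step estimate).
    m = [sum(w * p[j] for p, w in zip(points, weights)) / total
         for j in range(dim)]
    for _ in range(n_iter):
        num = [0.0] * dim
        den = 0.0
        for p, w in zip(points, weights):
            dist = math.sqrt(sum((a - b) ** 2 for a, b in zip(p, m)))
            coef = w / max(dist, 1e-12)  # guard against division by zero
            den += coef
            for j in range(dim):
                num[j] += coef * p[j]
        new_m = [v / den for v in num]
        shift = math.sqrt(sum((a - b) ** 2 for a, b in zip(new_m, m)))
        m = new_m
        if shift < tol:
            break
    return m


def median_covariation_matrix(points, weights, center):
    """Geometric median of the flattened outer products (x - c)(x - c)^T.

    The Frobenius norm of a matrix equals the Euclidean norm of its
    flattened entries, so the vector routine above can be reused.
    """
    dim = len(center)
    flat = []
    for p in points:
        d = [a - c for a, c in zip(p, center)]
        flat.append([d[i] * d[j] for i in range(dim) for j in range(dim)])
    v = weighted_geometric_median(flat, weights)
    return [v[i * dim:(i + 1) * dim] for i in range(dim)]


# Toy data: four points on the unit square plus one gross outlier.
data = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [100.0, 100.0]]
resp = [1.0] * len(data)  # assumed E-step weights (all equal here)

center = weighted_geometric_median(data, resp)
mcm = median_covariation_matrix(data, resp, center)
mean = [sum(p[j] for p in data) / len(data) for j in range(2)]
```

On this toy data the weighted mean is pulled to (20.4, 20.4) by the single outlier, while the geometric median stays inside the unit square, which is the kind of behaviour a robust M-step is meant to provide.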

