Robust M-Estimation Based Bayesian Cluster Enumeration for Real Elliptically Symmetric Distributions

05/04/2020
by   Christian A. Schroth, et al.
0

Robustly determining the optimal number of clusters in a data set is an essential factor in a wide range of applications. Cluster enumeration becomes challenging when the true underlying structure in the observed data is corrupted by heavy-tailed noise and outliers. Recently, Bayesian cluster enumeration criteria have been derived by formulating cluster enumeration as maximization of the posterior probability of candidate models. This article generalizes robust Bayesian cluster enumeration so that it can be used with any arbitrary Real Elliptically Symmetric (RES) distributed mixture model. Our framework also covers the case of M-estimators that allow for mixture models, which are decoupled from a specific probability distribution. Examples of Huber's and Tukey's M-estimators are discussed. We derive a robust criterion for for data sets with finite sample size, and also provide an asymptotic approximation to reduce the computational cost at large sample sizes. The algorithms are applied to simulated and real-world data sets, including radar-based person identification, and show a significant robustness improvement in comparison to existing methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/29/2018

Robust Bayesian Cluster Enumeration

A major challenge in cluster analysis is that the number of data cluster...
research
06/30/2020

Real Elliptically Skewed Distributions and Their Application to Robust Cluster Analysis

This article proposes a new class of Real Elliptically Skewed (RESK) dis...
research
07/27/2019

Bayesian Robustness: A Nonasymptotic Viewpoint

We study the problem of robustly estimating the posterior distribution f...
research
12/29/2022

Robust Bayesian Subspace Identification for Small Data Sets

Model estimates obtained from traditional subspace identification method...
research
02/25/2020

Classical and Bayesian Analyses of a Mixture of Exponential and Lomax Distributions

The exponential and the Lomax distributions are widely used in life test...
research
10/22/2017

A Novel Bayesian Cluster Enumeration Criterion for Unsupervised Learning

The Bayesian Information Criterion (BIC) has been widely used for estima...
research
10/12/2020

Robust Finite Mixture Regression for Heterogeneous Targets

Finite Mixture Regression (FMR) refers to the mixture modeling scheme wh...

Please sign up or login with your details

Forgot password? Click here to reset