Using Subset Log-Likelihoods to Trim Outliers in Gaussian Mixture Models

07/02/2019
by   Katharine M. Clark, et al.
2

Mixtures of Gaussian distributions are a popular choice in model-based clustering. Outliers can affect parameters estimation and, as such, must be accounted for. Algorithms such as TCLUST discern the most likely outliers, but only when the proportion of outlying points is known a priori. It is proved that, for a finite Gaussian mixture model, the log-likelihoods of the subset models are beta-distributed. An algorithm is then proposed that predicts the proportion of outliers by measuring the adherence of a set of subset log-likelihoods to a beta reference distribution. This algorithm removes the least likely points, which are deemed outliers, until model assumptions are met.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/19/2020

Robust mixture regression with Exponential Power distribution

Assuming an exponential power distribution is one way to deal with outli...
research
08/28/2023

Some issues in robust clustering

Some key issues in robust clustering are discussed with focus on Gaussia...
research
10/23/2017

Standardization of multivariate Gaussian mixture models and background adjustment of PET images in brain oncology

Given observations from a multivariate Gaussian mixture model plus outli...
research
08/28/2018

DeepGUM: Learning Deep Robust Regression with a Gaussian-Uniform Mixture Model

In this paper, we address the problem of how to robustly train a ConvNet...
research
02/26/2014

Robust Asymmetric Clustering

Contaminated mixture models are developed for model-based clustering of ...
research
09/24/2017

On Fractionally-Supervised Classification: Weight Selection and Extension to the Multivariate t-Distribution

Recent work on fractionally-supervised classification (FSC), an approach...

Please sign up or login with your details

Forgot password? Click here to reset