Unsupervised Learning of Mixture Models with a Uniform Background Component

04/08/2018
by Sida Liu, et al.

Gaussian Mixture Models (GMMs) are among the most studied and mature models in unsupervised learning. However, outliers are often present in the data and can distort the cluster estimates. In this paper, we study a model in which the data come from a mixture of several Gaussians together with a uniform "background" component that is assumed to contain outliers and other uninteresting observations. We develop a novel method based on robust loss minimization that performs well in clustering data from such a GMM with a uniform background. We give theoretical guarantees that our clustering algorithm obtains the best clustering results with high probability. We also show that the result of our algorithm does not depend on initialization or local optima, and that parameter tuning is easy. Numerical simulations demonstrate that our algorithm achieves high accuracy and obtains the best clustering results given a large enough sample size. Finally, experimental comparisons with typical clustering methods on real datasets demonstrate the potential of our algorithm in real applications.
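
To make the data model in the abstract concrete, the following is a minimal Python sketch that draws samples from a mixture of Gaussians plus a uniform background. It illustrates only the generative model, not the authors' robust loss minimization algorithm. The function name `sample_gmm_with_background`, its parameters, the isotropic covariances, and the axis-aligned box used as the support of the background are all assumptions made for illustration.

```python
import numpy as np

def sample_gmm_with_background(n, means, sigmas, weights, bg_weight, low, high, seed=0):
    """Draw n points from isotropic Gaussians plus a uniform background.

    Hypothetical helper: the background is uniform on the box [low, high]^d,
    and `weights` (summing to 1) are the relative sizes of the Gaussians.
    Returns (X, labels); label -1 marks background points, label j >= 0
    marks points drawn from Gaussian component j.
    """
    rng = np.random.default_rng(seed)
    means = np.asarray(means, dtype=float)
    k, d = means.shape
    # Mixing proportions: background first, Gaussians share the remainder.
    probs = np.concatenate(([bg_weight], (1.0 - bg_weight) * np.asarray(weights, dtype=float)))
    comp = rng.choice(k + 1, size=n, p=probs) - 1  # shift so that -1 = background
    X = np.empty((n, d))
    is_bg = comp == -1
    X[is_bg] = rng.uniform(low, high, size=(int(is_bg.sum()), d))
    for j in range(k):
        idx = comp == j
        X[idx] = means[j] + sigmas[j] * rng.standard_normal((int(idx.sum()), d))
    return X, comp

# Example: two clusters in 2-D with roughly 30% background points.
X, labels = sample_gmm_with_background(
    1000, means=[[0, 0], [5, 5]], sigmas=[1.0, 0.5],
    weights=[0.5, 0.5], bg_weight=0.3, low=-10, high=10,
)
```

A dataset generated this way has known labels, so it can serve as a test bed for checking how well any clustering method separates the Gaussian clusters from the background.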

research
02/28/2023

Scalable Clustering: Large Scale Unsupervised Learning of Gaussian Mixture Models with Outliers

Clustering is a widely used technique with a long and rich history in a ...
research
02/05/2021

Vine copula mixture models and clustering for non-Gaussian data

The majority of finite mixture models suffer from not allowing asymmetri...
research
09/30/2022

Unsupervised Multi-task and Transfer Learning on Gaussian Mixture Models

Unsupervised learning has been widely used in many real-world applicatio...
research
08/28/2023

Some issues in robust clustering

Some key issues in robust clustering are discussed with focus on Gaussia...
research
09/19/2022

SMIXS: Novel efficient algorithm for non-parametric mixture regression-based clustering

We investigate a novel non-parametric regression-based clustering algori...
research
08/16/2019

Regression on imperfect class labels derived by unsupervised clustering

Outcome regressed on class labels identified by unsupervised clustering ...
research
12/27/2020

Generalized Categorisation of Digital Pathology Whole Image Slides using Unsupervised Learning

This project aims to break down large pathology images into small tiles ...
