EM Algorithms for Weighted-Data Clustering with Application to Audio-Visual Scene Analysis

09/04/2015
by   Israel D. Gebru, et al.
0

Data clustering has received a lot of attention and numerous methods, algorithms and software packages are available. Among these techniques, parametric finite-mixture models play a central role due to their interesting mathematical properties and to the existence of maximum-likelihood estimators based on expectation-maximization (EM). In this paper we propose a new mixture model that associates a weight with each observed point. We introduce the weighted-data Gaussian mixture and we derive two EM algorithms. The first one considers a fixed weight for each observation. The second one treats each weight as a random variable following a gamma distribution. We propose a model selection method based on a minimum message length criterion, provide a weight initialization strategy, and validate the proposed algorithms by comparing them with several state of the art parametric and non-parametric clustering techniques. We also demonstrate the effectiveness and robustness of the proposed clustering technique in the presence of heterogeneous data, namely audio-visual scene analysis.

READ FULL TEXT

page 10

page 12

page 14

research
07/03/2013

An Efficient Model Selection for Gaussian Mixture Model in a Bayesian Framework

In order to cluster or partition data, we often use Expectation-and-Maxi...
research
03/23/2012

k-MLE: A fast algorithm for learning statistical mixture models

We describe k-MLE, a fast and efficient local search algorithm for learn...
research
03/30/2020

Spectral graph clustering via the Expectation-Solution algorithm

The stochastic blockmodel (SBM) models the connectivity within and betwe...
research
12/09/2020

Conjugate Mixture Models for Clustering Multimodal Data

The problem of multimodal clustering arises whenever the data are gather...
research
01/06/2023

Non-parametric Multi-Partitions Clustering

In the framework of model-based clustering, a model, called multi-partit...
research
02/08/2017

Clustering For Point Pattern Data

Clustering is one of the most common unsupervised learning tasks in mach...
research
06/22/2023

Feature screening for clustering analysis

In this paper, we consider feature screening for ultrahigh dimensional c...

Please sign up or login with your details

Forgot password? Click here to reset