k-Means is a Variational EM Approximation of Gaussian Mixture Models

04/16/2017
by   Jörg Lücke, et al.
0

We show that k-means (Lloyd's algorithm) is equivalent to a variational EM approximation of a Gaussian Mixture Model (GMM) with isotropic Gaussians. The k-means algorithm is obtained if truncated posteriors are used as variational distributions. In contrast to the standard way to relate k-means and GMMs, we show that it is not required to consider the limit case of Gaussians with zero variance. There are a number of consequences following from our observation: (A) k-means can be shown to monotonously increase the free-energy associated with truncated distributions; (B) Using the free-energy, we can derive an explicit and compact formula of a lower GMM likelihood bound which uses the k-means objective as argument; (C) We can generalize k-means using truncated variational EM, and relate such generalizations to other k-means-like algorithms. In general, truncated variational EM provides a natural and quantitative link between k-means-like clustering and GMM clustering algorithms which may be very relevant for future theoretical as well as empirical studies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/09/2017

Can clustering scale sublinearly with its clusters? A variational EM acceleration of GMMs and k-means

One iteration of k-means or EM for Gaussian mixture models (GMMs) scales...
research
10/10/2016

Truncated Variational Expectation Maximization

We derive a novel variational expectation maximization approach based on...
research
10/01/2018

Accelerated Training of Large-Scale Gaussian Mixtures by a Merger of Sublinear Approaches

We combine two recent lines of research on sublinear clustering to signi...
research
06/28/2020

Breathing k-Means

We propose a new algorithm for the k-means problem which repeatedly incr...
research
07/28/2017

An Open Source C++ Implementation of Multi-Threaded Gaussian Mixture Models, k-Means and Expectation Maximisation

Modelling of multivariate densities is a core component in many signal p...
research
11/15/2012

A Truncated EM Approach for Spike-and-Slab Sparse Coding

We study inference and learning based on a sparse coding model with `spi...
research
10/23/2020

From the Expectation Maximisation Algorithm to Autoencoded Variational Bayes

Although the expectation maximisation (EM) algorithm was introduced in 1...

Please sign up or login with your details

Forgot password? Click here to reset