From the EM Algorithm to the CM-EM Algorithm for Global Convergence of Mixture Models

10/26/2018
by Chenguang Lu, et al.

The Expectation-Maximization (EM) algorithm for mixture models often converges slowly or to an invalid solution. The popular convergence proof affirms that the likelihood increases with Q, and that Q increases in the M-step and does not decrease in the E-step. The author found that (1) Q may and should decrease in some E-steps; (2) the Shannon channel from the E-step is improper, and hence the expectation is improper. The author proposes the CM-EM algorithm (CM means Channel's Matching), which adds a step that optimizes the mixture ratios to obtain a proper Shannon channel, and which maximizes G, the average log-normalized-likelihood, in the M-step. Neal and Hinton's Maximization-Maximization (MM) algorithm uses F instead of Q to speed up convergence. Maximizing G is similar to maximizing F. The new convergence proof is similar to Beal's proof with the variational method. It first proves that the minimum relative entropy equals the minimum of R-G (R is the mutual information), then uses the variational and iterative methods that Shannon et al. used for rate-distortion functions to prove global convergence. Some examples show that Q and F should and may decrease in some E-steps. For the same example, the EM, MM, and CM-EM algorithms need about 36, 18, and 9 iterations, respectively.
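For readers who want a concrete baseline, the following is a minimal sketch of the standard EM iteration for a one-dimensional Gaussian mixture, with the observed-data log-likelihood recorded after every iteration. It is not the CM-EM or MM algorithm from the paper. The function names (`em_step`, `run_em`, `sample_data`), the two-component synthetic data, and the `min_sigma` floor are all assumptions made for illustration.

```python
import math
import random


def normal_pdf(x, mu, sigma):
    """Density of N(mu, sigma^2) at x."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def log_likelihood(xs, weights, mus, sigmas):
    """Observed-data log-likelihood of the mixture."""
    return sum(
        math.log(sum(w * normal_pdf(x, m, s)
                     for w, m, s in zip(weights, mus, sigmas)))
        for x in xs
    )


def em_step(xs, weights, mus, sigmas, min_sigma=1e-3):
    """One standard EM iteration (E-step followed by M-step)."""
    # E-step: posterior probability of each component for each sample.
    resp = []
    for x in xs:
        joint = [w * normal_pdf(x, m, s)
                 for w, m, s in zip(weights, mus, sigmas)]
        total = sum(joint)
        resp.append([j / total for j in joint])

    # M-step: re-estimate mixture ratios, means, and standard deviations.
    n = len(xs)
    new_w, new_m, new_s = [], [], []
    for k in range(len(weights)):
        nk = sum(r[k] for r in resp)
        mu = sum(r[k] * x for r, x in zip(resp, xs)) / nk
        var = sum(r[k] * (x - mu) ** 2 for r, x in zip(resp, xs)) / nk
        new_w.append(nk / n)
        new_m.append(mu)
        # Hypothetical floor to avoid a degenerate zero-width component.
        new_s.append(max(math.sqrt(var), min_sigma))
    return new_w, new_m, new_s


def run_em(xs, weights, mus, sigmas, iterations=50):
    """Run EM for a fixed number of iterations; return parameters and history."""
    history = [log_likelihood(xs, weights, mus, sigmas)]
    for _ in range(iterations):
        weights, mus, sigmas = em_step(xs, weights, mus, sigmas)
        history.append(log_likelihood(xs, weights, mus, sigmas))
    return weights, mus, sigmas, history


def sample_data(seed=0, n=400):
    """Synthetic data (assumed): 70% from N(0, 1), 30% from N(5, 1)."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) if rng.random() < 0.7 else rng.gauss(5.0, 1.0)
            for _ in range(n)]


if __name__ == "__main__":
    xs = sample_data()
    w, m, s, hist = run_em(xs, [0.5, 0.5], [1.0, 4.0], [1.0, 1.0], iterations=30)
    print("mixture ratios:", w)
    print("means:", m)
    print("standard deviations:", s)
    print("log-likelihood, first and last:", hist[0], hist[-1])
```

The sketch tracks only the log-likelihood, which standard EM does not decrease from one iteration to the next. The quantities Q, F, and G discussed in the abstract are not computed here, since their exact definitions are given in the full text rather than on this page.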

research
04/21/2021

Understanding and Accelerating EM Algorithm's Convergence by Fair Competition Principle and Rate-Verisimilitude Function

Why can the Expectation-Maximization (EM) algorithm for mixture models c...
research
01/12/2018

Noisy Expectation-Maximization: Applications and Generalizations

We present a noise-injected version of the Expectation-Maximization (EM)...
research
07/25/2020

Fair Marriage Principle and Initialization Map for the EM Algorithm

The popular convergence theory of the EM algorithm explains that the obs...
research
10/26/2018

SQUAREM: An R Package for Off-the-Shelf Acceleration of EM, MM and Other EM-like Monotone Algorithms

We discuss R package SQUAREM for accelerating iterative algorithms which...
research
12/25/2013

Classification automatique de données temporelles en classes ordonnées

This paper proposes a method of segmenting temporal data into ordered cl...
research
01/09/2019

Beyond the EM Algorithm: Constrained Optimization Methods for Latent Class Model

Latent class model (LCM), which is a finite mixture of different categor...
research
06/20/2018

An Asynchronous Distributed Expectation Maximization Algorithm For Massive Data: The DEM Algorithm

The family of Expectation-Maximization (EM) algorithms provides a genera...
