Stochastic First-Order Learning for Large-Scale Flexibly Tied Gaussian Mixture Model

12/11/2022
by   Mohammad Pasande, et al.

Gaussian Mixture Models (GMMs) are among the most potent parametric density estimators based on the kernel model and find application in many scientific domains. In recent years, with the dramatic enlargement of data sources, typical machine learning algorithms, e.g. Expectation Maximization (EM), encounter difficulty with high-dimensional and streaming data. Moreover, complicated densities often demand a large number of Gaussian components. This paper proposes a fast online parameter estimation algorithm for GMMs based on first-order stochastic optimization. By leveraging the flexibly tied factorization of the covariance matrix, this approach provides a framework for coping with the challenges GMMs face on high-dimensional streaming data and complex densities. A new stochastic manifold optimization algorithm that preserves orthogonality is introduced and used alongside well-known numerical optimization in Euclidean space. Numerous empirical results on both synthetic and real datasets justify the effectiveness of the proposed stochastic method over EM-based methods, in the sense of a better converged maximum of the likelihood function, fewer epochs needed for convergence, and less time per epoch.
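To make the idea of an orthogonality-preserving stochastic update concrete, the following is a minimal sketch of a generic Riemannian gradient step on the orthogonal group with a QR retraction. It is not the algorithm proposed in the paper, whose details are not given in the abstract; the function name `riemannian_sgd_step`, its parameters, and the choice of QR retraction are assumptions made purely for illustration.

```python
import numpy as np

def riemannian_sgd_step(U, euclid_grad, lr):
    """One illustrative gradient step that keeps U orthogonal.

    U           : (d, d) orthogonal matrix (hypothetical shared factor)
    euclid_grad : (d, d) Euclidean gradient of a mini-batch loss w.r.t. U
    lr          : step size
    """
    # Project the Euclidean gradient onto the tangent space at U.
    # Tangent vectors have the form U @ A with A skew-symmetric.
    A = U.T @ euclid_grad
    riem_grad = U @ (A - A.T) / 2.0

    # Take the step in the ambient space, then retract back onto the
    # manifold with a QR decomposition (an assumed choice of retraction).
    Q, R = np.linalg.qr(U - lr * riem_grad)

    # Flip column signs so that diag(R) is positive, which makes the
    # retraction unique and continuous in its input.
    signs = np.sign(np.diag(R))
    signs[signs == 0] = 1.0
    return Q * signs
```

In a stochastic setting, `euclid_grad` would come from the negative log-likelihood of a mini-batch, while unconstrained parameters would be updated by an ordinary Euclidean optimizer; how the paper splits its parameters between the two is not specified here.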

research
06/10/2017

An Alternative to EM for Gaussian Mixture Models: Batch and Stochastic Riemannian Optimization

We consider maximum likelihood estimation for Gaussian Mixture Models (G...
research
06/25/2015

Manifold Optimization for Gaussian Mixture Models

We take a new look at parameter estimation for Gaussian Mixture Models (...
research
08/26/2023

Large-scale gradient-based training of Mixtures of Factor Analyzers

Gaussian Mixture Models (GMMs) are a standard tool in data analysis. How...
research
03/10/2019

One-Pass Sparsified Gaussian Mixtures

We present a one-pass sparsified Gaussian mixture model (SGMM). Given P-...
research
09/19/2016

Online and Distributed learning of Gaussian mixture models by Bayesian Moment Matching

The Gaussian mixture model is a classic technique for clustering and dat...
research
08/29/2023

Bridging Distribution Learning and Image Clustering in High-dimensional Space

Distribution learning focuses on learning the probability density functi...
research
04/22/2018

Sparse Travel Time Estimation from Streaming Data

We address two shortcomings in online travel time estimation methods for...
