Large-scale gradient-based training of Mixtures of Factor Analyzers

08/26/2023
by   Alexander Gepperth, et al.
0

Gaussian Mixture Models (GMMs) are a standard tool in data analysis. However, they face problems when applied to high-dimensional data (e.g., images) due to the size of the required full covariance matrices (CMs), whereas the use of diagonal or spherical CMs often imposes restrictions that are too severe. The Mixture of Factor analyzers (MFA) model is an important extension of GMMs, which allows to smoothly interpolate between diagonal and full CMs based on the number of factor loadings l. MFA has successfully been applied for modeling high-dimensional image data. This article contributes both a theoretical analysis as well as a new method for efficient high-dimensional MFA training by stochastic gradient descent, starting from random centroid initializations. This greatly simplifies the training and initialization process, and avoids problems of batch-type algorithms such Expectation-Maximization (EM) when training with huge amounts of data. In addition, by exploiting the properties of the matrix determinant lemma, we prove that MFA training and inference/sampling can be performed based on precision matrices, which does not require matrix inversions after training is completed. At training time, the methods requires the inversion of l× l matrices only. Besides the theoretical analysis and proofs, we apply MFA to typical image datasets such as SVHN and MNIST, and demonstrate the ability to perform sample generation and outlier detection.

READ FULL TEXT
research
12/18/2019

Gradient-based training of Gaussian Mixture Models in High-Dimensional Spaces

We present an approach for efficiently training Gaussian Mixture Models ...
research
12/11/2022

Stochastic First-Order Learning for Large-Scale Flexibly Tied Gaussian Mixture Model

Gaussian Mixture Models (GMM) are one of the most potent parametric dens...
research
07/21/2023

Statistical analysis for a penalized EM algorithm in high-dimensional mixture linear regression model

The expectation-maximization (EM) algorithm and its variants are widely ...
research
07/27/2019

A Matrix–free Likelihood Method for Exploratory Factor Analysis of High-dimensional Gaussian Data

This paper proposes a novel profile likelihood method for estimating the...
research
07/03/2023

Learning Mixtures of Gaussians Using the DDPM Objective

Recent works have shown that diffusion models can learn essentially any ...
research
11/22/2017

SNeCT: Scalable network constrained Tucker decomposition for integrative multi-platform data analysis

Motivation: How do we integratively analyze large-scale multi-platform g...
research
05/25/2023

High-dimensional Response Growth Curve Modeling for Longitudinal Neuroimaging Analysis

There is increasing interest in modeling high-dimensional longitudinal o...

Please sign up or login with your details

Forgot password? Click here to reset