A Bayesian Fisher-EM algorithm for discriminative Gaussian subspace clustering

12/08/2020
by   Nicolas Jouvin, et al.
0

High-dimensional data clustering has become and remains a challenging task for modern statistics and machine learning, with a wide range of applications. We consider in this work the powerful discriminative latent mixture model, and we extend it to the Bayesian framework. Modeling data as a mixture of Gaussians in a low-dimensional discriminative subspace, a Gaussian prior distribution is introduced over the latent group means and a family of twelve submodels are derived considering different covariance structures. Model inference is done with a variational EM algorithm, while the discriminative subspace is estimated via a Fisher-step maximizing an unsupervised Fisher criterion. An empirical Bayes procedure is proposed for the estimation of the prior hyper-parameters, and an integrated classification likelihood criterion is derived for selecting both the number of clusters and the submodel. The performances of the resulting Bayesian Fisher-EM algorithm are investigated in two thorough simulated scenarios, regarding both dimensionality as well as noise and assessing its superiority with respect to state-of-the-art Gaussian subspace clustering models. In addition to standard real data benchmarks, an application to single image denoising is proposed, displaying relevant results. This work comes with a reference implementation for the R software in the FisherEM package accompanying the paper.

READ FULL TEXT

page 16

page 18

page 19

page 20

page 24

page 26

research
01/12/2011

Simultaneous model-based clustering and visualization in the Fisher discriminative subspace

Clustering in high-dimensional spaces is nowadays a recurrent problem in...
research
01/07/2014

Key point selection and clustering of swimmer coordination through Sparse Fisher-EM

To answer the existence of optimal swimmer learning/teaching strategies,...
research
12/25/2013

Classification automatique de données temporelles en classes ordonnées

This paper proposes a method of segmenting temporal data into ordered cl...
research
11/16/2020

Cluster-Specific Predictions with Multi-Task Gaussian Processes

A model involving Gaussian processes (GPs) is introduced to simultaneous...
research
12/30/2022

Mixture of von Mises-Fisher distribution with sparse prototypes

Mixtures of von Mises-Fisher distributions can be used to cluster data o...
research
12/02/2018

GAN-EM: GAN based EM learning framework

Expectation maximization (EM) algorithm is to find maximum likelihood so...
research
09/03/2023

Probabilistic Reduced-Dimensional Vector Autoregressive Modeling for Dynamics Prediction and Reconstruction with Oblique Projections

In this paper, we propose a probabilistic reduced-dimensional vector aut...

Please sign up or login with your details

Forgot password? Click here to reset