Mixture of von Mises-Fisher distribution with sparse prototypes

12/30/2022
by   Fabrice Rossi, et al.
0

Mixtures of von Mises-Fisher distributions can be used to cluster data on the unit hypersphere. This is particularly adapted for high-dimensional directional data such as texts. We propose in this article to estimate a von Mises mixture using a l 1 penalized likelihood. This leads to sparse prototypes that improve clustering interpretability. We introduce an expectation-maximisation (EM) algorithm for this estimation and explore the trade-off between the sparsity term and the likelihood one with a path following algorithm. The model's behaviour is studied on simulated data and, we show the advantages of the approach on real data benchmark. We also introduce a new data set on financial reports and exhibit the benefits of our method for exploratory analysis.

READ FULL TEXT

page 22

page 27

page 28

page 38

research
09/07/2020

Penalized Maximum Likelihood Estimator for Mixture of von Mises-Fisher Distributions

The von Mises-Fisher distribution is one of the most widely used probabi...
research
03/15/2015

Statistical Estimation and Clustering of Group-invariant Orientation Parameters

We treat the problem of estimation of orientation parameters whose value...
research
02/10/2021

On PyTorch Implementation of Density Estimators for von Mises-Fisher and Its Mixture

The von Mises-Fisher (vMF) is a well-known density model for directional...
research
01/07/2014

Key point selection and clustering of swimmer coordination through Sparse Fisher-EM

To answer the existence of optimal swimmer learning/teaching strategies,...
research
08/16/2017

A Generalised Directional Laplacian Distribution: Estimation, Mixture Models and Audio Source Separation

Directional or Circular statistics are pertaining to the analysis and in...
research
12/08/2020

A Bayesian Fisher-EM algorithm for discriminative Gaussian subspace clustering

High-dimensional data clustering has become and remains a challenging ta...
research
02/27/2015

Minimum message length estimation of mixtures of multivariate Gaussian and von Mises-Fisher distributions

Mixture modelling involves explaining some observed evidence using a com...

Please sign up or login with your details

Forgot password? Click here to reset