Repulsive Mixture Models of Exponential Family PCA for Clustering

04/07/2020
by   Maoying Qiao, et al.
0

The mixture extension of exponential family principal component analysis (EPCA) was designed to encode much more structural information about data distribution than the traditional EPCA does. For example, due to the linearity of EPCA's essential form, nonlinear cluster structures cannot be easily handled, but they are explicitly modeled by the mixing extensions. However, the traditional mixture of local EPCAs has the problem of model redundancy, i.e., overlaps among mixing components, which may cause ambiguity for data clustering. To alleviate this problem, in this paper, a repulsiveness-encouraging prior is introduced among mixing components and a diversified EPCA mixture (DEPCAM) model is developed in the Bayesian framework. Specifically, a determinantal point process (DPP) is exploited as a diversity-encouraging prior distribution over the joint local EPCAs. As required, a matrix-valued measure for L-ensemble kernel is designed, within which, ℓ_1 constraints are imposed to facilitate selecting effective PCs of local EPCAs, and angular based similarity measure are proposed. An efficient variational EM algorithm is derived to perform parameter learning and hidden variable inference. Experimental results on both synthetic and real-world datasets confirm the effectiveness of the proposed method in terms of model parsimony and generalization ability on unseen test data.

READ FULL TEXT

page 1

page 9

page 10

page 12

page 14

research
01/07/2016

Mixture of Bilateral-Projection Two-dimensional Probabilistic Principal Component Analysis

The probabilistic principal component analysis (PPCA) is built upon a gl...
research
01/16/2013

Model Selection for Gaussian Mixture Models

This paper is concerned with an important issue in finite mixture modell...
research
01/06/2019

Learning Nonlinear Mixtures: Identifiability and Algorithm

Linear mixture models have proven very useful in a plethora of applicati...
research
01/31/2017

A Dirichlet Mixture Model of Hawkes Processes for Event Sequence Clustering

We propose an effective method to solve the event sequence clustering pr...
research
10/04/2021

Row-clustering of a Point Process-valued Matrix

Structured point process data harvested from various platforms poses new...
research
11/23/2017

Diversity-Promoting Bayesian Learning of Latent Variable Models

To address three important issues involved in latent variable models (LV...
research
11/30/2019

Dis-entangling Mixture of Interventions on a Causal Bayesian Network Using Aggregate Observations

We study the problem of separating a mixture of distributions, all of wh...

Please sign up or login with your details

Forgot password? Click here to reset