Clustering by Low-Rank Doubly Stochastic Matrix Decomposition

06/18/2012
by Zhirong Yang, et al.

Cluster analysis by nonnegative low-rank approximation has made remarkable progress in the past decade. However, most approximation approaches in this direction are still restricted to matrix factorization. We propose a new low-rank learning method that goes beyond matrix factorization to improve clustering performance. The approximation is based on a two-step bipartite random walk through virtual cluster nodes, so that it is formed from the cluster assignment probabilities alone. Minimizing the approximation error, measured by the Kullback-Leibler divergence, is equivalent to maximizing the likelihood of a discriminative model, which gives our method a solid probabilistic interpretation. The optimization is implemented by a relaxed Majorization-Minimization algorithm that is advantageous in finding good local minima. Furthermore, we point out that the regularized algorithm with a Dirichlet prior serves only as initialization. Experimental results show that the new method achieves strong clustering purity on various datasets, especially on large-scale manifold data.
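
To make the two-step random walk concrete, here is a minimal NumPy sketch. The abstract does not state the formula, so the form used below is an assumption: with a row-stochastic matrix W holding the cluster assignment probabilities P(k|i) and a uniform prior over data points, one step goes from point i to cluster k and the second step returns from cluster k to point j, giving Â_ij = Σ_k W_ik W_jk / Σ_v W_vk. The function names and the generalized KL objective are illustrative choices, not the authors' implementation.

```python
import numpy as np

def two_step_walk_approximation(W):
    """Low-rank approximation from cluster assignment probabilities.

    W is an (n, r) row-stochastic matrix, W[i, k] = P(cluster k | point i).
    Assumes a uniform prior over data points (an assumption of this sketch).
    """
    W = np.asarray(W, dtype=float)
    col_mass = W.sum(axis=0)        # total assignment mass per cluster
    return (W / col_mass) @ W.T     # A_hat[i, j] = sum_k W[i,k] W[j,k] / col_mass[k]

def generalized_kl(A, A_hat, eps=1e-12):
    """Generalized KL divergence D(A || A_hat) over nonnegative matrices."""
    A = np.asarray(A, dtype=float)
    A_hat = np.asarray(A_hat, dtype=float)
    mask = A > 0                    # 0 * log(0) is treated as 0
    log_term = np.sum(A[mask] * np.log(A[mask] / (A_hat[mask] + eps)))
    return float(log_term - A.sum() + A_hat.sum())

# Hypothetical toy input: 6 points, 2 clusters, random soft assignments.
rng = np.random.default_rng(0)
W = rng.random((6, 2))
W /= W.sum(axis=1, keepdims=True)   # make each row a probability distribution

A_hat = two_step_walk_approximation(W)
row_sums = A_hat.sum(axis=1)
col_sums = A_hat.sum(axis=0)
self_divergence = generalized_kl(A_hat, A_hat)
```

Under these assumptions the approximation is symmetric and both its rows and columns sum to one, which is where the "doubly stochastic" in the title comes from; its rank is at most the number of clusters.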

research
12/23/2015

k-Means Clustering Is Matrix Factorization

We show that the objective function of conventional k-means clustering c...
research
08/15/2023

Nonnegative matrix factorization for coherent set identification by direct low rank maximum likelihood estimation

We analyze connections between two low rank modeling approaches from the...
research
08/06/2018

Regularized matrix data clustering and its application to image analysis

In this paper, we propose a regularized mixture probabilistic model to c...
research
05/29/2023

Statistically Optimal K-means Clustering via Nonnegative Low-rank Semidefinite Programming

K-means clustering is a widely used machine learning method for identify...
research
01/26/2022

Sketching for low-rank nonnegative matrix approximation: a numerical study

We propose new approximate alternating projection methods, based on rand...
research
02/08/2020

Supervised Quantile Normalization for Low-rank Matrix Approximation

Low rank matrix factorization is a fundamental building block in machine...
research
03/02/2021

Self-supervised Symmetric Nonnegative Matrix Factorization

Symmetric nonnegative matrix factorization (SNMF) has demonstrated to be...
