Deep Embedded K-Means Clustering

09/30/2021
by   Wengang Guo, et al.
7

Recently, deep clustering methods have gained momentum because of the high representational power of deep neural networks (DNNs) such as autoencoder. The key idea is that representation learning and clustering can reinforce each other: Good representations lead to good clustering while good clustering provides good supervisory signals to representation learning. Critical questions include: 1) How to optimize representation learning and clustering? 2) Should the reconstruction loss of autoencoder be considered always? In this paper, we propose DEKM (for Deep Embedded K-Means) to answer these two questions. Since the embedding space generated by autoencoder may have no obvious cluster structures, we propose to further transform the embedding space to a new space that reveals the cluster-structure information. This is achieved by an orthonormal transformation matrix, which contains the eigenvectors of the within-class scatter matrix of K-means. The eigenvalues indicate the importance of the eigenvectors' contributions to the cluster-structure information in the new space. Our goal is to increase the cluster-structure information. To this end, we discard the decoder and propose a greedy method to optimize the representation. Representation learning and clustering are alternately optimized by DEKM. Experimental results on the real-world datasets demonstrate that DEKM achieves state-of-the-art performance.

READ FULL TEXT

page 1

page 6

page 8

research
11/23/2021

Exploring Non-Contrastive Representation Learning for Deep Clustering

Existing deep clustering methods rely on contrastive learning for repres...
research
12/06/2018

RDEC: Integrating Regularization into Deep Embedded Clustering for Imbalanced Datasets

Clustering is a fundamental machine learning task and can be used in man...
research
01/28/2019

Hierarchically Clustered Representation Learning

The joint optimization of representation learning and clustering in the ...
research
02/02/2021

Deep Autoencoder-based Fuzzy C-Means for Topic Detection

Topic detection is a process for determining topics from a collection of...
research
12/02/2022

Improved Representation Learning Through Tensorized Autoencoders

The central question in representation learning is what constitutes a go...
research
10/30/2019

Meta-Learning to Cluster

Clustering is one of the most fundamental and wide-spread techniques in ...
research
10/03/2021

Information Elicitation Meets Clustering

In the setting where we want to aggregate people's subjective evaluation...

Please sign up or login with your details

Forgot password? Click here to reset