Mixing Consistent Deep Clustering

11/03/2020
by Daniel Lutscher, et al.

Finding well-defined clusters in data is a fundamental challenge for many data-driven applications, and success depends largely on a good data representation. The representation learning literature suggests that one key characteristic of good latent representations is the ability to produce semantically mixed outputs when decoding linear interpolations of two latent representations. We propose the Mixing Consistent Deep Clustering method, which encourages interpolations to appear realistic while adding the constraint that an interpolation of two data points must look like one of the two inputs. We applied this training method to several autoencoder models, both clustering-specific and general-purpose. It systematically changed the structure of the representations the models learned, and it improved clustering performance for the tested ACAI, IDEC, and VAE models on the MNIST, SVHN, and CIFAR-10 datasets. These outcomes have practical implications for numerous real-world clustering tasks, as they show that the proposed method can be added to existing autoencoders to further improve clustering performance.
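The interpolation constraint described in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation: the abstract gives no loss formulation, so the choice of target (the input with the larger mixing weight), the mean squared error, and the toy linear `encode` and `decode` functions are all assumptions made for illustration.

```python
import numpy as np


def interpolate_latents(z1, z2, alpha):
    """Linear interpolation of two latent codes: alpha * z1 + (1 - alpha) * z2."""
    return alpha * z1 + (1.0 - alpha) * z2


def mixing_consistency_loss(decoded_mix, x1, x2, alpha):
    """Mean squared error between a decoded interpolation and one input.

    Assumption (not stated in the abstract): the target is the input
    that carries the larger mixing weight.
    """
    target = x1 if alpha >= 0.5 else x2
    return float(np.mean((decoded_mix - target) ** 2))


# Hypothetical stand-ins for a trained autoencoder: a random linear
# encoder and its transpose as decoder.
rng = np.random.default_rng(0)
W = rng.normal(size=(4, 2))


def encode(x):
    return x @ W  # maps a 4-d input to a 2-d latent code


def decode(z):
    return z @ W.T  # maps a 2-d latent code back to 4-d


x1 = rng.normal(size=4)
x2 = rng.normal(size=4)

alpha = 0.8
z_mix = interpolate_latents(encode(x1), encode(x2), alpha)
loss = mixing_consistency_loss(decode(z_mix), x1, x2, alpha)
```

In a training setup, a term like `loss` would presumably be added to the autoencoder's existing objective, which is how the method could be attached to models such as ACAI, IDEC, or a VAE.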

