Variational Clustering: Leveraging Variational Autoencoders for Image Clustering

05/10/2020
by   Vignesh Prasad, et al.
42

Recent advances in deep learning have shown their ability to learn strong feature representations for images. The task of image clustering naturally requires good feature representations to capture the distribution of the data and subsequently differentiate data points from one another. Often these two aspects are dealt with independently and thus traditional feature learning alone does not suffice in partitioning the data meaningfully. Variational Autoencoders (VAEs) naturally lend themselves to learning data distributions in a latent space. Since we wish to efficiently discriminate between different clusters in the data, we propose a method based on VAEs where we use a Gaussian Mixture prior to help cluster the images accurately. We jointly learn the parameters of both the prior and the posterior distributions. Our method represents a true Gaussian Mixture VAE. This way, our method simultaneously learns a prior that captures the latent distribution of the images and a posterior to help discriminate well between data points. We also propose a novel reparametrization of the latent space consisting of a mixture of discrete and continuous variables. One key takeaway is that our method generalizes better across different datasets without using any pre-training or learnt models, unlike existing methods, allowing it to be trained from scratch in an end-to-end manner. We verify our efficacy and generalizability experimentally by achieving state-of-the-art results among unsupervised methods on a variety of datasets. To the best of our knowledge, we are the first to pursue image clustering using VAEs in a purely unsupervised manner on real image datasets.

READ FULL TEXT

page 1

page 8

page 9

research
06/09/2021

Multi-Facet Clustering Variational Autoencoders

Work in deep clustering focuses on finding a single partition of data. H...
research
09/22/2021

Deep Variational Clustering Framework for Self-labeling of Large-scale Medical Images

We propose a Deep Variational Clustering (DVC) framework for unsupervise...
research
07/25/2021

Invariance-based Multi-Clustering of Latent Space Embeddings for Equivariant Learning

Variational Autoencoders (VAEs) have been shown to be remarkably effecti...
research
11/08/2016

Deep Unsupervised Clustering with Gaussian Mixture Variational Autoencoders

We study a variant of the variational autoencoder model (VAE) with a Gau...
research
06/19/2020

Deep Transformation-Invariant Clustering

Recent advances in image clustering typically focus on learning better d...
research
12/05/2019

Multi-Modal Deep Clustering: Unsupervised Partitioning of Images

The clustering of unlabeled raw images is a daunting task, which has rec...
research
02/28/2017

Learning Discrete Representations via Information Maximizing Self-Augmented Training

Learning discrete representations of data is a central machine learning ...

Please sign up or login with your details

Forgot password? Click here to reset