Augmented Data as an Auxiliary Plug-in Towards Categorization of Crowdsourced Heritage Data

In this paper, we propose a strategy to mitigate the problem of inefficient clustering performance by introducing data augmentation as an auxiliary plug-in. Classical clustering techniques such as K-means, Gaussian mixture model and spectral clustering are central to many data-driven applications. However, recently unsupervised simultaneous feature learning and clustering using neural networks also known as Deep Embedded Clustering (DEC) has gained prominence. Pioneering works on deep feature clustering focus on defining relevant clustering loss function and choosing the right neural network for extracting features. A central problem in all these cases is data sparsity accompanied by high intra-class and low inter-class variance, which subsequently leads to poor clustering performance and erroneous candidate assignments. Towards this, we employ data augmentation techniques to improve the density of the clusters, thus improving the overall performance. We train a variant of Convolutional Autoencoder (CAE) with augmented data to construct the initial feature space as a novel model for deep clustering. We demonstrate the results of proposed strategy on crowdsourced Indian Heritage dataset. Extensive experiments show consistent improvements over existing works.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/11/2018

Deep Density-based Image Clustering

Recently, deep clustering, which is able to perform feature learning tha...
research
08/19/2022

Predicting Exotic Hadron Masses with Data Augmentation Using Multilayer Perceptron

Recently, there have been significant developments in neural networks; t...
research
01/08/2019

Spectral Clustering via Ensemble Deep Autoencoder Learning (SC-EDAE)

Recently, a number of works have studied clustering strategies that comb...
research
08/07/2020

Deep Robust Clustering by Contrastive Learning

Recently, many unsupervised deep learning methods have been proposed to ...
research
03/29/2023

Hard Regularization to Prevent Collapse in Online Deep Clustering without Data Augmentation

Online deep clustering refers to the joint use of a feature extraction n...
research
12/05/2022

Clustering with Neural Network and Index

A new model called Clustering with Neural Network and Index (CNNI) is in...

Please sign up or login with your details

Forgot password? Click here to reset