Geometrical Homogeneous Clustering for Image Data Reduction

08/27/2022
by Shril Mody, et al.

In this paper, we present novel variations of an earlier approach, the homogeneous clustering algorithm, for reducing dataset size. The intuition behind the proposed approaches is to partition the dataset into homogeneous clusters and select the images that contribute significantly to accuracy. The selected images are a proper subset of the training data and are therefore human-readable. We propose four variations of the baseline algorithm, RHC. The intuition behind the first approach, RHCKON, is that boundary points contribute significantly to the representation of a cluster. It selects the k farthest neighbours and the one nearest neighbour of each cluster's centroid. In the next two approaches (KONCW and CWKC), we introduce the concept of cluster weights, based on the observation that larger clusters contribute more than smaller ones. The final variation is GHCIDR, which selects points based on the geometrical aspect of the data distribution. We performed the experiments on two deep learning models: Fully Connected Networks (FCN) and VGG1. We experimented with the four variants on three datasets: MNIST, CIFAR10, and Fashion-MNIST. We found that GHCIDR gave the best accuracy of 99.35 of 87.27 respectively.
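The RHCKON selection step described in the abstract (k farthest points plus the one nearest point to a cluster's centroid) might be sketched as follows. This is only an illustrative sketch, not the authors' implementation: the function name `select_k_farthest_one_nearest` is hypothetical, Euclidean distance is assumed, each image is assumed to be flattened into a vector, and the cluster is assumed to have already been produced by some homogeneous clustering step.

```python
import numpy as np


def select_k_farthest_one_nearest(cluster_points, k):
    """Return indices of the 1 nearest and k farthest points to the centroid.

    Hypothetical helper: assumes Euclidean distance and that
    `cluster_points` holds one homogeneous cluster, one row per image.
    """
    points = np.asarray(cluster_points, dtype=float)
    centroid = points.mean(axis=0)

    # Distance of every point in the cluster to the cluster centroid.
    distances = np.linalg.norm(points - centroid, axis=1)

    # Indices sorted from nearest to farthest (stable for ties).
    order = np.argsort(distances, kind="stable")

    nearest = order[:1]
    # Take up to k farthest points from those that remain,
    # so the nearest point is never selected twice.
    farthest = order[1:][::-1][:k]
    return np.concatenate([nearest, farthest])


# Toy cluster of five 2-D points standing in for flattened images.
demo_cluster = np.array(
    [[0.0, 0.0], [1.0, 0.0], [-1.0, 0.0], [0.0, 5.0], [0.0, -5.5]]
)
selected = select_k_farthest_one_nearest(demo_cluster, k=2)
print(demo_cluster[selected])
```

If a cluster holds fewer than k + 1 points, this sketch simply returns every point in it; how the original method handles such small clusters is not stated in the abstract.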

research
09/06/2022

Merged-GHCIDR: Geometrical Approach to Reduce Image Data

The computational resources required to train a model have been increasi...
research
10/09/2021

K-Splits: Improved K-Means Clustering Algorithm to Automatically Detect the Number of Clusters

This paper introduces k-splits, an improved hierarchical algorithm based...
research
12/14/2016

Border-Peeling Clustering

In this paper, we present a novel non-parametric clustering technique, w...
research
06/09/2021

Very Compact Clusters with Structural Regularization via Similarity and Connectivity

Clustering algorithms have significantly improved along with Deep Neural...
research
03/03/2023

Contrastive Hierarchical Clustering

Deep clustering has been dominated by flat models, which split a dataset...
research
12/08/2018

Learning Graph Representation via Formal Concept Analysis

We present a novel method that can learn a graph representation from mul...
research
03/03/2014

Matching Image Sets via Adaptive Multi Convex Hull

Traditional nearest points methods use all the samples in an image set t...
