DMS: Differentiable Mean Shift for Dataset Agnostic Task Specific Clustering Using Side Information

05/29/2023
by   Michael A. Hobley, et al.
0

We present a novel approach, in which we learn to cluster data directly from side information, in the form of a small set of pairwise examples. Unlike previous methods, with or without side information, we do not need to know the number of clusters, their centers or any kind of distance metric for similarity. Our method is able to divide the same data points in various ways dependant on the needs of a specific task, defined by the side information. Contrastingly, other work generally finds only the intrinsic, most obvious, clusters. Inspired by the mean shift algorithm, we implement our new clustering approach using a custom iterative neural network to create Differentiable Mean Shift (DMS), a state of the art, dataset agnostic, clustering method. We found that it was possible to train a strong cluster definition without enforcing a constraint that each cluster must be presented during training. DMS outperforms current methods in both the intrinsic and non-intrinsic dataset tasks.

READ FULL TEXT

page 1

page 2

page 8

research
04/24/2013

The K-modes algorithm for clustering

Many clustering algorithms exist that estimate a cluster centroid, such ...
research
04/19/2023

Community Detection Using Revised Medoid-Shift Based on KNN

Community detection becomes an important problem with the booming of soc...
research
12/20/2020

Automated Clustering of High-dimensional Data with a Feature Weighted Mean Shift Algorithm

Mean shift is a simple interactive procedure that gradually shifts data ...
research
12/14/2016

Border-Peeling Clustering

In this paper, we present a novel non-parametric clustering technique, w...
research
11/19/2015

Neural network-based clustering using pairwise constraints

This paper presents a neural network-based end-to-end clustering framewo...
research
12/04/2014

Iterative Subsampling in Solution Path Clustering of Noisy Big Data

We develop an iterative subsampling approach to improve the computationa...
research
08/28/2012

Document Clustering Evaluation: Divergence from a Random Baseline

Divergence from a random baseline is a technique for the evaluation of d...

Please sign up or login with your details

Forgot password? Click here to reset