Near-Optimal Comparison Based Clustering

10/08/2020
by Michaël Perrot, et al.

The goal of clustering is to group similar objects into meaningful partitions. This process is well understood when an explicit similarity measure between the objects is given. However, far less is known when this information is not readily available and, instead, one only observes ordinal comparisons such as "object i is more similar to j than to k." In this paper, we tackle this problem using a two-step procedure: we estimate a pairwise similarity matrix from the comparisons before using a clustering method based on semi-definite programming (SDP). We theoretically show that our approach can exactly recover a planted clustering using a near-optimal number of passive comparisons. We empirically validate our theoretical findings and demonstrate the good behaviour of our method on real data.
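The first step of the two-step procedure, turning passive comparisons into a pairwise similarity matrix, can be illustrated with a minimal sketch. It assumes a simple additive counting rule chosen for illustration, which is not necessarily the estimator used in the paper, and the function name `estimate_similarity` is hypothetical. The SDP clustering step is omitted; a plain sign test on the scores stands in for it on a noise-free planted clustering.

```python
from itertools import permutations


def estimate_similarity(n, triplets):
    """Build an n x n similarity score matrix from ordinal comparisons.

    Each triplet (i, j, k) reads "object i is more similar to j than to k".
    Assumption: a simple additive counting rule, used here for illustration only.
    """
    S = [[0] * n for _ in range(n)]
    for i, j, k in triplets:
        # The comparison is evidence that i and j are similar ...
        S[i][j] += 1
        S[j][i] += 1
        # ... and that i and k are dissimilar.
        S[i][k] -= 1
        S[k][i] -= 1
    return S


# Planted clustering with two groups of three objects (toy data).
labels = [0, 0, 0, 1, 1, 1]
n = len(labels)

# Noise-free comparisons: i and j share a cluster, k lies in the other one.
triplets = [
    (i, j, k)
    for i, j, k in permutations(range(n), 3)
    if labels[i] == labels[j] and labels[i] != labels[k]
]

S = estimate_similarity(n, triplets)

# Stand-in for the SDP step: group each object with those it scores positively.
recovered = [
    tuple(j for j in range(n) if j == i or S[i][j] > 0) for i in range(n)
]
print(recovered[0], recovered[3])
```

In this toy setting every within-cluster score is positive and every cross-cluster score is negative, so the sign test separates the two groups. With noisy or few comparisons that no longer holds, which is where a more robust clustering step such as the SDP becomes relevant.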

research
11/29/2022

A Revenue Function for Comparison-Based Hierarchical Clustering

Comparison-based learning addresses the problem of learning when, instea...
research
11/02/2018

Foundations of Comparison-Based Hierarchical Clustering

We address the classical problem of hierarchical clustering, but in a fr...
research
06/29/2016

A Semi-Definite Programming approach to low dimensional embedding for unsupervised clustering

This paper proposes a variant of the method of Guédon and Vershynin for e...
research
09/12/2009

Clustering Based on Pairwise Distances When the Data is of Mixed Dimensions

In the context of clustering, we consider a generative model in a Euclid...
research
02/26/2020

Query-Efficient Correlation Clustering

Correlation clustering is arguably the most natural formulation of clust...
research
04/03/2021

Semi matrix-free twogrid shifted Laplacian preconditioner for the Helmholtz equation with near optimal shifts

Due to its significance in terms of wave phenomena a considerable effort...
research
06/27/2018

Implementation of a Near-Optimal Complex Root Clustering Algorithm

We describe Ccluster, a software for computing natural ϵ-clusters of com...
