Foundations of Comparison-Based Hierarchical Clustering

11/02/2018
by   Debarghya Ghoshdastidar, et al.
0

We address the classical problem of hierarchical clustering, but in a framework where one does not have access to a representation of the objects or their pairwise similarities. Instead we assume that only a set of comparisons between objects are available in terms of statements of the form "objects i and j are more similar than objects k and l". Such a scenario is commonly encountered in crowdsourcing applications. The focus of this work is to develop comparison-based hierarchical clustering algorithms that do not rely on the principles of ordinal embedding. We propose comparison-based variants of average linkage clustering. We provide statistical guarantees for the proposed methods under a planted partition model for hierarchical clustering. We also empirically demonstrate the performance of the proposed methods on several datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/29/2022

A Revenue Function for Comparison-Based Hierarchical Clustering

Comparison-based learning addresses the problem of learning when, instea...
research
11/30/2021

Hierarchical clustering: visualization, feature importance and model selection

We propose methods for the analysis of hierarchical clustering that full...
research
10/08/2020

Near-Optimal Comparison Based Clustering

The goal of clustering is to group similar objects into meaningful parti...
research
06/17/2016

Generating Object Cluster Hierarchies for Benchmarking

The field of Machine Learning and the topic of clustering within it is s...
research
02/18/2011

Active Clustering: Robust and Efficient Hierarchical Clustering using Adaptively Selected Similarities

Hierarchical clustering based on pairwise similarities is a common tool ...
research
06/18/2020

Guarantees for Hierarchical Clustering by the Sublevel Set method

Meila (2018) introduces an optimization based method called the Sublevel...
research
03/10/2023

Hierarchical Clustering with OWA-based Linkages, the Lance-Williams Formula, and Dendrogram Inversions

Agglomerative hierarchical clustering based on Ordered Weighted Averagin...

Please sign up or login with your details

Forgot password? Click here to reset