Nearly-Optimal Hierarchical Clustering for Well-Clustered Graphs

06/16/2023
by   Steinar Laenen, et al.
8

This paper presents two efficient hierarchical clustering (HC) algorithms with respect to Dasgupta's cost function. For any input graph G with a clear cluster-structure, our designed algorithms run in nearly-linear time in the input size of G, and return an O(1)-approximate HC tree with respect to Dasgupta's cost function. We compare the performance of our algorithm against the previous state-of-the-art on synthetic and real-world datasets and show that our designed algorithm produces comparable or better HC trees with much lower running time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/06/2018

A New Cost Function for Hierarchical Cluster Trees

Hierarchical clustering has been a popular method in various data analys...
research
01/29/2019

Computing Optimal Assignments in Linear Time for Graph Matching

Finding an optimal assignment between two sets of objects is a fundament...
research
08/25/2020

Efficient Hierarchical Clustering for Classification and Anomaly Detection

We address the problem of large scale real-time classification of conten...
research
04/14/2021

Exact and Approximate Hierarchical Clustering Using A*

Hierarchical clustering is a critical task in numerous domains. Many app...
research
10/15/2022

A Nearly Optimal Size Coreset Algorithm with Nearly Linear Time

A coreset is a point set containing information about geometric properti...
research
02/26/2020

Compact Representation of Uncertainty in Hierarchical Clustering

Hierarchical clustering is a fundamental task often used to discover mea...
research
08/13/2021

An Information-theoretic Perspective of Hierarchical Clustering

A combinatorial cost function for hierarchical clustering was introduced...

Please sign up or login with your details

Forgot password? Click here to reset