Hierarchical Clustering: a 0.585 Revenue Approximation

06/02/2020
by   Noga Alon, et al.
0

Hierarchical Clustering trees have been widely accepted as a useful form of clustering data, resulting in a prevalence of adopting fields including phylogenetics, image analysis, bioinformatics and more. Recently, Dasgupta (STOC 16') initiated the analysis of these types of algorithms through the lenses of approximation. Later, the dual problem was considered by Moseley and Wang (NIPS 17') dubbing it the Revenue goal function. In this problem, given a nonnegative weight w_ij for each pair i,j ∈ [n]={1,2, … ,n}, the objective is to find a tree T whose set of leaves is [n] that maximizes the function ∑_i<j ∈ [n] w_ij (n -|T_ij|), where |T_ij| is the number of leaves in the subtree rooted at the least common ancestor of i and j. In our work we consider the revenue goal function and prove the following results. First, we prove the existence of a bisection (i.e., a tree of depth 2 in which the root has two children, each being a parent of n/2 leaves) which approximates the general optimal tree solution up to a factor of 1/2 (which is tight). Second, we apply this result in order to prove a 2/3p approximation for the general revenue problem, where p is defined as the approximation ratio of the Max-Uncut Bisection problem. Since p is known to be at least 0.8776 (Wu et al., 2015, Austrin et al., 2016), we get a 0.585 approximation algorithm for the revenue problem. This improves a sequence of earlier results which culminated in an 0.4246-approximation guarantee (Ahmadian et al., 2019).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/12/2021

Hierarchical Clustering: New Bounds and Objective

Hierarchical Clustering has been studied and used extensively as a metho...
research
09/20/2019

Online Hierarchical Clustering Approximations

Hierarchical clustering is a widely used approach for clustering dataset...
research
08/30/2021

Approximation algorithms for priority Steiner tree problems

In the Priority Steiner Tree (PST) problem, we are given an undirected g...
research
01/26/2021

Hierarchical Clustering via Sketches and Hierarchical Correlation Clustering

Recently, Hierarchical Clustering (HC) has been considered through the l...
research
10/07/2022

An Approximation Algorithm for Distance-Constrained Vehicle Routing on Trees

In the Distance-constrained Vehicle Routing Problem (DVRP), we are given...
research
06/10/2022

A Proof of the Tree of Shapes in n-D

In this paper, we prove that the self-dual morphological hierarchical st...
research
12/15/2020

Objective-Based Hierarchical Clustering of Deep Embedding Vectors

We initiate a comprehensive experimental study of objective-based hierar...

Please sign up or login with your details

Forgot password? Click here to reset