Multi Layer Peeling for Linear Arrangement and Hierarchical Clustering

05/02/2023
by   Yossi Azar, et al.
0

We present a new multi-layer peeling technique to cluster points in a metric space. A well-known non-parametric objective is to embed the metric space into a simpler structured metric space such as a line (i.e., Linear Arrangement) or a binary tree (i.e., Hierarchical Clustering). Points which are close in the metric space should be mapped to close points/leaves in the line/tree; similarly, points which are far in the metric space should be far in the line or on the tree. In particular we consider the Maximum Linear Arrangement problem <cit.> and the Maximum Hierarchical Clustering problem <cit.> applied to metrics. We design approximation schemes (1 - ϵ approximation for any constant ϵ > 0) for these objectives. In particular this shows that by considering metrics one may significantly improve former approximations (0.5 for Max Linear Arrangement and 0.74 for Max Hierarchical Clustering). Our main technique, which is called multi-layer peeling, consists of recursively peeling off points which are far from the "core" of the metric space. The recursion ends once the core becomes a sufficiently densely weighted metric space (i.e. the average distance is at least a constant times the diameter) or once it becomes negligible with respect to its inner contribution to the objective. Interestingly, the algorithm in the Linear Arrangement case is much more involved than that in the Hierarchical Clustering case, and uses a significantly more delicate peeling.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/03/2021

Stability for layer points

In the first half this paper, we generalize the theory of layer points f...
research
08/30/2020

An Objective for Hierarchical Clustering in Euclidean Space and its Connection to Bisecting K-means

This paper explores hierarchical clustering in the case where pairs of p...
research
01/26/2021

Hierarchical Clustering via Sketches and Hierarchical Correlation Clustering

Recently, Hierarchical Clustering (HC) has been considered through the l...
research
05/18/2023

On k-means for segments and polylines

We study the problem of k-means clustering in the space of straight-line...
research
05/03/2022

The Price of Hierarchical Clustering

Hierarchical Clustering is a popular tool for understanding the heredita...
research
01/02/2018

Sketching and Clustering Metric Measure Spaces

Two important optimization problems in the analysis of geometric data se...
research
08/13/2021

An Information-theoretic Perspective of Hierarchical Clustering

A combinatorial cost function for hierarchical clustering was introduced...

Please sign up or login with your details

Forgot password? Click here to reset