Tangles and Hierarchical Clustering

03/16/2022
by   Eva Fluck, et al.
0

We establish a connection between tangles, a concept from structural graph theory that plays a central role in Robertson and Seymour's graph minor project, and hierarchical clustering. Tangles cannot only be defined for graphs, but in fact for arbitrary connectivity functions, which are functions defined on the subsets of some finite universe. In typical clustering applications these universes consist of points in some metric space. Connectivity functions are usually required to be submodular. It is our first contribution to show that the central duality theorem connecting tangles with hierarchical decompositions (so-called branch decompositions) also holds if submodularity is replaced by a different property that we call maximum-submodular. We then define a connectivity function on finite data sets in an arbitrary metric space and prove that its tangles are in one-to-one correspondence with the clusters obtained by applying the well-known single linkage clustering algorithms to the same data set. Lastly we generalize this correspondence for any hierarchical clustering. We show that the data structure that represents hierarchical clustering results, called dendograms, are equivalent to maximum-submodular connectivity functions and their tangles. The idea of viewing tangles as clusters has first been proposed by Diestel and Whittle in 2016 as an approach to image segmentation. To the best of our knowledge, our result is the first that establishes a precise technical connection between tangles and clusters.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/24/2023

Supermodular Rank: Set Function Decomposition and Optimization

We define the supermodular rank of a function on a lattice. This is the ...
research
04/26/2020

Order preserving hierarchical agglomerative clustering of strict posets

We present a method for hierarchical clustering of directed acyclic grap...
research
12/16/2021

Hierarchical Clustering: O(1)-Approximation for Well-Clustered Graphs

Hierarchical clustering studies a recursive partition of a data set into...
research
09/03/2021

Stability for layer points

In the first half this paper, we generalize the theory of layer points f...
research
03/24/2021

Isolating Cuts, (Bi-)Submodularity, and Faster Algorithms for Global Connectivity Problems

Li and Panigrahi, in recent work, obtained the first deterministic algor...
research
11/24/2010

Classifying Clustering Schemes

Many clustering schemes are defined by optimizing an objective function ...
research
10/16/2019

FISHDBC: Flexible, Incremental, Scalable, Hierarchical Density-Based Clustering for Arbitrary Data and Distance

FISHDBC is a flexible, incremental, scalable, and hierarchical density-b...

Please sign up or login with your details

Forgot password? Click here to reset