An objective function for order preserving hierarchical clustering

09/09/2021
by   Daniel Bakkelund, et al.
0

We present an objective function for similarity based hierarchical clustering of partially ordered data that preserves the partial order in the sense that if x ≤ y, and if [x] and [y] are the respective clusters of x and y, then there is an order relation ≤' on the clusters for which [x] ≤' |y]. The model distinguishes itself from existing methods and models for clustering of ordered data in that the order relation and the similarity are combined to obtain an optimal hierarchical clustering seeking to satisfy both, and that the order relation is equipped with a pairwise level of comparability in the range [0,1]. In particular, if the similarity and the order relation are not aligned, then order preservation may have to yield in favor of clustering. Finding an optimal solution is NP-hard, so we provide a polynomial time approximation algorithm, with a relative performance guarantee of O(log^3/2n), based on successive applications of directed sparsest cut. The model is an extension of the Dasgupta cost function for divisive hierarchical clustering.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset

Sign in with Google

×

Use your Google Account to sign in to DeepAI

×

Consider DeepAI Pro