Massively-Parallel Heat Map Sorting and Applications To Explainable Clustering

09/14/2023
by   Sepideh Aghamolaei, et al.
0

Given a set of points labeled with k labels, we introduce the heat map sorting problem as reordering and merging the points and dimensions while preserving the clusters (labels). A cluster is preserved if it remains connected, i.e., if it is not split into several clusters and no two clusters are merged. We prove the problem is NP-hard and we give a fixed-parameter algorithm with a constant number of rounds in the massively parallel computation model, where each machine has a sublinear memory and the total memory of the machines is linear. We give an approximation algorithm for a NP-hard special case of the problem. We empirically compare our algorithm with k-means and density-based clustering (DBSCAN) using a dimensionality reduction via locality-sensitive hashing on several directed and undirected graphs of email and computer networks.

READ FULL TEXT
research
12/01/2021

On the Complexity of the Geometric Median Problem with Outliers

In the Geometric Median problem with outliers, we are given a finite set...
research
07/09/2019

r-Gather Clustering and r-Gathering on Spider: FPT Algorithms and Hardness

We consider min-max r-gather clustering problem and min-max r-gathering ...
research
10/25/2021

Packing 2D disks into a 3D container

In this article, we consider the problem of finding in three dimensions ...
research
06/15/2021

Correlation Clustering in Constant Many Parallel Rounds

Correlation clustering is a central topic in unsupervised learning, with...
research
02/03/2022

Fast and explainable clustering based on sorting

We introduce a fast and explainable clustering method called CLASSIX. It...
research
05/21/2018

A 5/2-Approximation Algorithm for Coloring Rooted Subtrees of a Degree 3 Tree

A rooted tree R⃗ is a rooted subtree of a tree T if the tree obtained by...
research
02/06/2020

Efficient Algorithms for Generating Provably Near-Optimal Cluster Descriptors for Explainability

Improving the explainability of the results from machine learning method...

Please sign up or login with your details

Forgot password? Click here to reset