The Price of Explainability for Clustering

04/19/2023
by   Anupam Gupta, et al.
0

Given a set of points in d-dimensional space, an explainable clustering is one where the clusters are specified by a tree of axis-aligned threshold cuts. Dasgupta et al. (ICML 2020) posed the question of the price of explainability: the worst-case ratio between the cost of the best explainable clusterings to that of the best clusterings. We show that the price of explainability for k-medians is at most 1+H_k-1; in fact, we show that the popular Random Thresholds algorithm has exactly this price of explanability, matching the known lower bound constructions. We complement our tight analysis of this particular algorithm by constructing instances where the price of explanability (using any algorithm) is at least (1-o(1)) ln k, showing that our result is best possible, up to lower-order terms. We also improve the price of explanability for the k-means problem to O(k lnln k) from the previous O(k ln k), considerably closing the gap to the lower bounds of Ω(k). Finally, we study the algorithmic question of finding the best explainable clustering: We show that explainable k-medians and k-means cannot be approximated better than O(ln k), under standard complexity-theoretic conjectures. This essentially settles the approximability of explainable k-medians and leaves open the intriguing possibility to get significantly better approximation algorithms for k-means than its price of explainability.

READ FULL TEXT
research
01/05/2021

On the price of explainability for some clustering problems

The price of explainability for a clustering task can be defined as the ...
research
07/01/2021

Almost Tight Approximation Algorithms for Explainable Clustering

Recently, due to an increasing interest for transparency in artificial i...
research
08/20/2022

The computational complexity of some explainable clustering problems

We study the computational complexity of some explainable clustering pro...
research
06/30/2021

Nearly-Tight and Oblivious Algorithms for Explainable Clustering

We study the problem of explainable clustering in the setting first form...
research
06/29/2021

Near-Optimal Explainable k-Means for All Dimensions

Many clustering algorithms are guided by certain cost functions such as ...
research
10/18/2022

Towards Practical Explainability with Cluster Descriptors

With the rapid development of machine learning, improving its explainabi...
research
08/07/2018

Hierarchical Clustering better than Average-Linkage

Hierarchical Clustering (HC) is a widely studied problem in exploratory ...

Please sign up or login with your details

Forgot password? Click here to reset