Near-Optimal Explainable k-Means for All Dimensions

06/29/2021
by Moses Charikar, et al.

Many clustering algorithms are guided by certain cost functions such as the widely-used k-means cost. These algorithms divide data points into clusters with often complicated boundaries, creating difficulties in explaining the clustering decision. In a recent work, Dasgupta, Frost, Moshkovitz, and Rashtchian (ICML 2020) introduced explainable clustering, where the cluster boundaries are axis-parallel hyperplanes and the clustering is obtained by applying a decision tree to the data. The central question here is: how much does the explainability constraint increase the value of the cost function? Given d-dimensional data points, we show an efficient algorithm that finds an explainable clustering whose k-means cost is at most k^{1-2/d} poly(d log k) times the minimum cost achievable by a clustering without the explainability constraint, assuming k, d ≥ 2. Taking the minimum of this bound and the k polylog(k) bound in independent work by Makarychev-Shan (ICML 2021), Gamlath-Jia-Polak-Svensson (2021), or Esfandiari-Mirrokni-Narayanan (2021), we get an improved bound of k^{1-2/d} polylog(k), which we show is optimal for every choice of k, d ≥ 2 up to a poly-logarithmic factor in k. For d = 2 in particular, we show an O(log k log log k) bound, improving near-exponentially over the previous best bound of O(k log k) by Laber and Murtinho (ICML 2021).
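
To make the central question concrete, the following is a minimal sketch of the cost ratio for the simplest case k = 2, where an explainable clustering is a single axis-parallel threshold cut. It is not the algorithm from the paper: both optima are found by brute force, which is only feasible for a handful of points. The function names and the toy data set are assumptions made for illustration.

```python
from itertools import product

def kmeans_cost(clusters):
    """Sum of squared distances from each point to the mean of its cluster."""
    total = 0.0
    for pts in clusters:
        if not pts:
            continue
        dims = len(pts[0])
        center = [sum(p[j] for p in pts) / len(pts) for j in range(dims)]
        total += sum((p[j] - center[j]) ** 2 for p in pts for j in range(dims))
    return total

def best_unconstrained_2means(points):
    """Minimum 2-means cost over all partitions into two non-empty clusters."""
    best = float("inf")
    for labels in product((0, 1), repeat=len(points)):
        if len(set(labels)) < 2:
            continue  # skip assignments that leave one cluster empty
        clusters = [[p for p, l in zip(points, labels) if l == c] for c in (0, 1)]
        best = min(best, kmeans_cost(clusters))
    return best

def best_threshold_cut(points):
    """Minimum 2-means cost over all single axis-parallel cuts (a 2-leaf tree)."""
    best = float("inf")
    for axis in range(len(points[0])):
        # Every distinct coordinate except the largest gives a cut with two non-empty sides.
        for thr in sorted({p[axis] for p in points})[:-1]:
            left = [p for p in points if p[axis] <= thr]
            right = [p for p in points if p[axis] > thr]
            best = min(best, kmeans_cost([left, right]))
    return best

# Hypothetical toy data: two groups offset along the diagonal.
points = [(-1.0, 1.0), (0.0, 0.0), (1.0, -1.0), (0.5, 2.5), (1.5, 1.5), (2.5, 0.5)]
ratio = best_threshold_cut(points) / best_unconstrained_2means(points)
print(f"explainable / unconstrained cost ratio: {ratio:.3f}")
```

Because every threshold cut is also a valid partition into two clusters, the ratio is never below 1. The bounds in the abstract describe how large this ratio can become in the worst case as k and d grow.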


research
06/30/2021

Nearly-Tight and Oblivious Algorithms for Explainable Clustering

We study the problem of explainable clustering in the setting first form...
research
11/04/2021

Explainable k-means. Don't be greedy, plant bigger trees!

We provide a new bi-criteria Õ(log^2 k) competitive algorithm for explai...
research
07/02/2021

Near-optimal Algorithms for Explainable k-Medians and k-Means

We consider the problem of explainable k-medians and k-means introduced ...
research
06/03/2020

ExKMC: Expanding Explainable k-Means Clustering

Despite the popularity of explainable AI, there is limited work on effec...
research
04/19/2023

The Price of Explainability for Clustering

Given a set of points in d-dimensional space, an explainable clustering ...
research
04/26/2022

Polylogarithmic Sketches for Clustering

Given n points in ℓ_p^d, we consider the problem of partitioning points ...
research
05/04/2023

Impossibility of Depth Reduction in Explainable Clustering

Over the last few years Explainable Clustering has gathered a lot of att...
