Near-optimal Algorithms for Explainable k-Medians and k-Means

07/02/2021
by   Konstantin Makarychev, et al.
0

We consider the problem of explainable k-medians and k-means introduced by Dasgupta, Frost, Moshkovitz, and Rashtchian (ICML 2020). In this problem, our goal is to find a threshold decision tree that partitions data into k clusters and minimizes the k-medians or k-means objective. The obtained clustering is easy to interpret because every decision node of a threshold tree splits data based on a single feature into two groups. We propose a new algorithm for this problem which is Õ(log k) competitive with k-medians with ℓ_1 norm and Õ(k) competitive with k-means. This is an improvement over the previous guarantees of O(k) and O(k^2) by Dasgupta et al (2020). We also provide a new algorithm which is O(log^3/2 k) competitive for k-medians with ℓ_2 norm. Our first algorithm is near-optimal: Dasgupta et al (2020) showed a lower bound of Ω(log k) for k-medians; in this work, we prove a lower bound of Ω̃(k) for k-means. We also provide a lower bound of Ω(log k) for k-medians with ℓ_2 norm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/18/2023

Random Cuts are Optimal for Explainable k-Medians

We show that the RandomCoordinateCut algorithm gives the optimal competi...
research
11/04/2021

Explainable k-means. Don't be greedy, plant bigger trees!

We provide a new bi-criteria Õ(log^2 k) competitive algorithm for explai...
research
12/31/2019

Near-Optimal Schedules for Simultaneous Multicasts

We study the store-and-forward packet routing problem for simultaneous m...
research
04/10/2023

Geometry of Rounding: Near Optimal Bounds and a New Neighborhood Sperner's Lemma

A partition 𝒫 of ℝ^d is called a (k,ε)-secluded partition if, for every ...
research
05/04/2023

Impossibility of Depth Reduction in Explainable Clustering

Over the last few years Explainable Clustering has gathered a lot of att...
research
06/29/2021

Near-Optimal Explainable k-Means for All Dimensions

Many clustering algorithms are guided by certain cost functions such as ...
research
06/08/2020

Generalizing the Sharp Threshold Phenomenon for the Distributed Complexity of the Lovász Local Lemma

Recently, Brandt, Maus and Uitto [PODC'19] showed that, in a restricted ...

Please sign up or login with your details

Forgot password? Click here to reset