Explainable k-means. Don't be greedy, plant bigger trees!

11/04/2021
by   Konstantin Makarychev, et al.

We provide a new bi-criteria Õ(log^2 k)-competitive algorithm for explainable k-means clustering. Explainable k-means was recently introduced by Dasgupta, Frost, Moshkovitz, and Rashtchian (ICML 2020). An explainable clustering is described by an easy-to-interpret (threshold) decision tree or diagram. The cost of an explainable k-means clustering equals the sum of the costs of its clusters, and the cost of each cluster equals the sum of squared distances from the points in the cluster to the center of that cluster. Our randomized bi-criteria algorithm constructs a threshold decision tree that partitions the data set into (1+δ)k clusters (where δ∈ (0,1) is a parameter of the algorithm). The cost of this clustering is at most Õ(1/δ·log^2 k) times the cost of the optimal unconstrained k-means clustering. We show that this bound is almost optimal.
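To make the cost definition above concrete, here is a minimal Python sketch of how a threshold decision tree assigns points to clusters and how the resulting k-means cost is computed. It is not the paper's algorithm: the tree is fixed by hand rather than constructed, and the tuple encoding of the tree, the function names `kmeans_cost` and `threshold_tree_labels`, and the toy data are all assumptions made for illustration.

```python
import numpy as np

def kmeans_cost(points, labels):
    """Sum, over clusters, of squared distances from points to the cluster mean."""
    points = np.asarray(points, dtype=float)
    labels = np.asarray(labels)
    total = 0.0
    for c in np.unique(labels):
        cluster = points[labels == c]
        center = cluster.mean(axis=0)  # the cost-minimizing center is the mean
        total += float(((cluster - center) ** 2).sum())
    return total

def threshold_tree_labels(points, tree):
    """Assign each point to a leaf of a threshold tree.

    Hypothetical encoding: a leaf is an int cluster id; an internal node is
    a tuple (feature, threshold, left, right), and a point goes left when
    point[feature] <= threshold.
    """
    points = np.asarray(points, dtype=float)
    labels = np.empty(len(points), dtype=int)
    for i, x in enumerate(points):
        node = tree
        while not isinstance(node, int):
            feature, threshold, left, right = node
            node = left if x[feature] <= threshold else right
        labels[i] = node
    return labels

# Toy data (illustrative only): three well-separated pairs of points in the plane.
points = np.array([
    [0.0, 0.0], [0.0, 2.0],
    [10.0, 0.0], [10.0, 2.0],
    [5.0, 10.0], [5.0, 12.0],
])

# Hand-written tree: split on coordinate 1 at 5.0, then on coordinate 0 at 5.0.
tree = (1, 5.0, (0, 5.0, 0, 1), 2)

labels = threshold_tree_labels(points, tree)
cost = kmeans_cost(points, labels)
print(labels.tolist(), cost)
```

Each internal node compares a single coordinate with a threshold, which is what makes the clustering easy to explain. In the bi-criteria setting described above, the tree would be allowed up to (1+δ)k leaves instead of exactly k.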


research
07/02/2021

Near-optimal Algorithms for Explainable k-Medians and k-Means

We consider the problem of explainable k-medians and k-means introduced ...
research
02/28/2020

Explainable k-Means and k-Medians Clustering

Clustering is a popular form of unsupervised learning for geometric data...
research
05/04/2023

Impossibility of Depth Reduction in Explainable Clustering

Over the last few years Explainable Clustering has gathered a lot of att...
research
06/03/2020

ExKMC: Expanding Explainable k-Means Clustering

Despite the popularity of explainable AI, there is limited work on effec...
research
06/29/2021

Near-Optimal Explainable k-Means for All Dimensions

Many clustering algorithms are guided by certain cost functions such as ...
research
12/29/2021

Shallow decision trees for explainable k-means clustering

A number of recent works have employed decision trees for the constructi...
research
09/27/2012

Reclassification formula that provides to surpass K-means method

The paper presents a formula for the reclassification of multidimensiona...
