Cluster Explanation via Polyhedral Descriptions

10/17/2022
by Connor Lawless, et al.

Clustering is an unsupervised learning problem that aims to partition unlabelled data points into groups with similar features. Traditional clustering algorithms provide limited insight into the groups they find, because they focus on accuracy rather than on the interpretability of the group assignments. This has spurred a recent line of work on explainable machine learning for clustering. In this paper, we focus on the cluster description problem: given a dataset and its partition into clusters, the task is to explain the clusters. We introduce a new approach that explains clusters by constructing a polyhedron around each cluster while minimizing either the complexity of the resulting polyhedra or the number of features used in the description. We formulate the cluster description problem as an integer program and present a column generation approach to search over an exponential number of candidate half-spaces that can be used to build the polyhedra. To deal with large datasets, we introduce a novel grouping scheme that first forms smaller groups of data points and then builds the polyhedra around the grouped data, a strategy that outperforms simply sub-sampling the data. Compared to state-of-the-art cluster description algorithms, our approach achieves competitive interpretability with improved description accuracy.
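To make the idea of a polyhedral description concrete, the sketch below represents a polyhedron as an intersection of half-spaces and checks how well it describes one cluster. It is only an illustration of the objects named in the abstract, not the paper's integer program or column generation procedure. The names `HalfSpace`, `in_polyhedron`, `description_errors`, and `features_used`, as well as the toy data, are hypothetical and chosen for this example.

```python
from dataclasses import dataclass
from typing import Sequence, Set, Tuple


@dataclass(frozen=True)
class HalfSpace:
    """Hypothetical helper: the set of points x with weights . x <= offset."""
    weights: Tuple[float, ...]
    offset: float

    def contains(self, x: Sequence[float]) -> bool:
        return sum(w * xi for w, xi in zip(self.weights, x)) <= self.offset


def in_polyhedron(half_spaces: Sequence[HalfSpace], x: Sequence[float]) -> bool:
    # A polyhedron is the intersection of its half-spaces.
    return all(h.contains(x) for h in half_spaces)


def description_errors(half_spaces, points, labels, cluster) -> Tuple[int, int]:
    """Count cluster points left outside and other points wrongly inside."""
    missed = 0
    wrongly_included = 0
    for x, label in zip(points, labels):
        inside = in_polyhedron(half_spaces, x)
        if label == cluster and not inside:
            missed += 1
        elif label != cluster and inside:
            wrongly_included += 1
    return missed, wrongly_included


def features_used(half_spaces: Sequence[HalfSpace]) -> Set[int]:
    # A feature is "used" if any half-space gives it a nonzero coefficient.
    return {j for h in half_spaces for j, w in enumerate(h.weights) if w != 0}


# Toy data (assumed for illustration): two clusters in two dimensions.
points = [(1, 1), (2, 1), (1, 2), (4, 4), (5, 4), (4, 1)]
labels = [0, 0, 0, 1, 1, 1]

# Candidate description of cluster 0: the single half-space x0 <= 3.
description = [HalfSpace(weights=(1, 0), offset=3)]

errors = description_errors(description, points, labels, cluster=0)
n_half_spaces = len(description)
used = features_used(description)
```

In this toy case, the two quantities the abstract proposes to minimize correspond to `n_half_spaces` (one simple notion of polyhedron complexity, assumed here) and `len(used)` (the number of features in the description), while `errors` measures description accuracy.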

research
12/10/2021

Interpretable Clustering via Multi-Polytope Machines

Clustering is a popular unsupervised learning tool often used to discove...
research
06/18/2019

From Clustering to Cluster Explanations via Neural Networks

A wealth of algorithms have been developed to extract natural cluster st...
research
02/03/2022

Fast and explainable clustering based on sorting

We introduce a fast and explainable clustering method called CLASSIX. It...
research
09/08/2022

A penalized criterion for selecting the number of clusters for K-medians

Clustering is a usual unsupervised machine learning technique for groupi...
research
02/06/2023

Fair Minimum Representation Clustering

Clustering is an unsupervised learning task that aims to partition data ...
research
03/03/2020

EXPLAIN-IT: Towards Explainable AI for Unsupervised Network Traffic Analysis

The application of unsupervised learning approaches, and in particular o...
research
05/24/2021

Deep Descriptive Clustering

Recent work on explainable clustering allows describing clusters when th...
