Interpretable Clustering via Multi-Polytope Machines

12/10/2021
by Connor Lawless, et al.

Clustering is a popular unsupervised learning tool often used to discover groups within a larger population, such as customer segments or patient subtypes. However, despite its use as a tool for subgroup discovery and description, few state-of-the-art algorithms provide any rationale or description for the clusters they find. We propose a novel approach for interpretable clustering that both clusters data points and constructs polytopes around the discovered clusters to explain them. Our framework allows for additional constraints on the polytopes, including requiring that the hyperplanes defining each polytope be axis-parallel or sparse with integer coefficients. We formulate the problem of constructing clusters via polytopes as a Mixed-Integer Non-Linear Program (MINLP). To solve our formulation, we propose a two-phase approach: we first initialize clusters and polytopes using alternating minimization, and then use coordinate descent to boost clustering performance. We benchmark our approach on a suite of synthetic and real-world clustering problems, where our algorithm outperforms state-of-the-art interpretable and non-interpretable clustering algorithms.
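
To make the idea of explaining a cluster by a polytope concrete, the following is a minimal sketch, not the authors' MINLP formulation or their two-phase solver. It assumes each polytope is given as a list of half-spaces w . x <= b with integer coefficients; the function names, the feature names (`age`, `income`), and the numeric thresholds are all hypothetical and chosen only for illustration.

```python
from typing import List, Optional, Sequence, Tuple

# Assumption: a half-space is a pair (w, b), read as w . x <= b.
# A polytope is the intersection of its half-spaces.
HalfSpace = Tuple[Sequence[float], float]
Polytope = List[HalfSpace]


def in_polytope(x: Sequence[float], polytope: Polytope) -> bool:
    """Return True if x satisfies every constraint w . x <= b."""
    return all(
        sum(wi * xi for wi, xi in zip(w, x)) <= b
        for w, b in polytope
    )


def assign(x: Sequence[float], polytopes: Sequence[Polytope]) -> Optional[int]:
    """Return the index of the first polytope containing x, or None."""
    for k, polytope in enumerate(polytopes):
        if in_polytope(x, polytope):
            return k
    return None


def describe(polytope: Polytope, names: Sequence[str]) -> str:
    """Render a polytope as a readable conjunction of linear rules."""
    rules = []
    for w, b in polytope:
        # Skip zero coefficients so sparse hyperplanes give short rules.
        terms = [f"{wi:g}*{name}" for wi, name in zip(w, names) if wi != 0]
        rules.append(" + ".join(terms) + f" <= {b:g}")
    return " AND ".join(rules)


# Hypothetical axis-parallel polytope: 0 <= age <= 30 and income <= 50.
young_low_income: Polytope = [([1, 0], 30), ([-1, 0], 0), ([0, 1], 50)]
# Hypothetical second cluster described by a single rule: age >= 31.
older: Polytope = [([-1, 0], -31)]

print(describe(young_low_income, ["age", "income"]))
print(assign([45, 80], [young_low_income, older]))
```

Because every hyperplane here has a single nonzero integer coefficient, each rule reads as a simple threshold on one feature, which is the kind of description the axis-parallel constraint is meant to produce. A point that falls in no polytope, such as `[30.5, 10]` above, is left unassigned by this sketch.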


research
10/17/2022

Cluster Explanation via Polyhedral Descriptions

Clustering is an unsupervised learning problem that aims to partition un...
research
12/03/2018

Interpretable Clustering via Optimal Trees

State-of-the-art clustering algorithms use heuristics to partition the f...
research
05/07/2023

A Generalized Framework for Predictive Clustering and Optimization

Clustering is a powerful and extensively used data science tool. While c...
research
02/06/2023

Fair Minimum Representation Clustering

Clustering is an unsupervised learning task that aims to partition data ...
research
02/03/2022

Fast and explainable clustering based on sorting

We introduce a fast and explainable clustering method called CLASSIX. It...
research
07/09/2019

Hierarchical Clustering Supported by Reciprocal Nearest Neighbors

Clustering is a fundamental analysis tool aiming at classifying data poi...
research
02/20/2020

A Scalable Framework for Sparse Clustering Without Shrinkage

Clustering, a fundamental activity in unsupervised learning, is notoriou...
