Optimal Decision Trees For Interpretable Clustering with Constraints

01/30/2023
by   Pouya Shati, et al.
0

Constrained clustering is a semi-supervised task that employs a limited amount of labelled data, formulated as constraints, to incorporate domain-specific knowledge and to significantly improve clustering accuracy. Previous work has considered exact optimization formulations that can guarantee optimal clustering while satisfying all constraints, however these approaches lack interpretability. Recently, decision-trees have been used to produce inherently interpretable clustering solutions, however existing approaches do not support clustering constraints and do not provide strong theoretical guarantees on solution quality. In this work, we present a novel SAT-based framework for interpretable clustering that supports clustering constraints and that also provides strong theoretical guarantees on solution quality. We also present new insight into the trade-off between interpretability and satisfaction of such user-provided constraints. Our framework is the first approach for interpretable and constrained clustering. Experiments with a range of real-world and synthetic datasets demonstrate that our approach can produce high-quality and interpretable constrained clustering solutions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/17/2019

Balancing the Tradeoff Between Clustering Value and Interpretability

Graph clustering groups entities – the vertices of a graph – based on th...
research
08/26/2022

A Framework for Inherently Interpretable Optimization Models

With dramatic improvements in optimization software, the solution of lar...
research
12/03/2018

Interpretable Clustering via Optimal Trees

State-of-the-art clustering algorithms use heuristics to partition the f...
research
01/20/2022

Scalable k-d trees for distributed data

Data structures known as k-d trees have numerous applications in scienti...
research
07/16/2023

Using Decision Trees for Interpretable Supervised Clustering

In this paper, we address an issue of finding explainable clusters of cl...
research
12/05/2018

MLIC: A MaxSAT-Based framework for learning interpretable classification rules

The wide adoption of machine learning approaches in the industry, govern...
research
11/27/2017

One-Shot Coresets: The Case of k-Clustering

Scaling clustering algorithms to massive data sets is a challenging task...

Please sign up or login with your details

Forgot password? Click here to reset