Interpretable multiclass classification by MDL-based rule lists

05/01/2019
by   Hugo Manuel Proença, et al.
0

Interpretable classifiers have recently witnessed an increase in attention from the data mining community because they are inherently easier to understand and explain than their more complex counterparts. Examples of interpretable classification models include decision trees, rule sets, and rule lists. Learning such models often involves optimizing hyperparameters, which typically requires substantial amounts of data and may result in relatively large models. In this paper, we consider the problem of learning compact yet accurate probabilistic rule lists for multiclass classification. Specifically, we propose a novel formalization based on probabilistic rule lists and the minimum description length (MDL) principle. This results in virtually parameter-free model selection that naturally allows to trade-off model complexity with goodness of fit, by which overfitting and the need for hyperparameter tuning are effectively avoided. Finally, we introduce the Classy algorithm, which greedily finds rule lists according to the proposed criterion. We empirically demonstrate that Classy selects small probabilistic rule lists that outperform state-of-the-art classifiers when it comes to the combination of predictive performance and interpretability. We show that Classy is insensitive to its only parameter, i.e., the candidate set, and that compression on the training set correlates with classification performance, validating our MDL-based selection criterion.

READ FULL TEXT
research
06/17/2022

Truly Unordered Probabilistic Rule Sets for Multi-class Classification

Rule set learning has long been studied and has recently been frequently...
research
08/29/2023

Probabilistic Dataset Reconstruction from Interpretable Models

Interpretability is often pointed out as a key requirement for trustwort...
research
06/16/2020

Discovering outstanding subgroup lists for numeric targets using MDL

The task of subgroup discovery (SD) is to find interpretable description...
research
11/05/2015

Interpretable classifiers using rules and Bayesian analysis: Building a better stroke prediction model

We aim to produce predictive models that are not only accurate, but are ...
research
04/24/2022

Computing the Collection of Good Models for Rule Lists

Since the seminal paper by Breiman in 2001, who pointed out a potential ...
research
03/25/2021

Robust subgroup discovery

We introduce the problem of robust subgroup discovery, i.e., finding a s...
research
04/06/2017

Learning Certifiably Optimal Rule Lists for Categorical Data

We present the design and implementation of a custom discrete optimizati...

Please sign up or login with your details

Forgot password? Click here to reset