SIRUS: making random forests interpretable

08/19/2019
by Clément Bénard, et al.

State-of-the-art learning algorithms, such as random forests or neural networks, are often described as "black boxes" because of the high number and complexity of operations involved in their prediction mechanism. This lack of interpretability is a strong limitation for applications involving critical decisions, typically the analysis of production processes in the manufacturing industry. In such critical contexts, models have to be interpretable, i.e., simple, stable, and predictive. To address this issue, we design SIRUS (Stable and Interpretable RUle Set), a new classification algorithm based on random forests, which takes the form of a short list of rules. While simple models are usually unstable with respect to data perturbation, SIRUS achieves a remarkable stability improvement over cutting-edge methods. Furthermore, SIRUS inherits a predictive accuracy close to that of random forests, combined with the simplicity of decision trees. These properties are assessed from both a theoretical and an empirical point of view, through extensive numerical experiments based on our R/C++ software implementation sirus.
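
To make "a short list of rules" and "stability" concrete, here is a minimal Python sketch. It is not the sirus package and not the authors' implementation. The abstract does not say how rule outputs are combined or how stability is measured, so two assumptions are made here: the model's output is the plain average of the rule outputs, and stability is the Dice-Sorensen overlap between the rule conditions obtained from two perturbed samples. The names `Rule`, `predict_proba`, and `stability` are hypothetical, as are all numeric values.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Rule:
    """One 'if x[feature] < threshold then p_if else p_else' rule (hypothetical)."""
    feature: int
    threshold: float
    p_if: float    # estimated P(Y = 1) when the condition holds
    p_else: float  # estimated P(Y = 1) otherwise

    def output(self, x: Sequence[float]) -> float:
        return self.p_if if x[self.feature] < self.threshold else self.p_else


def predict_proba(rules: Sequence[Rule], x: Sequence[float]) -> float:
    """Assumed aggregation: unweighted mean of the individual rule outputs."""
    return sum(r.output(x) for r in rules) / len(rules)


def stability(rules_a: Sequence[Rule], rules_b: Sequence[Rule]) -> float:
    """Assumed stability metric: Dice-Sorensen overlap of rule conditions."""
    a = {(r.feature, r.threshold) for r in rules_a}
    b = {(r.feature, r.threshold) for r in rules_b}
    if not a and not b:
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))


# Two illustrative rule lists, as if fitted on two perturbed samples.
rules_run1 = [Rule(0, 2.5, 0.75, 0.25), Rule(1, 10.0, 0.25, 0.5)]
rules_run2 = [Rule(0, 2.5, 0.7, 0.3), Rule(2, 1.0, 0.6, 0.4)]

x = (1.0, 12.0, 0.0)
print(predict_proba(rules_run1, x))       # mean of 0.75 and 0.5
print(stability(rules_run1, rules_run2))  # one shared condition out of two per list
```

Each rule can be read on its own, which is what makes such a model simple, and a stability near 1 would mean that the same conditions are selected again when the data are perturbed.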

research
04/29/2020

Interpretable Random Forests via Rule Extraction

We introduce SIRUS (Stable and Interpretable RUle Set) for regression, a...
research
11/11/2019

Simplifying Random Forests: On the Trade-off between Interpretability and Accuracy

We analyze the trade-off between model complexity and accuracy for rando...
research
07/11/2020

Towards Robust Classification with Deep Generative Forests

Decision Trees and Random Forests are among the most widely used machine...
research
07/22/2015

Banzhaf Random Forests

Random forests are a type of ensemble method which makes predictions by ...
research
02/08/2022

Is interpolation benign for random forests?

Statistical wisdom suggests that very complex models, interpolating trai...
research
02/21/2020

An Investigation of Interpretability Techniques for Deep Learning in Predictive Process Analytics

This paper explores interpretability techniques for two of the most succ...
research
06/29/2022

TE2Rules: Extracting Rule Lists from Tree Ensembles

Tree Ensemble (TE) models (e.g. Gradient Boosted Trees and Random Forest...
