Interpretation and Simplification of Deep Forest

01/14/2020
by   Sangwon Kim, et al.
0

This paper proposes a new method for interpreting and simplifying a black box model of a deep random forest (RF) using a proposed rule elimination. In deep RF, a large number of decision trees are connected to multiple layers, thereby making an analysis difficult. It has a high performance similar to that of a deep neural network (DNN), but achieves a better generalizability. Therefore, in this study, we consider quantifying the feature contributions and frequency of the fully trained deep RF in the form of a decision rule set. The feature contributions provide a basis for determining how features affect the decision process in a rule set. Model simplification is achieved by eliminating unnecessary rules by measuring the feature contributions. Consequently, the simplified model has fewer parameters and rules than before. Experiment results have shown that a feature contribution analysis allows a black box model to be decomposed for quantitatively interpreting a rule set. The proposed method was successfully applied to various deep RF models and benchmark datasets while maintaining a robust performance despite the elimination of a large number of rules.

READ FULL TEXT
research
07/13/2020

Rule Covering for Interpretation and Boosting

We propose two algorithms for interpretation and boosting of tree-based ...
research
08/29/2022

Interpreting Black-box Machine Learning Models for High Dimensional Datasets

Deep neural networks (DNNs) have been shown to outperform traditional ma...
research
05/30/2016

Forest Floor Visualizations of Random Forests

We propose a novel methodology, forest floor, to visualize and interpret...
research
05/08/2020

Explainable Matrix – Visualization for Global and Local Interpretability of Random Forest Classification Ensembles

Over the past decades, classification models have proven to be one of th...
research
07/17/2018

Explicating feature contribution using Random Forest proximity distances

In Random Forests, proximity distances are a metric representation of da...
research
02/16/2017

Tree Ensembles with Rule Structured Horseshoe Regularization

We propose a new Bayesian model for flexible nonlinear regression and cl...
research
10/15/2020

Interpreting Deep Learning Model Using Rule-based Method

Deep learning models are favored in many research and industry areas and...

Please sign up or login with your details

Forgot password? Click here to reset