Rule Covering for Interpretation and Boosting

07/13/2020
by   Ş. İlker Birbil, et al.
0

We propose two algorithms for interpretation and boosting of tree-based ensemble methods. Both algorithms make use of mathematical programming models that are constructed with a set of rules extracted from an ensemble of decision trees. The objective is to obtain the minimum total impurity with the least number of rules that cover all the samples. The first algorithm uses the collection of decision trees obtained from a trained random forest model. Our numerical results show that the proposed rule covering approach selects only a few rules that could be used for interpreting the random forest model. Moreover, the resulting set of rules closely matches the accuracy level of the random forest model. Inspired by the column generation algorithm in linear programming, our second algorithm uses a rule generation scheme for boosting decision trees. We use the dual optimal solutions of the linear programming models as sample weights to obtain only those rules that would improve the accuracy. With a computational study, we observe that our second algorithm performs competitively with the other well-known boosting methods. Our implementations also demonstrate that both algorithms can be trivially coupled with the existing random forest and decision tree packages.

READ FULL TEXT
research
03/29/2022

Explaining random forest prediction through diverse rulesets

Tree-ensemble algorithms, such as random forest, are effective machine l...
research
02/16/2017

Tree Ensembles with Rule Structured Horseshoe Regularization

We propose a new Bayesian model for flexible nonlinear regression and cl...
research
04/21/2021

Discovering Classification Rules for Interpretable Learning with Linear Programming

Rules embody a set of if-then statements which include one or more condi...
research
07/02/2013

Comparing various regression methods on ensemble strategies in differential evolution

Differential evolution possesses a multitude of various strategies for g...
research
01/14/2020

Interpretation and Simplification of Deep Forest

This paper proposes a new method for interpreting and simplifying a blac...
research
04/05/2020

XtracTree for Regulator Validation of Bagging Methods Used in Retail Banking

Bootstrap aggregation, known as bagging, is one of the most popular ense...

Please sign up or login with your details

Forgot password? Click here to reset