bsnsing: A decision tree induction method based on recursive optimal boolean rule composition

05/30/2022
by   Yanchao Liu, et al.
0

This paper proposes a new mixed-integer programming (MIP) formulation to optimize split rule selection in the decision tree induction process, and develops an efficient search algorithm that is able to solve practical instances of the MIP model faster than commercial solvers. The formulation is novel for it directly maximizes the Gini reduction, an effective split selection criterion which has never been modeled in a mathematical program for its nonconvexity. The proposed approach differs from other optimal classification tree models in that it does not attempt to optimize the whole tree, therefore the flexibility of the recursive partitioning scheme is retained and the optimization model is more amenable. The approach is implemented in an open-source R package named bsnsing. Benchmarking experiments on 75 open data sets suggest that bsnsing trees are the most capable of discriminating new cases compared to trees trained by other decision tree codes including the rpart, C50, party and tree packages in R. Compared to other optimal decision tree packages, including DL8.5, OSDT, GOSDT and indirectly more, bsnsing stands out in its training speed, ease of use and broader applicability without losing in prediction accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/11/2022

Rethink Decision Tree Traversal

We will show how to implement binary decision tree traversal in the lang...
research
03/27/2013

Decision Tree Induction Systems: A Bayesian Analysis

Decision tree induction systems are being used for knowledge acquisition...
research
03/27/2018

A Decision Tree Approach to Predicting Recidivism in Domestic Violence

Domestic violence (DV) is a global social and public health issue that i...
research
11/14/2016

Splitting matters: how monotone transformation of predictor variables may improve the predictions of decision tree models

It is widely believed that the prediction accuracy of decision tree mode...
research
07/28/2023

ODTlearn: A Package for Learning Optimal Decision Trees for Prediction and Prescription

ODTLearn is an open-source Python package that provides methods for lear...
research
08/12/2019

Decision Trees for Complexity Reduction in Video Compression

This paper proposes a method for complexity reduction in practical video...
research
09/04/2018

Maximizing net income of the auction waterfall with an abort decision tree

An online auction waterfall for an ad impression may contain auctions th...

Please sign up or login with your details

Forgot password? Click here to reset