Quant-BnB: A Scalable Branch-and-Bound Method for Optimal Decision Trees with Continuous Features

06/23/2022
by Rahul Mazumder, et al.

Decision trees are one of the most useful and popular methods in the machine learning toolbox. In this paper, we consider the problem of learning optimal decision trees, a combinatorial optimization problem that is challenging to solve at scale. A common approach in the literature is to use greedy heuristics, which do not guarantee an optimal tree. Recently there has been significant interest in learning optimal decision trees using various approaches (e.g., based on integer programming or dynamic programming). To achieve computational scalability, most of these approaches focus on classification tasks with binary features. In this paper, we present a new discrete optimization method based on branch-and-bound (BnB) to obtain optimal decision trees. Unlike existing customized approaches, we consider both regression and classification tasks with continuous features. The basic idea underlying our approach is to split the search space based on the quantiles of the feature distribution, which yields upper and lower bounds for the underlying optimization problem along the BnB iterations. Our proposed algorithm, Quant-BnB, shows significant speedups compared to existing approaches for shallow optimal trees on various real datasets.
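
To make the quantile-based splitting idea concrete, here is a minimal sketch for the simplest possible case: a depth-1 regression tree (a single split) on one continuous feature. It is not the Quant-BnB algorithm from the paper, whose details are not given in the abstract. The function names (`sse`, `split_loss`, `quant_bnb_stump`), the `n_bins` parameter, and the particular lower bound are all assumptions made for illustration. The lower bound uses the fact that, for any threshold inside a range of candidates, the points below the range certainly go left and the points above it certainly go right, and that the squared error of a leaf cannot decrease when points are added to it.

```python
import numpy as np


def sse(y):
    """Sum of squared errors around the mean (0 for an empty leaf)."""
    return float(((y - y.mean()) ** 2).sum()) if y.size else 0.0


def split_loss(x, y, t):
    """Loss of the split that sends x <= t to the left leaf."""
    left = x <= t
    return sse(y[left]) + sse(y[~left])


def quant_bnb_stump(x, y, n_bins=4):
    """Illustrative branch-and-bound over thresholds (assumes n_bins >= 2)."""
    cands = np.unique(x)  # sorted candidate thresholds
    best_t, best_loss = None, np.inf
    stack = [(0, len(cands) - 1)]  # index ranges into cands still to explore
    while stack:
        lo, hi = stack.pop()
        # Upper bound: evaluate the two endpoint thresholds exactly.
        for t in (cands[lo], cands[hi]):
            loss = split_loss(x, y, t)
            if loss < best_loss:
                best_t, best_loss = t, loss
        # Lower bound for every threshold in cands[lo..hi]: keep only the
        # points whose side is already determined and drop the ambiguous ones.
        lower = sse(y[x <= cands[lo]]) + sse(y[x > cands[hi]])
        if lower >= best_loss or hi - lo <= 1:
            continue  # pruned, or nothing left between the endpoints
        # Refine: cut the range at equally spaced ranks (empirical quantiles
        # of the distinct feature values) and explore each sub-range.
        edges = np.unique(np.linspace(lo, hi, n_bins + 1).astype(int))
        stack.extend(zip(edges[:-1], edges[1:]))
    return float(best_t), best_loss
```

Because a sub-range is discarded only when its lower bound is no better than the best loss found so far, the returned loss matches an exhaustive scan over all candidate thresholds, while ranges far from the optimum are pruned without evaluating every threshold in them.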


