Constructing classification trees using column generation

10/15/2018
by   Murat Firat, et al.
0

This paper explores the use of Column Generation (CG) techniques in constructing univariate binary decision trees for classification tasks. We propose a novel Integer Linear Programming (ILP) formulation, based on paths in decision trees. We show that the associated pricing problem is NP-hard and propose a random procedure for column selection. In addition, to speed up column generation, we use a restricted parameter set via a sampling procedure using the well-known CART algorithm. Extensive numerical experiments show that our approach outperforms the state-of-the-art ILP-based algorithms in the recent literature both in computation time and solution quality. We also find better solutions that have higher training and testing accuracy than an optimized version of CART. Furthermore, our approach is capable of handling big data sets with tens of thousands of data rows, unlike other ILP-based algorithms. In addition, our approach has the advantage of being able to easily incorporate different objectives.

READ FULL TEXT
research
08/22/2023

Revisiting column-generation-based matheuristic for learning classification trees

Decision trees are highly interpretable models for solving classificatio...
research
10/03/2021

Graph Generation: A New Approach to Solving Expanded Linear Programming Relaxations

In this article we introduce Graph Generation, an enhanced Column Genera...
research
12/14/2019

A column-generation-based matheuristic for periodic train timetabling with integrated passenger routing

In this study, the periodic train timetabling problem is formulated usin...
research
11/06/2020

A Scalable MIP-based Method for Learning Optimal Multivariate Decision Trees

Several recent publications report advances in training optimal decision...
research
07/03/2019

Optimal Decision Trees for the Algorithm Selection Problem: Integer Programming Based Approaches

Even though it is well known that for most relevant computational proble...
research
07/20/2022

Constrained Prescriptive Trees via Column Generation

With the abundance of available data, many enterprises seek to implement...
research
05/14/2021

Minimal Cycle Representatives in Persistent Homology using Linear Programming: an Empirical Study with User's Guide

Cycle representatives of persistent homology classes can be used to prov...

Please sign up or login with your details

Forgot password? Click here to reset