Tightness of prescriptive tree-based mixed-integer optimization formulations

02/28/2023
by Max Biggs, et al.

We study how to model the relationship between an input feature vector and the predicted outcome of a trained decision tree using mixed-integer optimization. Such models arise in many practical applications where a decision tree or tree ensemble is embedded in an optimization problem to predict the outcome of a decision. We propose mixed-integer optimization formulations that are tighter than those previously introduced. Existing formulations can be shown to have linear relaxations with fractional extreme points, even in the simple case of a single decision tree. One formulation we propose, based on a projected union of polyhedra approach, is ideal for a single decision tree. Although this formulation is generally not ideal for tree ensembles or when additional constraints are added, it generally has fewer extreme points, which leads to faster solve times, particularly when the ensemble has relatively few trees. However, previous work has shown that formulations based on a binary representation of the feature vector perform well computationally and are therefore attractive in practical applications. We present multiple approaches to tighten existing formulations with binary vectors, and show that fractional extreme points are removed when there are multiple splits on the same feature. In the extreme case, we prove that this yields ideal formulations for tree ensembles modeling a one-dimensional feature vector. Building on this result, we also show via numerical simulations that these additional constraints produce significantly tighter linear relaxations when the feature vector is low dimensional. We also present instances where the time to solve to optimality is significantly improved using these formulations.
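To make the "union of polyhedra" view of a single tree concrete, the following is a minimal sketch, not the formulation proposed in the paper. It assumes a hypothetical toy tree (the `TREE` dictionary, its thresholds, and its leaf values are invented for illustration) and enumerates the axis-aligned box associated with each leaf. Every feature vector falls in exactly one box, and the prediction is the value attached to that box; a mixed-integer formulation has to encode exactly this disjunction.

```python
import math

# Hypothetical toy tree (illustrative only): an internal node sends x to the
# left child when x[feature] <= threshold, otherwise to the right child.
TREE = {
    "feature": 0, "threshold": 0.5,
    "left": {
        "feature": 1, "threshold": 0.3,
        "left": {"value": 1.0},
        "right": {"value": 2.0},
    },
    "right": {"value": 3.0},
}


def predict(node, x):
    """Evaluate the tree by walking from the root to a leaf."""
    while "value" not in node:
        go_left = x[node["feature"]] <= node["threshold"]
        node = node["left"] if go_left else node["right"]
    return node["value"]


def leaf_boxes(node, n_features, lower=None, upper=None):
    """List (lower, upper, value) for each leaf.

    A leaf's region is the box lower[i] < x[i] <= upper[i] for every feature i.
    """
    lower = [-math.inf] * n_features if lower is None else lower
    upper = [math.inf] * n_features if upper is None else upper
    if "value" in node:
        return [(tuple(lower), tuple(upper), node["value"])]
    f, t = node["feature"], node["threshold"]
    # Left branch tightens the upper bound on feature f.
    left_upper = list(upper)
    left_upper[f] = min(upper[f], t)
    # Right branch tightens the lower bound on feature f.
    right_lower = list(lower)
    right_lower[f] = max(lower[f], t)
    return (leaf_boxes(node["left"], n_features, list(lower), left_upper)
            + leaf_boxes(node["right"], n_features, right_lower, list(upper)))


def predict_from_boxes(boxes, x):
    """Evaluate the tree as a disjunction: find the unique box containing x."""
    values = [v for lo, hi, v in boxes
              if all(l < xi <= h for l, xi, h in zip(lo, x, hi))]
    if len(values) != 1:
        raise ValueError("x must lie in exactly one leaf box")
    return values[0]


BOXES = leaf_boxes(TREE, n_features=2)
```

Calling `predict_from_boxes(BOXES, x)` returns the same value as `predict(TREE, x)` for any two-dimensional `x`. Note that the boxes here are half-open so that they partition the feature space; optimization formulations typically work with closed polyhedra, so this sketch only illustrates the geometry and says nothing about tightness.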


