Fast Optimization of Weighted Sparse Decision Trees for use in Optimal Treatment Regimes and Optimal Policy Design

10/13/2022
by   Ali Behrouz, et al.
7

Sparse decision trees are one of the most common forms of interpretable models. While recent advances have produced algorithms that fully optimize sparse decision trees for prediction, that work does not address policy design, because the algorithms cannot handle weighted data samples. Specifically, they rely on the discreteness of the loss function, which means that real-valued weights cannot be directly used. For example, none of the existing techniques produce policies that incorporate inverse propensity weighting on individual data points. We present three algorithms for efficient sparse weighted decision tree optimization. The first approach directly optimizes the weighted loss function; however, it tends to be computationally inefficient for large datasets. Our second approach, which scales more efficiently, transforms weights to integer values and uses data duplication to transform the weighted decision tree optimization problem into an unweighted (but larger) counterpart. Our third algorithm, which scales to much larger datasets, uses a randomized procedure that samples each data point with a probability proportional to its weight. We present theoretical bounds on the error of the two fast methods and show experimentally that these methods can be two orders of magnitude faster than the direct optimization of the weighted loss, without losing significant accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/15/2020

Generalized Optimal Sparse Decision Trees

Decision tree optimization is notoriously difficult from a computational...
research
12/01/2021

How Smart Guessing Strategies Can Yield Massive Scalability Improvements for Sparse Decision Tree Optimization

Sparse decision tree optimization has been one of the most fundamental p...
research
02/29/2020

Decision Trees for Decision-Making under the Predict-then-Optimize Framework

We consider the use of decision trees for decision-making problems under...
research
12/03/2020

Optimal Policy Trees

We propose an approach for learning optimal tree-based prescription poli...
research
09/23/2020

Randomized fast no-loss expert system to play tic tac toe like a human

This paper introduces a blazingly fast, no-loss expert system for Tic Ta...
research
12/09/2021

On multivariate randomized classification trees: l_0-based sparsity, VC dimension and decomposition methods

Decision trees are widely-used classification and regression models beca...
research
04/13/2021

Deep imagination is a close to optimal policy for planning in large decision trees under limited resources

Many decisions involve choosing an uncertain course of actions in deep a...

Please sign up or login with your details

Forgot password? Click here to reset