Adaptively Pruning Features for Boosted Decision Trees

05/19/2018
by   Maryam Aziz, et al.
0

Boosted decision trees enjoy popularity in a variety of applications; however, for large-scale datasets, the cost of training a decision tree in each round can be prohibitively expensive. Inspired by ideas from the multi-arm bandit literature, we develop a highly efficient algorithm for computing exact greedy-optimal decision trees, outperforming the state-of-the-art Quick Boost method. We further develop a framework for deriving lower bounds on the problem that applies to a wide family of conceivable algorithms for the task (including our algorithm and Quick Boost), and we demonstrate empirically on a wide variety of data sets that our algorithm is near-optimal within this family of algorithms. We also derive a lower bound applicable to any algorithm solving the task, and we demonstrate that our algorithm empirically achieves performance close to this best-achievable lower bound.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/01/2021

Properly learning decision trees in almost polynomial time

We give an n^O(loglog n)-time membership query algorithm for properly an...
research
10/12/2022

Superpolynomial Lower Bounds for Decision Tree Learning and Testing

We establish new hardness results for decision tree optimization problem...
research
06/15/2020

Generalized Optimal Sparse Decision Trees

Decision tree optimization is notoriously difficult from a computational...
research
10/18/2021

A cautionary tale on fitting decision trees to data from additive models: generalization lower bounds

Decision trees are important both as interpretable models amenable to hi...
research
02/15/2021

Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent

Decision trees provide a rich family of highly non-linear but efficient ...
research
07/11/2019

On the Optimality of Trees Generated by ID3

Since its inception in the 1980s, ID3 has become one of the most success...
research
09/12/2018

Benchmarking and Optimization of Gradient Boosted Decision Tree Algorithms

Gradient boosted decision trees (GBDTs) have seen widespread adoption in...

Please sign up or login with your details

Forgot password? Click here to reset