Attention augmented differentiable forest for tabular data

10/02/2020
by   Yingshi Chen, et al.

A differentiable forest is an ensemble of fully differentiable decision trees. Its simple tree structure is easy to use and explain, and because every component is differentiable it can be trained end to end with gradient-based optimization. In this paper, we propose the tree attention block (TAB) within the differentiable forest framework. A TAB has two operations, squeeze and regulate: the squeeze operation extracts the characteristics of each tree, and the regulate operation learns nonlinear relations between the trees. The TAB thereby learns the importance of each tree and adjusts its weight to improve accuracy. Our experiments on large tabular datasets show that the attention-augmented differentiable forest achieves accuracy comparable to gradient boosted decision trees (GBDT), the state-of-the-art algorithm for tabular data, and on some datasets it is more accurate than the best GBDT libraries (LightGBM, CatBoost, and XGBoost). The differentiable forest supports mini-batch training with a batch size much smaller than the training set, so on larger datasets its memory usage is much lower than that of GBDT models. The source code is available at https://github.com/closest-git/QuantumForest.
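To make the squeeze and regulate operations concrete, here is a minimal NumPy sketch. It is not the authors' implementation. The abstract does not specify either operation, so this sketch assumes a squeeze-and-excitation-style design: squeeze is taken to be the mean response of each tree over the mini-batch, and regulate is taken to be a small two-layer network ending in a sigmoid. The function name `tree_attention`, the parameters `w1`, `b1`, `w2`, `b2`, and all shapes are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def tree_attention(tree_out, w1, b1, w2, b2):
    """Illustrative tree attention block (assumed design, not from the paper).

    tree_out: array of shape (batch, n_trees), one prediction per tree.
    Returns the weighted forest output (batch,) and the tree weights (n_trees,).
    """
    # Squeeze (assumption): summarize each tree by its mean response over the batch.
    z = tree_out.mean(axis=0)                      # (n_trees,)
    # Regulate (assumption): bottleneck MLP that mixes information across trees.
    h = np.maximum(0.0, z @ w1 + b1)               # (hidden,), ReLU
    weights = sigmoid(h @ w2 + b2)                 # (n_trees,), each in (0, 1)
    # Reweight the trees and normalize so the output is a weighted average.
    forest_out = (tree_out * weights).sum(axis=1) / weights.sum()
    return forest_out, weights

# Hypothetical sizes and random inputs, for illustration only.
rng = np.random.default_rng(0)
batch, n_trees, hidden = 8, 16, 4
tree_out = rng.normal(size=(batch, n_trees))
w1 = rng.normal(scale=0.1, size=(n_trees, hidden))
b1 = np.zeros(hidden)
w2 = rng.normal(scale=0.1, size=(hidden, n_trees))
b2 = np.zeros(n_trees)

forest_out, weights = tree_attention(tree_out, w1, b1, w2, b2)
```

In this sketch, setting `w2` and `b2` to zero gives every tree the same weight, so the output reduces to the plain average over trees. Training the regulate parameters by gradient descent is what would let the forest favor some trees over others.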


research
02/29/2020

Deep differentiable forest with sparse attention for the tabular data

We present a general architecture of deep differentiable forest and its ...
research
01/26/2020

LiteMORT: A memory efficient gradient boosting tree system on adaptive compact distributions

Gradient boosted decision trees (GBDT) is the leading algorithm for many...
research
04/07/2020

Learning Unsplit-field-based PML for the FDTD Method by Deep Differentiable Forest

Alternative unsplit-filed-based absorbing boundary condition (ABC) compu...
research
09/05/2011

Learning Nonlinear Functions Using Regularized Greedy Forest

We consider the problem of learning a forest of nonlinear decision rules...
research
10/27/2020

A short note on the decision tree based neural turing machine

Turing machine and decision tree have developed independently for a long...
research
10/16/2022

Positive-Unlabeled Learning using Random Forests via Recursive Greedy Risk Minimization

The need to learn from positive and unlabeled data, or PU learning, aris...
research
06/05/2021

Making CNNs Interpretable by Building Dynamic Sequential Decision Forests with Top-down Hierarchy Learning

In this paper, we propose a generic model transfer scheme to make Convlu...
