Tabular Data: Deep Learning is Not All You Need

06/06/2021
by   Ravid Shwartz-Ziv, et al.
0

A key element of AutoML systems is setting the types of models that will be used for each type of task. For classification and regression problems with tabular data, the use of tree ensemble models (like XGBoost) is usually recommended. However, several deep learning models for tabular data have recently been proposed, claiming to outperform XGBoost for some use-cases. In this paper, we explore whether these deep models should be a recommended option for tabular data, by rigorously comparing the new deep models to XGBoost on a variety of datasets. In addition to systematically comparing their accuracy, we consider the tuning and computation they require. Our study shows that XGBoost outperforms these deep models across the datasets, including datasets used in the papers that proposed the deep models. We also demonstrate that XGBoost requires much less tuning. On the positive side, we show that an ensemble of the deep models and XGBoost performs better on these datasets than XGBoost alone.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/13/2018

Are generative deep models for novelty detection truly better?

Many deep models have been recently proposed for anomaly detection. This...
research
07/11/2020

Deep or Simple Models for Semantic Tagging? It Depends on your Data [Experiments]

Semantic tagging, which has extensive applications in text mining, predi...
research
09/21/2023

Precision in Building Extraction: Comparing Shallow and Deep Models using LiDAR Data

Building segmentation is essential in infrastructure development, popula...
research
05/31/2019

Knowledge-augmented Column Networks: Guiding Deep Learning with Advice

Recently, deep models have had considerable success in several tasks, es...
research
05/25/2022

Residual-Concatenate Neural Network with Deep Regularization Layers for Binary Classification

Many complex Deep Learning models are used with different variations for...
research
04/15/2019

Human-Guided Learning of Column Networks: Augmenting Deep Learning with Advice

Recently, deep models have been successfully applied in several applicat...
research
11/16/2017

Beyond Sparsity: Tree Regularization of Deep Models for Interpretability

The lack of interpretability remains a key barrier to the adoption of de...

Please sign up or login with your details

Forgot password? Click here to reset