More layers! End-to-end regression and uncertainty on tabular data with deep learning

12/07/2021
by   Ivan Bondarenko, et al.
0

This paper attempts to analyze the effectiveness of deep learning for tabular data processing. It is believed that decision trees and their ensembles is the leading method in this domain, and deep neural networks must be content with computer vision and so on. But the deep neural network is a framework for building gradient-based hierarchical representations, and this key feature should be able to provide the best processing of generic structured (tabular) data, not just image matrices and audio spectrograms. This problem is considered through the prism of the Weather Prediction track in the Yandex Shifts challenge (in other words, the Yandex Shifts Weather task). This task is a variant of the classical tabular data regression problem. It is also connected with another important problem: generalization and uncertainty in machine learning. This paper proposes an end-to-end algorithm for solving the problem of regression with uncertainty on tabular data, which is based on the combination of four ideas: 1) deep ensemble of self-normalizing neural networks, 2) regression as parameter estimation of the Gaussian target error distribution, 3) hierarchical multitask learning, and 4) simple data preprocessing. Three modifications of the proposed algorithm form the top-3 leaderboard of the Yandex Shifts Weather challenge respectively. This paper considers that this success has occurred due to the fundamental properties of the deep learning algorithm, and tries to prove this.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/13/2019

Neural Oblivious Decision Ensembles for Deep Learning on Tabular Data

Nowadays, deep neural networks (DNNs) have become the main instrument fo...
research
10/29/2018

Principled Uncertainty Estimation for Deep Neural Networks

When the cost of misclassifying a sample is high, it is useful to have a...
research
05/31/2018

Multi-Layered Gradient Boosting Decision Trees

Multi-layered representation is believed to be the key ingredient of dee...
research
12/06/2022

Domain Generalization Strategy to Train Classifiers Robust to Spatial-Temporal Shift

Deep learning-based weather prediction models have advanced significantl...
research
06/29/2020

Deep Ordinal Regression with Label Diversity

Regression via classification (RvC) is a common method used for regressi...
research
06/27/2019

Hierarchical Data Reduction and Learning

Paper proposes a hierarchical learning strategy for generation of sparse...
research
08/27/2018

Gradient-based Training of Slow Feature Analysis by Differentiable Approximate Whitening

This paper proposes Power Slow Feature Analysis, a gradient-based method...

Please sign up or login with your details

Forgot password? Click here to reset