Between-Sample Relationship in Learning Tabular Data Using Graph and Attention Networks

06/11/2023
by   Shourav B. Rabbani, et al.
0

Traditional machine learning assumes samples in tabular data to be independent and identically distributed (i.i.d). This assumption may miss useful information within and between sample relationships in representation learning. This paper relaxes the i.i.d assumption to learn tabular data representations by incorporating between-sample relationships for the first time using graph neural networks (GNN). We investigate our hypothesis using several GNNs and state-of-the-art (SOTA) deep attention models to learn the between-sample relationship on ten tabular data sets by comparing them to traditional machine learning methods. GNN methods show the best performance on tabular data with large feature-to-sample ratios. Our results reveal that attention-based GNN methods outperform traditional machine learning on five data sets and SOTA deep tabular learning methods on three data sets. Between-sample learning via GNN and deep attention methods yield the best classification accuracy on seven of the ten data sets. This suggests that the i.i.d assumption may not always hold for most tabular data sets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/04/2023

Towards Deep Attention in Graph Neural Networks: Problems and Remedies

Graph neural networks (GNNs) learn the representation of graph-structure...
research
05/01/2023

Attention-based Spatial-Temporal Graph Neural ODE for Traffic Prediction

Traffic forecasting is an important issue in intelligent traffic systems...
research
03/28/2022

DAMNETS: A Deep Autoregressive Model for Generating Markovian Network Time Series

In this work, we introduce DAMNETS, a deep generative model for Markovia...
research
10/26/2019

Understanding Isomorphism Bias in Graph Data Sets

In recent years there has been a rapid increase in classification method...
research
11/26/2019

Adventures in Multi-Omics I: Combining heterogeneous data sets via relationships matrices

In this article, we propose a covariance based method for combining impa...
research
09/08/2021

Multiscale Laplacian Learning

Machine learning methods have greatly changed science, engineering, fina...
research
01/10/2022

Wind Park Power Prediction: Attention-Based Graph Networks and Deep Learning to Capture Wake Losses

With the increased penetration of wind energy into the power grid, it ha...

Please sign up or login with your details

Forgot password? Click here to reset