Neural Network Architecture for Database Augmentation Using Shared Features

02/02/2023
by   William C. Sleeman IV, et al.
0

The popularity of learning from data with machine learning and neural networks has lead to the creation of many new datasets for almost every problem domain. However, even within a single domain, these datasets are often collected with disparate features, sampled from different sub-populations, and recorded at different time points. Even with the plethora of individual datasets, large data science projects can be difficult as it is often not trivial to merge these smaller datasets. Inherent challenges in some domains such as medicine also makes it very difficult to create large single source datasets or multi-source datasets with identical features. Instead of trying to merge these non-matching datasets directly, we propose a neural network architecture that can provide data augmentation using features common between these datasets. Our results show that this style of data augmentation can work for both image and tabular data.

READ FULL TEXT

page 5

page 8

page 13

page 16

research
07/20/2021

A Bayesian Approach to Invariant Deep Neural Networks

We propose a novel Bayesian neural network architecture that can learn i...
research
08/19/2022

Predicting Exotic Hadron Masses with Data Augmentation Using Multilayer Perceptron

Recently, there have been significant developments in neural networks; t...
research
03/03/2022

Data Augmentation as Feature Manipulation: a story of desert cows and grass cows

Data augmentation is a cornerstone of the machine learning pipeline, yet...
research
10/11/2018

Perfusion parameter estimation using neural networks and data augmentation

Perfusion imaging plays a crucial role in acute stroke diagnosis and tre...
research
07/21/2022

Auto Machine Learning for Medical Image Analysis by Unifying the Search on Data Augmentation and Neural Architecture

Automated data augmentation, which aims at engineering augmentation poli...
research
08/06/2020

On the Accuracy of CRNNs for Line-Based OCR: A Multi-Parameter Evaluation

We investigate how to train a high quality optical character recognition...
research
06/05/2019

On the use of Pairwise Distance Learning for Brain Signal Classification with Limited Observations

The increasing access to brain signal data using electroencephalography ...

Please sign up or login with your details

Forgot password? Click here to reset