Joining datasets via data augmentation in the label space for neural networks

06/17/2021
by   Jake Zhao, et al.
0

Most, if not all, modern deep learning systems restrict themselves to a single dataset for neural network training and inference. In this article, we are interested in systematic ways to join datasets that are made of similar purposes. Unlike previous published works that ubiquitously conduct the dataset joining in the uninterpretable latent vectorial space, the core to our method is an augmentation procedure in the label space. The primary challenge to address the label space for dataset joining is the discrepancy between labels: non-overlapping label annotation sets, different labeling granularity or hierarchy and etc. Notably we propose a new technique leveraging artificially created knowledge graph, recurrent neural networks and policy gradient that successfully achieve the dataset joining in the label space. Empirical results on both image and text classification justify the validity of our approach.

READ FULL TEXT
research
07/12/2021

Fine-Grained AutoAugmentation for Multi-Label Classification

Data augmentation is a commonly used approach to improving the generaliz...
research
02/25/2020

On Feature Normalization and Data Augmentation

Modern neural network training relies heavily on data augmentation for i...
research
01/14/2016

Improved Relation Classification by Deep Recurrent Neural Networks with Data Augmentation

Nowadays, neural networks play an important role in the task of relation...
research
07/15/2019

AugLabel: Exploiting Word Representations to Augment Labels for Face Attribute Classification

Augmenting data in image space (eg. flipping, cropping etc) and activati...
research
04/20/2023

LA3: Efficient Label-Aware AutoAugment

Automated augmentation is an emerging and effective technique to search ...
research
03/11/2020

Stateful Premise Selection by Recurrent Neural Networks

In this work, we develop a new learning-based method for selecting facts...
research
06/07/2021

MixRL: Data Mixing Augmentation for Regression using Reinforcement Learning

Data augmentation is becoming essential for improving regression accurac...

Please sign up or login with your details

Forgot password? Click here to reset