MixRL: Data Mixing Augmentation for Regression using Reinforcement Learning

06/07/2021
by   Seong-Hyeon Hwang, et al.

Data augmentation is becoming essential for improving regression accuracy in critical applications including manufacturing and finance. Existing techniques for data augmentation largely focus on classification tasks and do not readily apply to regression tasks. In particular, the recent Mixup techniques for classification rely on the key assumption that linearity holds among training examples, which is reasonable if the label space is discrete, but has limitations when the label space is continuous as in regression. We show that mixing examples that have either a large data distance or a large label distance may have an increasingly negative effect on model performance. Hence, we use the stricter assumption that, for regression, linearity only holds within certain data or label distances, where the degree may vary for each example. We then propose MixRL, a data augmentation meta-learning framework for regression that uses a small validation set to learn, for each example, how many nearest neighbors it should be mixed with for the best model performance. MixRL achieves these objectives using Monte Carlo policy gradient reinforcement learning. Our experiments on both synthetic and real datasets show that MixRL significantly outperforms state-of-the-art data augmentation baselines. MixRL can also be integrated with other classification Mixup techniques for better results.
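
To make the idea of distance-limited mixing concrete, here is a minimal sketch of Mixup restricted to each example's k nearest neighbors. It is not the authors' implementation: the function name knn_mixup, the use of Euclidean distance in data space only, the Beta(alpha, alpha) mixing weight, and the hand-supplied k_per_example array are all assumptions made for illustration. In MixRL the per-example neighbor count would be learned by the policy rather than passed in.

```python
import numpy as np

def knn_mixup(X, y, k_per_example, alpha=2.0, rng=None):
    """Mix each example with one random partner among its k nearest neighbors.

    X: (n, d) features; y: (n,) continuous labels.
    k_per_example: (n,) ints, the neighbor budget per example (hypothetical
    input; a value of 0 means the example is left unmixed).
    alpha: Beta(alpha, alpha) parameter for the mixing weight (assumed value).
    """
    rng = np.random.default_rng() if rng is None else rng
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n = len(X)

    # Pairwise Euclidean distances in data space; exclude self-matches.
    dists = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    np.fill_diagonal(dists, np.inf)
    order = np.argsort(dists, axis=1)  # neighbors sorted nearest first

    X_mix = np.empty_like(X)
    y_mix = np.empty_like(y)
    for i in range(n):
        k = min(int(k_per_example[i]), n - 1)
        if k <= 0:
            # No mixing allowed for this example: keep it as is.
            X_mix[i], y_mix[i] = X[i], y[i]
            continue
        j = order[i, rng.integers(k)]   # partner among the k nearest
        lam = rng.beta(alpha, alpha)    # mixing weight in (0, 1)
        # Linear interpolation of both features and labels (Mixup).
        X_mix[i] = lam * X[i] + (1 - lam) * X[j]
        y_mix[i] = lam * y[i] + (1 - lam) * y[j]
    return X_mix, y_mix
```

With a small budget, an example is only interpolated with close neighbors, where the linearity assumption is more plausible; with a budget of 0 it is left untouched. A variant could rank neighbors by label distance instead of data distance.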

research
05/29/2018

Improved Mixed-Example Data Augmentation

In order to reduce overfitting, neural networks are typically trained wi...
research
07/12/2021

Fine-Grained AutoAugmentation for Multi-Label Classification

Data augmentation is a commonly used approach to improving the generaliz...
research
08/05/2023

MiAMix: Enhancing Image Classification through a Multi-stage Augmented Mixed Sample Data Augmentation Method

Despite substantial progress in the field of deep learning, overfitting ...
research
09/07/2018

Learning Invariances for Policy Generalization

While recent progress has spawned very powerful machine learning systems...
research
06/16/2021

ParticleAugment: Sampling-Based Data Augmentation

We present an automated data augmentation approach for image classificat...
research
07/12/2018

Hydranet: Data Augmentation for Regression Neural Networks

Despite recent efforts, deep learning techniques remain often heavily de...
research
06/17/2021

Joining datasets via data augmentation in the label space for neural networks

Most, if not all, modern deep learning systems restrict themselves to a ...
