MetaMixUp: Learning Adaptive Interpolation Policy of MixUp with Meta-Learning

08/27/2019
by Zhijun Mai, et al.

MixUp is an effective data augmentation method that regularizes deep neural networks via random linear interpolations between pairs of samples and their labels. It plays an important role in model regularization, semi-supervised learning, and domain adaptation. However, despite its empirical success, the drawback of mixing samples at random has been poorly studied. Since deep networks are capable of memorizing the entire dataset, the corrupted samples generated by vanilla MixUp with a badly chosen interpolation policy will degrade the performance of networks. To overcome the underfitting caused by corrupted samples, and inspired by meta-learning (learning to learn), we propose a novel technique for learning to mix up, namely MetaMixUp. Unlike vanilla MixUp, which samples the interpolation policy from a predefined distribution, this paper introduces a meta-learning-based online optimization approach that dynamically learns the interpolation policy in a data-adaptive way. Validation set performance, obtained via meta-learning, captures the underfitting issue and provides more information for refining the interpolation policy. Furthermore, we adapt our method to pseudo-label-based semi-supervised learning (SSL), along with a refined pseudo-labeling strategy. In our experiments, our method achieves better performance than vanilla MixUp and its variants under the supervised learning configuration. In particular, extensive experiments show that our MetaMixUp-adapted SSL greatly outperforms MixUp and many state-of-the-art methods on the CIFAR-10 and SVHN benchmarks under the SSL configuration.
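For readers unfamiliar with the baseline, the vanilla MixUp step described above can be sketched as follows. This is a minimal illustration, not the authors' code: the function names are hypothetical, samples are plain Python lists, and the Beta(alpha, alpha) sampling of the interpolation weight follows the original MixUp formulation rather than anything stated in this abstract. MetaMixUp would replace the fixed sampling in `sample_lambda` with a policy learned from validation performance, which this sketch does not attempt.

```python
import random


def mixup_pair(x_i, y_i, x_j, y_j, lam):
    """Linearly interpolate two samples and their one-hot labels with weight lam."""
    x = [lam * a + (1.0 - lam) * b for a, b in zip(x_i, x_j)]
    y = [lam * a + (1.0 - lam) * b for a, b in zip(y_i, y_j)]
    return x, y


def sample_lambda(alpha=1.0, rng=random):
    """Vanilla policy (assumption: Beta(alpha, alpha), as in the original MixUp)."""
    return rng.betavariate(alpha, alpha)


def mixup_batch(xs, ys, alpha=1.0, rng=random):
    """Mix every sample with a randomly chosen partner from the same batch."""
    lam = sample_lambda(alpha, rng)
    perm = list(range(len(xs)))
    rng.shuffle(perm)  # random pairing of samples within the batch
    mixed = [mixup_pair(xs[i], ys[i], xs[j], ys[j], lam)
             for i, j in enumerate(perm)]
    return [m[0] for m in mixed], [m[1] for m in mixed], lam
```

With a fixed weight of 0.25, mixing the sample `[1.0, 0.0]` with `[0.0, 1.0]` gives `[0.25, 0.75]`, and the labels are blended in the same proportion. A badly chosen weight is what the abstract refers to as a corrupted sample.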

