Dreaming More Data: Class-dependent Distributions over Diffeomorphisms for Learned Data Augmentation

10/09/2015
by   Søren Hauberg, et al.
0

Data augmentation is a key element in training high-dimensional models. In this approach, one synthesizes new observations by applying pre-specified transformations to the original training data; e.g. new images are formed by rotating old ones. Current augmentation schemes, however, rely on manual specification of the applied transformations, making data augmentation an implicit form of feature engineering. With an eye towards true end-to-end learning, we suggest learning the applied transformations on a per-class basis. Particularly, we align image pairs within each class under the assumption that the spatial transformation between images belongs to a large class of diffeomorphisms. We then learn a class-specific probabilistic generative models of the transformations in a Riemannian submanifold of the Lie group of diffeomorphisms. We demonstrate significant performance improvements in training deep neural nets over manually-specified augmentation schemes. Our code and augmented datasets are available online.

READ FULL TEXT

page 3

page 4

page 5

page 7

research
09/21/2019

Adversarial Learning of General Transformations for Data Augmentation

Data augmentation (DA) is fundamental against overfitting in large convo...
research
04/07/2020

Probabilistic Spatial Transformers for Bayesian Data Augmentation

High-capacity models require vast amounts of data, and data augmentation...
research
09/06/2017

Learning to Compose Domain-Specific Transformations for Data Augmentation

Data augmentation is a ubiquitous technique for increasing the size of l...
research
06/07/2021

Rotating spiders and reflecting dogs: a class conditional approach to learning data augmentation distributions

Building invariance to non-meaningful transformations is essential to bu...
research
05/05/2021

Rethinking Ultrasound Augmentation: A Physics-Inspired Approach

Medical Ultrasound (US), despite its wide use, is characterized by artif...
research
02/07/2020

Data augmentation with Möbius transformations

Data augmentation has led to substantial improvements in the performance...
research
06/25/2021

CADDA: Class-wise Automatic Differentiable Data Augmentation for EEG Signals

Data augmentation is a key element of deep learning pipelines, as it inf...

Please sign up or login with your details

Forgot password? Click here to reset