Albumentations: fast and flexible image augmentations

09/18/2018
by   Alexander Buslaev, et al.
0

Data augmentation is a commonly used technique for increasing both the size and the diversity of labeled training sets by leveraging input transformations that preserve output labels. In computer vision domain, image augmentations have become a common implicit regularization technique to combat overfitting in deep convolutional neural networks and are ubiquitously used to improve performance. While most deep learning frameworks implement basic image transformations, the list is typically limited to some variations and combinations of flipping, rotating, scaling, and cropping. Moreover, the image processing speed varies in existing tools for image augmentation. We present Albumentations, a fast and flexible library for image augmentations with many various image transform operations available, that is also an easy-to-use wrapper around other augmentation libraries. We provide examples of image augmentations for different computer vision tasks and show that Albumentations is faster than other commonly used image augmentation tools on the most of commonly used image transformations. The source code for Albumentations is made publicly available online at https://github.com/albu/albumentations

READ FULL TEXT

page 2

page 3

research
07/30/2019

Safe Augmentation: Learning Task-Specific Transformations from Data

Data augmentation is widely used as a part of the training process appli...
research
03/13/2020

Random smooth gray value transformations for cross modality learning with gray value invariant networks

Random transformations are commonly used for augmentation of the trainin...
research
10/20/2021

Improving Model Generalization by Agreement of Learned Representations from Data Augmentation

Data augmentation reduces the generalization error by forcing a model to...
research
12/28/2020

Data augmentation and image understanding

Interdisciplinary research is often at the core of scientific progress. ...
research
07/17/2022

Performance degradation of ImageNet trained models by simple image transformations

ImageNet trained PyTorch models are generally preferred as the off-the-s...
research
05/03/2022

A Comprehensive Survey of Image Augmentation Techniques for Deep Learning

Deep learning has been achieving decent performance in computer vision r...
research
06/29/2023

Rapid-INR: Storage Efficient CPU-free DNN Training Using Implicit Neural Representation

Implicit Neural Representation (INR) is an innovative approach for repre...

Please sign up or login with your details

Forgot password? Click here to reset