Automated Image Data Preprocessing with Deep Reinforcement Learning

06/15/2018
by   Tran Ngoc Minh, et al.
2

Data preparation, i.e. the process of transforming raw data into a format that can be used for training effective machine learning models, is a tedious and time-consuming task. For image data, preprocessing typically involves a sequence of basic transformations such as cropping, filtering, rotating or flipping images. Currently, data scientists decide manually based on their experience which transformations to apply in which particular order to a given image data set. Besides constituting a bottleneck in real-world data science projects, manual image data preprocessing may yield suboptimal results as data scientists need to rely on intuition or trial-and-error approaches when exploring the space of possible image transformations and thus might not be able to discover the most effective ones. To mitigate the inefficiency and potential ineffectiveness of manual data preprocessing, this paper proposes a deep reinforcement learning framework to automatically discover the optimal data preprocessing steps for training an image classifier. The framework takes as input sets of labeled images and predefined preprocessing transformations. It jointly learns the classifier and the optimal preprocessing transformations for individual images. Experimental results show that the proposed approach not only improves the accuracy of image classifiers, but also makes them substantially more robust to noisy inputs at test time.

READ FULL TEXT

page 6

page 8

research
01/06/2022

Deep Learning Based Classification System For Recognizing Local Spinach

A deep learning model gives an incredible result for image processing by...
research
08/20/2023

DiffPrep: Differentiable Data Preprocessing Pipeline Search for Learning over Tabular Data

Data preprocessing is a crucial step in the machine learning process tha...
research
04/08/2020

Estimating Grape Yield on the Vine from Multiple Images

Estimating grape yield prior to harvest is important to commercial viney...
research
08/24/2018

Building a Robust Text Classifier on a Test-Time Budget

We propose a generic and interpretable learning framework for building r...
research
03/18/2021

Image Synthesis for Data Augmentation in Medical CT using Deep Reinforcement Learning

Deep learning has shown great promise for CT image reconstruction, in pa...
research
12/10/2019

Enhancing Learnability of classification algorithms using simple data preprocessing in fMRI scans of Alzheimer's disease

Alzheimer's Disease (AD) is the most common type of dementia. In all lea...
research
10/07/2019

Push it to the Limit: Discover Edge-Cases in Image Data with Autoencoders

In this paper, we focus on the problem of identifying semantic factors o...

Please sign up or login with your details

Forgot password? Click here to reset