Automatic Dataset Augmentation

08/28/2017
by   Yalong Bai, et al.
0

Large scale image dataset and deep convolutional neural network (DCNN) are two primary driving forces for the rapid progress made in generic object recognition tasks in recent years. While lots of network architectures have been continuously designed to pursue lower error rates, few efforts are devoted to enlarge existing datasets due to high labeling cost and unfair comparison issues. In this paper, we aim to achieve lower error rate by augmenting existing datasets in an automatic manner. Our method leverages both Web and DCNN, where Web provides massive images with rich contextual information, and DCNN replaces human to automatically label images under guidance of Web contextual information. Experiments show our method can automatically scale up existing datasets significantly from billions web pages with high accuracy, and significantly improve the performance on object recognition tasks by using the automatically augmented datasets, which demonstrates that more supervisory information has been automatically gathered from the Web. Both the dataset and models trained on the dataset are made publicly available.

READ FULL TEXT

page 4

page 5

page 6

page 10

research
03/06/2018

Learning Scene Gist with Convolutional Neural Networks to Improve Object Recognition

Advancements in convolutional neural networks (CNNs) have made significa...
research
12/19/2013

Using Web Co-occurrence Statistics for Improving Image Categorization

Object recognition and localization are important tasks in computer visi...
research
12/20/2014

Self-informed neural network structure learning

We study the problem of large scale, multi-label visual recognition with...
research
05/18/2020

Webpage Segmentation for Extracting Images and Their Surrounding Contextual Information

Web images come in hand with valuable contextual information. Although t...
research
05/03/2018

SdcNet: A Computation-Efficient CNN for Object Recognition

Extracting features from a huge amount of data for object recognition is...
research
04/29/2020

Image understanding and the web

The contextual information of Web images is investigated to address the ...
research
11/22/2017

Context Augmentation for Convolutional Neural Networks

Recent enhancements of deep convolutional neural networks (ConvNets) emp...

Please sign up or login with your details

Forgot password? Click here to reset