Learning Deep Visual Object Models From Noisy Web Data: How to Make it Work

02/28/2017
by   Nizar Massouh, et al.
0

Deep networks thrive when trained on large scale data collections. This has given ImageNet a central role in the development of deep architectures for visual object classification. However, ImageNet was created during a specific period in time, and as such it is prone to aging, as well as dataset bias issues. Moving beyond fixed training datasets will lead to more robust visual systems, especially when deployed on robots in new environments which must train on the objects they encounter there. To make this possible, it is important to break free from the need for manual annotators. Recent work has begun to investigate how to use the massive amount of images available on the Web in place of manual image annotations. We contribute to this research thread with two findings: (1) a study correlating a given level of noisily labels to the expected drop in accuracy, for two deep architectures, on two different types of noise, that clearly identifies GoogLeNet as a suitable architecture for learning from Web data; (2) a recipe for the creation of Web datasets with minimal noise and maximum visual variability, based on a visual and natural language processing concept expansion strategy. By combining these two results, we obtain a method for learning powerful deep object models automatically from the Web. We confirm the effectiveness of our approach through object categorization experiments using our Web-derived version of ImageNet on a popular robot vision benchmark database, and on a lifelong object discovery task on a mobile robot.

READ FULL TEXT

page 2

page 4

page 6

research
05/05/2017

Bridging between Computer and Robot Vision through Data Augmentation: a Case Study on Object Recognition

Despite the impressive progress brought by deep network in visual object...
research
11/30/2016

Attend in groups: a weakly-supervised deep learning framework for learning from web data

Large-scale datasets have driven the rapid development of deep neural ne...
research
08/09/2017

WebVision Database: Visual Learning and Understanding from Web Data

In this paper, we present a study on learning visual recognition models ...
research
06/09/2014

Training Convolutional Networks with Noisy Labels

The availability of large labeled datasets has allowed Convolutional Net...
research
12/21/2018

Learning from Web Data: the Benefit of Unsupervised Object Localization

Annotating a large number of training images is very time-consuming. In ...
research
05/30/2018

Robust Place Categorization with Deep Domain Generalization

Traditional place categorization approaches in robot vision assume that ...
research
10/16/2017

Pushing the envelope in deep visual recognition for mobile platforms

Image classification is the task of assigning to an input image a label ...

Please sign up or login with your details

Forgot password? Click here to reset