Multiple Instance Learning Convolutional Neural Networks for Object Recognition

10/11/2016
by   Miao Sun, et al.

Convolutional Neural Networks (CNNs) have been applied successfully in computer vision, speech recognition, and natural language processing. For object recognition, CNNs may be limited by their strict label requirement and by an implicit assumption that, for optimal solutions, images are dominated by the target object. However, the labeling procedure, which requires marking the locations of target objects, is very tedious, making high-quality large-scale datasets prohibitively expensive. Data augmentation schemes are widely used when deep networks suffer from insufficient training data. All the images produced through data augmentation share the same label, which may be problematic since not all data augmentation methods are label-preserving. In this paper, we propose a weakly supervised CNN framework named Multiple Instance Learning Convolutional Neural Networks (MILCNN) to solve this problem. We apply the MILCNN framework to object recognition and report state-of-the-art performance on three benchmark datasets: CIFAR10, CIFAR100, and the ILSVRC2015 classification dataset.
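
The abstract does not spell out how the multiple instance formulation is built, so the following is only a rough sketch of the general idea under stated assumptions, not the authors' method. It assumes that the augmented views of one image (here, random crops) form a "bag" that carries a single image-level label, and that per-class scores are max-pooled over the instances in the bag before a softmax, which is the classic multiple instance learning assumption. The function names `random_crops`, `bag_probabilities`, and `bag_loss` are hypothetical, and the per-instance scores stand in for the outputs of a CNN.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def random_crops(image, crop, n, rng):
    """Build a bag of n random crop-by-crop windows from a 2D list image.

    Assumption: each crop is one instance; some crops may miss the object,
    so they should not each be forced to carry the image label.
    """
    h, w = len(image), len(image[0])
    bag = []
    for _ in range(n):
        top = rng.randrange(h - crop + 1)
        left = rng.randrange(w - crop + 1)
        bag.append([row[left:left + crop] for row in image[top:top + crop]])
    return bag


def bag_probabilities(instance_scores):
    """Max-pool per-class scores over instances, then apply softmax.

    instance_scores[i][c] is the (hypothetical) CNN score of instance i
    for class c. Max pooling lets the best-matching instance speak for
    the whole bag.
    """
    num_classes = len(instance_scores[0])
    pooled = [max(inst[c] for inst in instance_scores)
              for c in range(num_classes)]
    return softmax(pooled)


def bag_loss(instance_scores, label):
    """Cross-entropy between the bag-level prediction and the bag label."""
    return -math.log(bag_probabilities(instance_scores)[label])


if __name__ == "__main__":
    rng = random.Random(0)
    image = [[r * 4 + c for c in range(4)] for r in range(4)]
    bag = random_crops(image, crop=2, n=3, rng=rng)
    print(len(bag), "instances of size", len(bag[0]), "x", len(bag[0][0]))

    # Two instances, two classes: only the second crop "sees" class 0.
    scores = [[0.0, 0.0], [5.0, 0.0]]
    print(bag_probabilities(scores), bag_loss(scores, label=0))
```

With this bag-level loss, a crop that misses the object does not have to be classified correctly on its own; only the pooled prediction is compared with the label. Other pooling choices (mean, log-sum-exp) are equally plausible readings of the abstract.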


research
05/05/2017

Bridging between Computer and Robot Vision through Data Augmentation: a Case Study on Object Recognition

Despite the impressive progress brought by deep network in visual object...
research
08/20/2017

Improving Deep Learning using Generic Data Augmentation

Deep artificial neural networks require a large corpus of training data ...
research
07/24/2015

Multimodal Deep Learning for Robust RGB-D Object Recognition

Robust object recognition is a crucial ingredient of many, if not all, r...
research
11/24/2014

Scale-Invariant Convolutional Neural Networks

Even though convolutional neural networks (CNN) has achieved near-human ...
research
02/25/2020

On Feature Normalization and Data Augmentation

Modern neural network training relies heavily on data augmentation for i...
research
04/20/2016

Automatic Graphic Logo Detection via Fast Region-based Convolutional Networks

Brand recognition is a very challenging topic with many useful applicati...
research
10/06/2020

Fast Mesh Data Augmentation via Chebyshev Polynomial of Spectral filtering

Deep neural networks have recently been recognized as one of the powerfu...
