CNN: Single-label to Multi-label

06/22/2014
by   Yunchao Wei, et al.
0

Convolutional Neural Network (CNN) has demonstrated promising performance in single-label image classification tasks. However, how CNN best copes with multi-label images still remains an open problem, mainly due to the complex underlying object layouts and insufficient multi-label training images. In this work, we propose a flexible deep CNN infrastructure, called Hypotheses-CNN-Pooling (HCP), where an arbitrary number of object segment hypotheses are taken as the inputs, then a shared CNN is connected with each hypothesis, and finally the CNN output results from different hypotheses are aggregated with max pooling to produce the ultimate multi-label predictions. Some unique characteristics of this flexible deep CNN infrastructure include: 1) no ground truth bounding box information is required for training; 2) the whole HCP infrastructure is robust to possibly noisy and/or redundant hypotheses; 3) no explicit hypothesis label is required; 4) the shared CNN may be well pre-trained with a large-scale single-label image dataset, e.g. ImageNet; and 5) it may naturally output multi-label prediction results. Experimental results on Pascal VOC2007 and VOC2012 multi-label image datasets well demonstrate the superiority of the proposed HCP infrastructure over other state-of-the-arts. In particular, the mAP reaches 84.2 after the fusion with our complementary result in [47] based on hand-crafted features on the VOC2012 dataset, which significantly outperforms the state-of-the-arts with a large margin of more than 7

READ FULL TEXT

page 2

page 3

page 4

page 8

page 9

page 12

research
01/15/2018

Multi-Label Learning from Medical Plain Text with Convolutional Residual Models

Predicting diagnoses from Electronic Health Records (EHRs) is an importa...
research
09/21/2017

Multi-label Pixelwise Classification for Reconstruction of Large-scale Urban Areas

Object classification is one of the many holy grails in computer vision ...
research
04/22/2015

Exploit Bounding Box Annotations for Multi-label Object Recognition

Convolutional neural networks (CNNs) have shown great performance as gen...
research
01/07/2019

Tencent ML-Images: A Large-Scale Multi-Label Image Database for Visual Representation Learning

In existing visual representation learning tasks, deep convolutional neu...
research
12/04/2016

Multi-Label Image Classification with Regional Latent Semantic Dependencies

Deep convolution neural networks (CNN) have demonstrated advanced perfor...
research
06/27/2012

Maximum Margin Output Coding

In this paper we study output coding for multi-label prediction. For a m...
research
11/19/2016

Inferring Restaurant Styles by Mining Crowd Sourced Photos from User-Review Websites

When looking for a restaurant online, user uploaded photos often give pe...

Please sign up or login with your details

Forgot password? Click here to reset