Multi-Evidence Filtering and Fusion for Multi-Label Classification, Object Detection and Semantic Segmentation Based on Weakly Supervised Learning

02/26/2018
by   Weifeng Ge, et al.
0

Supervised object detection and semantic segmentation require object or even pixel level annotations. When there exist image level labels only, it is challenging for weakly supervised algorithms to achieve accurate predictions. The accuracy achieved by top weakly supervised algorithms is still significantly lower than their fully supervised counterparts. In this paper, we propose a novel weakly supervised curriculum learning pipeline for multi-label object recognition, detection and semantic segmentation. In this pipeline, we first obtain intermediate object localization and pixel labeling results for the training images, and then use such results to train task-specific deep networks in a fully supervised manner. The entire process consists of four stages, including object localization in the training images, filtering and fusing object instances, pixel labeling for the training images, and task-specific network training. To obtain clean object instances in the training images, we propose a novel algorithm for filtering, fusing and classifying object instances collected from multiple solution mechanisms. In this algorithm, we incorporate both metric learning and density-based clustering to filter detected object instances. Experiments show that our weakly supervised pipeline achieves state-of-the-art results in multi-label image classification as well as weakly supervised object detection and very competitive results in weakly supervised semantic segmentation on MS-COCO, PASCAL VOC 2007 and PASCAL VOC 2012.

READ FULL TEXT

page 3

page 4

page 6

page 7

research
09/16/2018

Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection

Multi-label image classification is a fundamental but challenging task t...
research
10/17/2020

LID 2020: The Learning from Imperfect Data Challenge Results

Learning from imperfect data becomes an issue in many industrial applica...
research
11/23/2014

From Image-level to Pixel-level Labeling with Convolutional Networks

We are interested in inferring object segmentation by leveraging only ob...
research
04/08/2016

Deep Structured Scene Parsing by Learning with Image Descriptions

This paper addresses a fundamental problem of scene understanding: How t...
research
10/07/2019

Label-PEnet: Sequential Label Propagation and Enhancement Networks forWeakly Supervised Instance Segmentation

Weakly-supervised instance segmentation aims to detect and segment objec...
research
10/26/2020

A Centroid Loss for Weakly Supervised Semantic Segmentation in Quality Control and Inspection Application

Process automation has enabled a level of accuracy and productivity that...
research
03/07/2022

Unpaired Image Captioning by Image-level Weakly-Supervised Visual Concept Recognition

The goal of unpaired image captioning (UIC) is to describe images withou...

Please sign up or login with your details

Forgot password? Click here to reset