Learning Rich Representations For Structured Visual Prediction Tasks

08/30/2019
by Mohammadreza Mostajabi, et al.

We describe an approach to learning rich representations for images that enables simple and effective predictors in a range of vision tasks involving spatially structured maps. Our key idea is to map small image elements to feature representations extracted from a sequence of nested regions of increasing spatial extent. These regions are obtained by "zooming out" from the pixel/superpixel all the way to scene-level resolution, and hence we call these zoom-out features. Applied to semantic segmentation and other structured prediction tasks, our approach exploits statistical structure in the image and in the label space without setting up explicit structured prediction mechanisms, and thus avoids complex and expensive inference. Instead, image elements are classified by a feedforward multilayer network with skip-layer connections spanning the zoom-out levels. When used in conjunction with modern neural architectures such as ResNet, DenseNet and NASNet (to which it is complementary), our approach achieves competitive accuracy on segmentation benchmarks. In addition, we propose an approach for learning category-level semantic segmentation purely from image-level classification tags. It exploits localization cues that emerge from training a modified zoom-out architecture tailored for classification tasks to drive a weakly supervised process that automatically labels a sparse, diverse training set of points likely to belong to classes of interest. Finally, we introduce data-driven regularization functions for the supervised training of CNNs. Our innovation takes the form of a regularizer derived by learning an autoencoder over the set of annotations. This approach leverages an improved representation of label space to inform extraction of features from images.
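The zoom-out idea in the abstract, describing one image element by features pooled over nested regions of growing extent and ending at the whole scene, can be illustrated with a toy sketch. This is not the authors' implementation: the function names `region_mean` and `zoom_out_features`, the square windows, the `radii` parameter, and the use of mean intensity as the per-region feature are all assumptions made for illustration. The approach described above extracts learned convolutional features at each level and feeds them to a multilayer classifier, which this sketch omits.

```python
from typing import List, Sequence


def region_mean(image: Sequence[Sequence[float]],
                top: int, left: int, bottom: int, right: int) -> float:
    """Mean intensity over rows [top, bottom) and columns [left, right)."""
    values = [image[r][c] for r in range(top, bottom) for c in range(left, right)]
    return sum(values) / len(values)


def zoom_out_features(image: Sequence[Sequence[float]],
                      row: int, col: int,
                      radii: Sequence[int] = (0, 1, 2)) -> List[float]:
    """Concatenate one feature per nested region around pixel (row, col).

    Assumption: each "zoom-out level" is a square window of the given radius,
    clipped at the image border, and its feature is the mean intensity.
    The last entry is the scene-level feature over the whole image.
    """
    height, width = len(image), len(image[0])
    features = []
    for radius in radii:
        top = max(0, row - radius)
        left = max(0, col - radius)
        bottom = min(height, row + radius + 1)
        right = min(width, col + radius + 1)
        features.append(region_mean(image, top, left, bottom, right))
    # Final zoom-out level: the entire image (scene-level context).
    features.append(region_mean(image, 0, 0, height, width))
    return features


# Toy 5x5 image with a bright centre pixel surrounded by a dimmer ring.
image = [
    [0.0, 0.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 1.0, 1.0, 0.0],
    [0.0, 1.0, 5.0, 1.0, 0.0],
    [0.0, 1.0, 1.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 0.0, 0.0],
]
centre_features = zoom_out_features(image, 2, 2, radii=(0, 1))
corner_features = zoom_out_features(image, 0, 0, radii=(1,))
```

In this sketch every pixel shares the same scene-level entry while the local entries differ, which mirrors how wider zoom-out levels vary more slowly across the image than narrow ones. A per-pixel classifier would take the concatenated vector as input.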
