Pixel-wise Attentional Gating for Parsimonious Pixel Labeling

05/03/2018
by   Shu Kong, et al.
0

To achieve parsimonious inference in per-pixel labeling tasks with a limited computational budget, we propose a Pixel-wise Attentional Gating unit (PAG) that learns to selectively process a subset of spatial locations at each layer of a deep convolutional network. PAG is a generic, architecture-independent, problem-agnostic mechanism that can be readily "plugged in" to an existing model with fine-tuning. We utilize PAG in two ways: 1) learning spatially varying pooling fields that improve model performance without the extra computation cost associated with multi-scale pooling, and 2) learning a dynamic computation policy for each pixel to decrease total computation while maintaining accuracy. We extensively evaluate PAG on a variety of per-pixel labeling tasks, including semantic segmentation, boundary detection, monocular depth and surface normal estimation. We demonstrate that PAG allows competitive or state-of-the-art performance on these tasks. Our experiments show that PAG learns dynamic spatial allocation of computation over the input image which provides better performance trade-offs compared to related approaches (e.g., truncating deep models or dynamically skipping whole layers). Generally, we observe PAG can reduce computation by 10% without noticeable loss in accuracy and performance degrades gracefully when imposing stronger computational constraints.

READ FULL TEXT

page 2

page 7

page 12

page 15

page 20

page 21

page 25

page 26

research
08/11/2016

Learning Dynamic Hierarchical Models for Anytime Scene Labeling

With increasing demand for efficient image and video analysis, test-time...
research
05/27/2015

SegNet: A Deep Convolutional Encoder-Decoder Architecture for Robust Semantic Pixel-Wise Labelling

We propose a novel deep architecture, SegNet, for semantic pixel wise im...
research
02/08/2021

Semantic Segmentation with Labeling Uncertainty and Class Imbalance

Recently, methods based on Convolutional Neural Networks (CNN) achieved ...
research
11/18/2014

Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture

In this paper we address three different computer vision tasks using a s...
research
07/26/2016

Region-based semantic segmentation with end-to-end training

We propose a novel method for semantic segmentation, the task of labelin...
research
01/22/2017

A New Convolutional Network-in-Network Structure and Its Applications in Skin Detection, Semantic Segmentation, and Artifact Reduction

The inception network has been shown to provide good performance on imag...
research
12/14/2016

Detect, Replace, Refine: Deep Structured Prediction For Pixel Wise Labeling

Pixel wise image labeling is an interesting and challenging problem with...

Please sign up or login with your details

Forgot password? Click here to reset