Strip Pooling: Rethinking Spatial Pooling for Scene Parsing

03/30/2020
by Qibin Hou, et al.

Spatial pooling has been proven highly effective in capturing long-range contextual information for pixel-wise prediction tasks, such as scene parsing. In this paper, beyond conventional spatial pooling that usually has a regular shape of N×N, we rethink the formulation of spatial pooling by introducing a new pooling strategy, called strip pooling, which considers a long but narrow kernel, i.e., 1×N or N×1. Based on strip pooling, we further investigate spatial pooling architecture design by 1) introducing a new strip pooling module that enables backbone networks to efficiently model long-range dependencies, 2) presenting a novel building block with diverse spatial pooling as a core, and 3) systematically comparing the performance of the proposed strip pooling and conventional spatial pooling techniques. Both novel pooling-based designs are lightweight and can serve as an efficient plug-and-play module in existing scene parsing networks. Extensive experiments on popular benchmarks (e.g., ADE20K and Cityscapes) demonstrate that our simple approach establishes new state-of-the-art results. Code is made available at https://github.com/Andrew-Qibin/SPNet.
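
To make the kernel shapes concrete, the following is a minimal sketch of pooling along 1×N and N×1 strips on a single 2D feature map. It is not the authors' implementation (see the repository linked above for that). The function name `strip_pool`, the use of average pooling, and the choice to let each strip span the full width or height of the map are assumptions made for illustration.

```python
def strip_pool(feature_map):
    """Average-pool a 2D map along full-width and full-height strips.

    Hypothetical helper: averaging and full-span strips are assumptions,
    not details taken from the abstract.
    """
    height = len(feature_map)
    width = len(feature_map[0])
    # 1xN strip covering a whole row: one pooled value per row (H values).
    row_strips = [sum(row) / width for row in feature_map]
    # Nx1 strip covering a whole column: one pooled value per column (W values).
    col_strips = [
        sum(feature_map[r][c] for r in range(height)) / height
        for c in range(width)
    ]
    return row_strips, col_strips


# A 3x4 toy feature map.
feature_map = [
    [1.0, 2.0, 3.0, 6.0],
    [0.0, 0.0, 0.0, 0.0],
    [4.0, 4.0, 4.0, 4.0],
]
row_strips, col_strips = strip_pool(feature_map)
print(row_strips)       # one value per row
print(len(col_strips))  # one value per column
```

Each pooled value summarizes an entire row or column, so a single output depends on positions that are far apart along one axis while staying narrow along the other. A square N×N window of comparable reach would also mix in everything between those positions on both axes.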

research
01/11/2021

ORDNet: Capturing Omni-Range Dependencies for Scene Parsing

Learning to capture dependencies between spatial positions is essential ...
research
03/13/2023

Designing Deep Networks for Scene Recognition

Most deep learning backbones are evaluated on ImageNet. Using scenery im...
research
11/16/2016

S3Pool: Pooling with Stochastic Spatial Sampling

Feature pooling layers (e.g., max pooling) in convolutional neural netwo...
research
08/29/2021

Layout-to-Image Translation with Double Pooling Generative Adversarial Networks

In this paper, we address the task of layout-to-image translation, which...
research
09/08/2022

Lightweight Long-Range Generative Adversarial Networks

In this paper, we introduce novel lightweight generative adversarial net...
research
03/28/2022

Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing

This paper probes intrinsic factors behind typical failure cases (e.g. s...
research
05/25/2021

Fast and Accurate Scene Parsing via Bi-direction Alignment Networks

In this paper, we propose an effective method for fast and accurate scen...
