Pyramid Scene Parsing Network

12/04/2016
by   Hengshuang Zhao, et al.
0

Scene parsing is challenging for unrestricted open vocabulary and diverse scenes. In this paper, we exploit the capability of global context information by different-region-based context aggregation through our pyramid pooling module together with the proposed pyramid scene parsing network (PSPNet). Our global prior representation is effective to produce good quality results on the scene parsing task, while PSPNet provides a superior framework for pixel-level prediction tasks. The proposed approach achieves state-of-the-art performance on various datasets. It came first in ImageNet scene parsing challenge 2016, PASCAL VOC 2012 benchmark and Cityscapes benchmark. A single PSPNet yields new record of mIoU accuracy 85.4 Cityscapes.

READ FULL TEXT

page 1

page 3

page 4

page 7

page 8

page 9

research
05/25/2018

Pyramid Attention Network for Semantic Segmentation

A Pyramid Attention Network(PAN) is proposed to exploit the impact of gl...
research
09/04/2018

OCNet: Object Context Network for Scene Parsing

Context is essential for various computer vision tasks. The state-of-the...
research
04/17/2019

CaseNet: Content-Adaptive Scale Interaction Networks for Scene Parsing

Objects in an image exhibit diverse scales. Adaptive receptive fields ar...
research
07/29/2019

Consensus Feature Network for Scene Parsing

Scene parsing is challenging as it aims to assign one of the semantic ca...
research
08/30/2022

Boosting Night-time Scene Parsing with Learnable Frequency

Night-Time Scene Parsing (NTSP) is essential to many vision applications...
research
03/26/2017

Open Vocabulary Scene Parsing

Recognizing arbitrary objects in the wild has been a challenging problem...
research
12/01/2016

Video Scene Parsing with Predictive Feature Learning

In this work, we address the challenging video scene parsing problem by ...

Please sign up or login with your details

Forgot password? Click here to reset