Semantic Understanding of Scenes through the ADE20K Dataset

08/18/2016
by   Bolei Zhou, et al.
0

Scene parsing, or recognizing and segmenting objects and stuff in an image, is one of the key problems in computer vision. Despite the community's efforts in data collection, there are still few image datasets covering a wide range of scenes and object categories with dense and detailed annotations for scene parsing. In this paper, we introduce and analyze the ADE20K dataset, spanning diverse annotations of scenes, objects, parts of objects, and in some cases even parts of parts. A generic network design called Cascade Segmentation Module is then proposed to enable the segmentation networks to parse a scene into stuff, objects, and object parts in a cascade. We evaluate the proposed module integrated within two existing semantic segmentation networks, yielding significant improvements for scene parsing. We further show that the scene parsing networks trained on ADE20K can be applied to a wide variety of scenes and objects.

READ FULL TEXT

page 2

page 3

page 4

page 5

page 6

page 7

page 9

page 12

research
07/26/2018

Unified Perceptual Parsing for Scene Understanding

Humans recognize the visual world at multiple levels: we effortlessly ca...
research
03/30/2022

FLOAT: Factorized Learning of Object Attributes for Improved Multi-object Multi-part Scene Parsing

Multi-object multi-part scene parsing is a challenging task which requir...
research
03/26/2017

Open Vocabulary Scene Parsing

Recognizing arbitrary objects in the wild has been a challenging problem...
research
10/19/2018

Synscapes: A Photorealistic Synthetic Dataset for Street Scene Parsing

We introduce Synscapes -- a synthetic dataset for street scene parsing c...
research
06/11/2021

Part-aware Panoptic Segmentation

In this work, we introduce the new scene understanding task of Part-awar...
research
07/05/2016

Learning the semantic structure of objects from Web supervision

While recent research in image understanding has often focused on recogn...
research
01/09/2017

Information Pursuit: A Bayesian Framework for Sequential Scene Parsing

Despite enormous progress in object detection and classification, the pr...

Please sign up or login with your details

Forgot password? Click here to reset