Unified Perceptual Parsing for Scene Understanding

07/26/2018
by Tete Xiao, et al.

Humans recognize the visual world at multiple levels: we effortlessly categorize scenes and detect the objects inside them, while also identifying the textures and surfaces of those objects along with their compositional parts. In this paper, we study a new task called Unified Perceptual Parsing, which requires machine vision systems to recognize as many visual concepts as possible from a given image. We develop a multi-task framework called UPerNet, together with a training strategy, to learn from heterogeneous image annotations. We benchmark our framework on Unified Perceptual Parsing and show that it effectively segments a wide range of concepts from images. The trained networks are further applied to discover visual knowledge in natural scenes. Models are available at <https://github.com/CSAILVision/unifiedparsing>.
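Learning from heterogeneous annotations means that a given image may carry labels for only some of the concept levels. The sketch below illustrates one common way to handle this in a multi-task setup: the loss is summed only over tasks that are annotated for the current batch. It is a minimal illustration under stated assumptions, not the UPerNet implementation; the function name `multitask_loss`, the task names, and the optional per-task weights are all hypothetical.

```python
from typing import Dict, Optional, Set


def multitask_loss(
    per_task_losses: Dict[str, float],
    annotated_tasks: Set[str],
    weights: Optional[Dict[str, float]] = None,
) -> float:
    """Sum per-task losses, skipping tasks that have no labels in this batch.

    Hypothetical helper: the task names and weights are illustrative
    assumptions, not values taken from the paper.
    """
    weights = weights or {}
    total = 0.0
    for task, loss in per_task_losses.items():
        if task in annotated_tasks:
            # Tasks without an explicit weight default to 1.0.
            total += weights.get(task, 1.0) * loss
    return total


# A batch from a source that labels scenes and objects but not textures:
losses = {"scene": 1.0, "object": 2.0, "texture": 4.0}
batch_loss = multitask_loss(losses, annotated_tasks={"scene", "object"})
```

In a real training loop the per-task values would be differentiable tensors, and skipping an unannotated task means its prediction head receives no gradient from that batch.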


