FLOAT: Factorized Learning of Object Attributes for Improved Multi-object Multi-part Scene Parsing

03/30/2022
by   Rishubh Singh, et al.
0

Multi-object multi-part scene parsing is a challenging task which requires detecting multiple object classes in a scene and segmenting the semantic parts within each object. In this paper, we propose FLOAT, a factorized label space framework for scalable multi-object multi-part parsing. Our framework involves independent dense prediction of object category and part attributes which increases scalability and reduces task complexity compared to the monolithic label space counterpart. In addition, we propose an inference-time 'zoom' refinement technique which significantly improves segmentation quality, especially for smaller objects/parts. Compared to state of the art, FLOAT obtains an absolute improvement of 2.0 segmentation quality IOU (sqIOU) on the Pascal-Part-58 dataset. For the larger Pascal-Part-108 dataset, the improvements are 2.1 We incorporate previously excluded part attributes and other minor parts of the Pascal-Part dataset to create the most comprehensive and challenging version which we dub Pascal-Part-201. FLOAT obtains improvements of 8.6 7.5 across a challenging diversity of objects and parts. The code and datasets are available at floatseg.github.io.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 8

page 15

page 16

page 17

research
08/18/2016

Semantic Understanding of Scenes through the ADE20K Dataset

Scene parsing, or recognizing and segmenting objects and stuff in an ima...
research
03/28/2017

Objects as context for detecting their semantic parts

We present a semantic part detection approach that effectively leverages...
research
05/10/2015

Deep Learning for Semantic Part Segmentation with High-Level Guidance

In this work we address the task of segmenting an object into its parts,...
research
12/27/2018

Finite State Machines for Semantic Scene Parsing and Segmentation

We introduce in this work a novel stochastic inference process, for scen...
research
07/17/2020

GMNet: Graph Matching Network for Large Scale Part Semantic Segmentation in the Wild

The semantic segmentation of parts of objects in the wild is a challengi...
research
04/16/2020

Cityscapes-Panoptic-Parts and PASCAL-Panoptic-Parts datasets for Scene Understanding

In this technical report, we present two novel datasets for image scene ...
research
06/11/2021

Part-aware Panoptic Segmentation

In this work, we introduce the new scene understanding task of Part-awar...

Please sign up or login with your details

Forgot password? Click here to reset