Cerberus Transformer: Joint Semantic, Affordance and Attribute Parsing

11/24/2021
by   Xiaoxue Chen, et al.
2

Multi-task indoor scene understanding is widely considered as an intriguing formulation, as the affinity of different tasks may lead to improved performance. In this paper, we tackle the new problem of joint semantic, affordance and attribute parsing. However, successfully resolving it requires a model to capture long-range dependency, learn from weakly aligned data and properly balance sub-tasks during training. To this end, we propose an attention-based architecture named Cerberus and a tailored training framework. Our method effectively addresses the aforementioned challenges and achieves state-of-the-art performance on all three tasks. Moreover, an in-depth analysis shows concept affinity consistent with human cognition, which inspires us to explore the possibility of weakly supervised learning. Surprisingly, Cerberus achieves strong results using only 0.1 confirm that this success is credited to common attention maps across tasks. Code and models can be accessed at https://github.com/OPEN-AIR-SUN/Cerberus.

READ FULL TEXT

page 1

page 3

page 6

page 8

page 13

page 14

page 15

research
04/03/2023

WeakTr: Exploring Plain Vision Transformer for Weakly-supervised Semantic Segmentation

This paper explores the properties of the plain Vision Transformer (ViT)...
research
06/08/2021

Affinity Attention Graph Neural Network for Weakly Supervised Semantic Segmentation

Weakly supervised semantic segmentation is receiving great attention due...
research
03/18/2023

Spatial-Aware Token for Weakly Supervised Object Localization

Weakly supervised object localization (WSOL) is a challenging task aimin...
research
03/06/2022

Multi-class Token Transformer for Weakly Supervised Semantic Segmentation

This paper proposes a new transformer-based framework to learn class-spe...
research
04/22/2017

Deep Multitask Learning for Semantic Dependency Parsing

We present a deep neural architecture that parses sentences into three s...
research
01/07/2023

"It's a Match!" – A Benchmark of Task Affinity Scores for Joint Learning

While the promises of Multi-Task Learning (MTL) are attractive, characte...
research
06/26/2019

Eliciting Knowledge from Experts:Automatic Transcript Parsing for Cognitive Task Analysis

Cognitive task analysis (CTA) is a type of analysis in applied psycholog...

Please sign up or login with your details

Forgot password? Click here to reset