Panoptic Scene Graph Generation with Semantics-prototype Learning

07/28/2023
by   Li Li, et al.
0

Panoptic Scene Graph Generation (PSG) parses objects and predicts their relationships (predicate) to connect human language and visual scenes. However, different language preferences of annotators and semantic overlaps between predicates lead to biased predicate annotations in the dataset, i.e. different predicates for same object pairs. Biased predicate annotations make PSG models struggle in constructing a clear decision plane among predicates, which greatly hinders the real application of PSG models. To address the intrinsic bias above, we propose a novel framework named ADTrans to adaptively transfer biased predicate annotations to informative and unified ones. To promise consistency and accuracy during the transfer process, we propose to measure the invariance of representations in each predicate class, and learn unbiased prototypes of predicates with different intensities. Meanwhile, we continuously measure the distribution changes between each presentation and its prototype, and constantly screen potential biased data. Finally, with the unbiased predicate-prototype representation embedding space, biased annotations are easily identified. Experiments show that ADTrans significantly improves the performance of benchmark models, achieving a new state-of-the-art performance, and shows great generalization and effectiveness on multiple datasets.

READ FULL TEXT

page 1

page 6

page 8

research
09/16/2020

CogTree: Cognition Tree Loss for Unbiased Scene Graph Generation

Scene graphs are semantic abstraction of images that encourage visual un...
research
07/30/2023

Triple Correlations-Guided Label Supplementation for Unbiased Video Scene Graph Generation

Video-based scene graph generation (VidSGG) is an approach that aims to ...
research
03/13/2023

Prototype-based Embedding Network for Scene Graph Generation

Current Scene Graph Generation (SGG) methods explore contextual informat...
research
04/01/2019

Scene Graph Generation with External Knowledge and Image Reconstruction

Scene graph generation has received growing attention with the advanceme...
research
07/27/2021

Greedy Gradient Ensemble for Robust Visual Question Answering

Language bias is a critical issue in Visual Question Answering (VQA), wh...
research
10/03/2022

Unbiased Scene Graph Generation using Predicate Similarities

Scene Graphs are widely applied in computer vision as a graphical repres...
research
03/18/2022

Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation

Scene Graph Generation, which generally follows a regular encoder-decode...

Please sign up or login with your details

Forgot password? Click here to reset