Unbiased Scene Graph Generation using Predicate Similarities

10/03/2022
by   Misaki Ohashi, et al.
0

Scene Graphs are widely applied in computer vision as a graphical representation of relationships between objects shown in images. However, these applications have not yet reached a practical stage of development owing to biased training caused by long-tailed predicate distributions. In recent years, many studies have tackled this problem. In contrast, relatively few works have considered predicate similarities as a unique dataset feature which also leads to the biased prediction. Due to the feature, infrequent predicates (e.g., parked on, covered in) are easily misclassified as closely-related frequent predicates (e.g., on, in). Utilizing predicate similarities, we propose a new classification scheme that branches the process to several fine-grained classifiers for similar predicate groups. The classifiers aim to capture the differences among similar predicates in detail. We also introduce the idea of transfer learning to enhance the features for the predicates which lack sufficient training samples to learn the descriptive representations. The results of extensive experiments on the Visual Genome dataset show that the combination of our method and an existing debiasing approach greatly improves performance on tail predicates in challenging SGCls/SGDet tasks. Nonetheless, the overall performance of the proposed approach does not reach that of the current state of the art, so further analysis remains necessary as future work.

READ FULL TEXT
research
06/13/2020

Learning from the Scene and Borrowing from the Rich: Tackling the Long Tail in Scene Graph Generation

Despite the huge progress in scene graph generation in recent years, its...
research
06/16/2023

Multi-Label Meta Weighting for Long-Tailed Dynamic Scene Graph Generation

This paper investigates the problem of scene graph generation in videos ...
research
05/30/2023

Fine-Grained is Too Coarse: A Novel Data-Centric Approach for Efficient Scene Graph Generation

Learning to compose visual relationships from raw images in the form of ...
research
07/05/2021

Recovering the Unbiased Scene Graphs from the Biased Ones

Given input images, scene graph generation (SGG) aims to produce compreh...
research
07/28/2023

Panoptic Scene Graph Generation with Semantics-prototype Learning

Panoptic Scene Graph Generation (PSG) parses objects and predicts their ...
research
02/22/2018

Deep Unsupervised Learning of Visual Similarities

Exemplar learning of visual similarities in an unsupervised manner is a ...
research
09/16/2020

CogTree: Cognition Tree Loss for Unbiased Scene Graph Generation

Scene graphs are semantic abstraction of images that encourage visual un...

Please sign up or login with your details

Forgot password? Click here to reset