Unbiased Scene Graph Generation via Rich and Fair Semantic Extraction

02/01/2020
by Bin Wen, et al.

Extracting graph representations of visual scenes from images is a challenging task in computer vision. Although there has been encouraging progress in scene graph generation over the past decade, we find, surprisingly, that the performance of existing approaches is largely limited by strong biases, which mainly stem from (1) unconsciously assuming that relations have certain semantic properties, such as symmetry, and (2) imbalanced annotations across different relations. To alleviate the negative effects of these biases, we propose a new and simple architecture named the Rich and Fair semantic extraction network (RiFa for short), which not only captures rich semantic properties of relations but also fairly predicts relations with different amounts of annotation. Using pseudo-siamese networks, RiFa embeds the subject and the object separately, to distinguish their semantic differences while preserving their underlying semantic properties. It then predicts subject-object relations based on both the visual and semantic features of entities within a certain contextual area, and fairly ranks the predictions for relations that have only a few annotations. Experiments on the popular Visual Genome dataset show that RiFa achieves state-of-the-art performance under several challenging settings of the scene graph task. In particular, it performs significantly better at capturing different semantic properties of relations, and obtains the best overall per-relation performance.
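As a rough illustration of why separate subject and object embeddings avoid an implicit symmetry assumption, the following minimal sketch builds two branches of identical shape with unshared weights and scores relations from their concatenated outputs. It is not the authors' implementation: the class name `PseudoSiameseEmbedder`, the dimensions, the random weights, and the toy feature vectors are all hypothetical, and the visual features, contextual area, and fair ranking described in the abstract are omitted.

```python
import random


def make_linear(in_dim, out_dim, rng):
    """Return a random (out_dim x in_dim) weight matrix as nested lists."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(in_dim)]
            for _ in range(out_dim)]


def apply_linear(weights, vec):
    """Multiply a weight matrix by a vector."""
    return [sum(w * x for w, x in zip(row, vec)) for row in weights]


class PseudoSiameseEmbedder:
    """Hypothetical sketch: two branches, same shape, separate weights."""

    def __init__(self, feat_dim, emb_dim, num_relations, seed=0):
        rng = random.Random(seed)
        # The subject and object branches do NOT share parameters.
        self.w_subject = make_linear(feat_dim, emb_dim, rng)
        self.w_object = make_linear(feat_dim, emb_dim, rng)
        # Assumed relation scorer: a linear layer over the concatenation.
        self.w_relation = make_linear(2 * emb_dim, num_relations, rng)

    def relation_scores(self, subject_feat, object_feat):
        s = apply_linear(self.w_subject, subject_feat)
        o = apply_linear(self.w_object, object_feat)
        return apply_linear(self.w_relation, s + o)  # s + o concatenates lists


model = PseudoSiameseEmbedder(feat_dim=4, emb_dim=3, num_relations=5)
person = [0.9, 0.1, 0.3, 0.7]  # toy entity features (made up)
horse = [0.2, 0.8, 0.5, 0.1]

forward = model.relation_scores(person, horse)   # e.g. "person ? horse"
backward = model.relation_scores(horse, person)  # e.g. "horse ? person"
print(forward != backward)
```

Because each role has its own branch, swapping subject and object generally changes the scores, so a directional relation such as "riding" is not forced to score the same in both directions, while a trained model would still be free to learn near-symmetric scores for relations such as "next to".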

research
05/28/2019

Union Visual Translation Embedding for Visual Relationship Detection and Scene Graph Generation

Relations amongst entities play a central role in image understanding. D...
research
03/17/2022

Biasing Like Human: A Cognitive Bias Framework for Scene Graph Generation

Scene graph generation is a sophisticated task because there is no speci...
research
09/19/2016

On Support Relations and Semantic Scene Graphs

Scene understanding is a popular and challenging topic in both computer ...
research
03/28/2023

HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation

Panoptic Scene Graph generation (PSG) is a recently proposed task in ima...
research
01/14/2020

NODIS: Neural Ordinary Differential Scene Understanding

Semantic image understanding is a challenging topic in computer vision. ...
research
08/24/2023

SCoRD: Subject-Conditional Relation Detection with Text-Augmented Data

We propose Subject-Conditional Relation Detection SCoRD, where condition...
research
11/17/2017

Neural Motifs: Scene Graph Parsing with Global Context

We investigate the problem of producing structured graph representations...
