BGT-Net: Bidirectional GRU Transformer Network for Scene Graph Generation

09/11/2021
by   Naina Dhingra, et al.
0

Scene graphs are nodes and edges consisting of objects and object-object relationships, respectively. Scene graph generation (SGG) aims to identify the objects and their relationships. We propose a bidirectional GRU (BiGRU) transformer network (BGT-Net) for the scene graph generation for images. This model implements novel object-object communication to enhance the object information using a BiGRU layer. Thus, the information of all objects in the image is available for the other objects, which can be leveraged later in the object prediction step. This object information is used in a transformer encoder to predict the object class as well as to create object-specific edge information via the use of another transformer encoder. To handle the dataset bias induced by the long-tailed relationship distribution, softening with a log-softmax function and adding a bias adaptation term to regulate the bias for every relation prediction individually showed to be an effective approach. We conducted an elaborate study on experiments and ablations using open-source datasets, i.e., Visual Genome, Open-Images, and Visual Relationship Detection datasets, demonstrating the effectiveness of the proposed model over state of the art.

READ FULL TEXT

page 1

page 3

page 16

research
11/09/2022

SG-Shuffle: Multi-aspect Shuffle Transformer for Scene Graph Generation

Scene Graph Generation (SGG) serves a comprehensive representation of th...
research
02/22/2022

Relation Regularized Scene Graph Generation

Scene graph generation (SGG) is built on top of detected objects to pred...
research
01/18/2022

Resistance Training using Prior Bias: toward Unbiased Scene Graph Generation

Scene Graph Generation (SGG) aims to build a structured representation o...
research
04/07/2023

Devil's on the Edges: Selective Quad Attention for Scene Graph Generation

Scene graph generation aims to construct a semantic graph structure from...
research
03/20/2023

Revisiting Transformer for Point Cloud-based 3D Scene Graph Generation

In this paper, we propose the semantic graph Transformer (SGT) for the 3...
research
03/29/2020

GPS-Net: Graph Property Sensing Network for Scene Graph Generation

Scene graph generation (SGG) aims to detect objects in an image along wi...
research
04/13/2020

Relation Transformer Network

The identification of objects in an image, together with their mutual re...

Please sign up or login with your details

Forgot password? Click here to reset