RelTR: Relation Transformer for Scene Graph Generation

01/27/2022
by Yuren Cong, et al.
Different objects in the same scene are more or less related to each other, but only a limited number of these relationships are noteworthy. Inspired by DETR, which excels in object detection, we view scene graph generation as a set prediction problem and propose RelTR, an end-to-end scene graph generation model with an encoder-decoder architecture. The encoder reasons about the visual feature context, while the decoder infers a fixed-size set of subject-predicate-object triplets using different types of attention mechanisms with coupled subject and object queries. We design a set prediction loss that matches the ground-truth and predicted triplets for end-to-end training. In contrast to most existing scene graph generation methods, RelTR is a one-stage method that predicts a set of relationships directly, using only visual appearance, without combining entities and labeling all possible predicates. Extensive experiments on the Visual Genome and Open Images V6 datasets demonstrate the superior performance and fast inference of our model.
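
To make the matching step behind a set prediction loss concrete, here is a minimal Python sketch of a one-to-one assignment between ground-truth and predicted triplets. It is an illustration under stated assumptions, not RelTR's implementation: the abstract does not specify the matching cost, so `triplet_cost` is a hypothetical label-mismatch count, and `match_triplets` uses a brute-force search over assignments instead of an efficient solver such as the Hungarian algorithm. The example triplets are invented for illustration.

```python
from itertools import permutations


def triplet_cost(pred, gt):
    """Hypothetical cost: number of slots (subject, predicate, object) that differ."""
    return sum(p != g for p, g in zip(pred, gt))


def match_triplets(preds, gts):
    """Find the minimum-cost one-to-one assignment by brute force.

    Returns (total_cost, assignment), where assignment[i] is the index of the
    prediction matched to gts[i]. Assumes len(preds) >= len(gts), as in a
    fixed-size prediction set; predictions left unmatched would be treated
    as background. Only practical for very small sets.
    """
    best_cost, best_assign = None, None
    for perm in permutations(range(len(preds)), len(gts)):
        cost = sum(triplet_cost(preds[j], gts[i]) for i, j in enumerate(perm))
        if best_cost is None or cost < best_cost:
            best_cost, best_assign = cost, perm
    return best_cost, best_assign


# Invented toy data: three predicted triplets, two ground-truth triplets.
preds = [
    ("man", "riding", "horse"),
    ("dog", "on", "grass"),
    ("man", "wearing", "hat"),
]
gts = [
    ("man", "wearing", "hat"),
    ("man", "riding", "bike"),
]

cost, assign = match_triplets(preds, gts)
print(cost, assign)  # 1 (2, 0)
```

Here the first ground-truth triplet is matched to the third prediction at zero cost and the second to the first prediction at cost 1, while the remaining prediction stays unmatched. A training loss would then be computed over the matched pairs; in practice the cost would likely also include localization terms for the subject and object boxes, which this sketch omits.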

research
06/09/2023

Single-Stage Visual Relationship Learning using Conditional Queries

Research in scene graph generation (SGG) usually considers two-stage mod...
research
12/19/2022

SrTR: Self-reasoning Transformer with Visual-linguistic Knowledge for Scene Graph Generation

Objects in a scene are not always related. The execution efficiency of t...
research
08/18/2023

Vision Relation Transformer for Unbiased Scene Graph Generation

Recent years have seen a growing interest in Scene Graph Generation (SGG...
research
03/13/2023

Prototype-based Embedding Network for Scene Graph Generation

Current Scene Graph Generation (SGG) methods explore contextual informat...
research
01/18/2023

DDS: Decoupled Dynamic Scene-Graph Generation Network

Scene-graph generation involves creating a structural representation of ...
research
03/07/2019

Graphical Contrastive Losses for Scene Graph Generation

Most scene graph generators use a two-stage pipeline to detect visual re...
research
09/06/2023

RepSGG: Novel Representations of Entities and Relationships for Scene Graph Generation

Scene Graph Generation (SGG) has achieved significant progress recently....
