Iterative Scene Graph Generation

07/27/2022
by Siddhesh Khandelwal, et al.

The task of scene graph generation entails identifying object entities and their corresponding interaction predicates in a given image (or video). Because the solution space is combinatorially large, existing approaches to scene graph generation assume a certain factorization of the joint distribution to make estimation feasible (e.g., assuming that objects are conditionally independent of predicate predictions). However, this fixed factorization is not ideal in all scenarios (e.g., for images where an object involved in an interaction is small and not discernible on its own). In this work, we propose a novel framework for scene graph generation that addresses this limitation and introduces dynamic conditioning on the image, using message passing in a Markov Random Field. This is implemented as an iterative refinement procedure in which each modification is conditioned on the graph generated in the previous iteration. This conditioning across refinement steps allows joint reasoning over entities and relations. The framework is realized via a novel, end-to-end trainable transformer-based architecture. In addition, the proposed framework can improve the performance of existing approaches. Through extensive experiments on the Visual Genome and Action Genome benchmark datasets, we show improved performance on scene graph generation.
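
To make the refinement loop described in the abstract concrete, here is a minimal toy sketch in Python. It is not the paper's transformer architecture: the names `Graph`, `toy_step`, and `refine`, the score representation, and the mixing weights are all hypothetical and chosen only for illustration. It shows the one property the abstract states: each step is conditioned on the image and on the graph from the previous iteration, so the entity update can see the previous predicate estimate and vice versa.

```python
from typing import Callable, Dict, List

# Hypothetical representation: a "graph" is just two lists of scores.
Graph = Dict[str, List[float]]


def toy_step(image: Graph, prev: Graph) -> Graph:
    """One refinement step (illustrative only, not the paper's model).

    Entities are updated using the previous predicate estimate, and
    predicates are updated using the previous entity estimate.
    """
    pred_ctx = sum(prev["predicates"]) / len(prev["predicates"])
    ent_ctx = sum(prev["entities"]) / len(prev["entities"])
    entities = [
        0.5 * e + 0.25 * x + 0.25 * pred_ctx
        for e, x in zip(prev["entities"], image["entities"])
    ]
    predicates = [
        0.5 * p + 0.25 * x + 0.25 * ent_ctx
        for p, x in zip(prev["predicates"], image["predicates"])
    ]
    return {"entities": entities, "predicates": predicates}


def refine(
    image: Graph,
    initial: Graph,
    step: Callable[[Graph, Graph], Graph],
    num_steps: int,
) -> List[Graph]:
    """Apply `step` repeatedly; every call sees the previous estimate."""
    history = [initial]
    for _ in range(num_steps):
        history.append(step(image, history[-1]))
    return history


# Toy inputs (assumed values): image evidence of 1.0, initial estimate of 0.0.
image = {"entities": [1.0, 1.0], "predicates": [1.0]}
initial = {"entities": [0.0, 0.0], "predicates": [0.0]}
history = refine(image, initial, toy_step, num_steps=2)
```

In this toy setup the estimates move toward the image evidence over successive steps, and the second step already mixes entity and predicate information from the first. A learned model would replace `toy_step` with a trained network.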

research
01/10/2017

Scene Graph Generation by Iterative Message Passing

Understanding a visual scene goes beyond recognizing individual objects ...
research
12/16/2019

Learning Canonical Representations for Scene Graph to Image Generation

Generating realistic images of complex visual scenes becomes very challe...
research
11/30/2022

Iterative Scene Graph Generation with Generative Transformers

Scene graphs provide a rich, structured representation of a scene by enc...
research
08/10/2023

Local-Global Information Interaction Debiasing for Dynamic Scene Graph Generation

The task of dynamic scene graph generation (DynSGG) aims to generate sce...
research
02/02/2023

Self-Supervised Relation Alignment for Scene Graph Generation

The goal of scene graph generation is to predict a graph from an input i...
research
04/01/2021

Bipartite Graph Network with Adaptive Message Passing for Unbiased Scene Graph Generation

Scene graph generation is an important visual understanding task with a ...
research
08/17/2022

Towards Open-vocabulary Scene Graph Generation with Prompt-based Finetuning

Scene graph generation (SGG) is a fundamental task aimed at detecting vi...
