Attentive Relational Networks for Mapping Images to Scene Graphs

11/26/2018
by   Mengshi Qi, et al.
0

Scene graph generation refers to the task of automatically mapping an image into a semantic structural graph, which requires correctly labeling each extracted objects and their interaction relationships. Despite the recent successes in object detection using deep learning techniques, inferring complex contextual relationships and structured graph representations from visual data remains a challenging topic. In this study, we propose a novel Attentive Relational Network that consists of two key modules with an object detection backbone to approach this problem. The first module is a semantic transformation module used to capture semantic embedded relation features, by translating visual features and linguistic features into a common semantic space. The other module is a graph self-attention module introduced to embed a joint graph representation through assigning various importance weights to neighboring nodes. Finally, accurate scene graphs are produced with the relation inference module by recognizing all entities and the corresponding relations. We evaluate our proposed method on the widely-adopted Visual Genome Dataset, and the results demonstrate the effectiveness and superiority of our model.

READ FULL TEXT

page 1

page 3

page 4

page 7

page 8

research
08/12/2020

HOSE-Net: Higher Order Structure Embedded Network for Scene Graph Generation

Scene graph generation aims to produce structured representations for im...
research
08/19/2021

Exploiting Scene Graphs for Human-Object Interaction Detection

Human-Object Interaction (HOI) detection is a fundamental visual task ai...
research
04/03/2019

Exploring the Semantics for Visual Relationship Detection

Scene graph construction / visual relationship detection from an image a...
research
02/01/2019

Rethinking Visual Relationships for High-level Image Understanding

Relationships, as the bond of isolated entities in images, reflect the i...
research
04/07/2023

Devil's on the Edges: Selective Quad Attention for Scene Graph Generation

Scene graph generation aims to construct a semantic graph structure from...
research
05/27/2021

Relational Gating for "What If" Reasoning

This paper addresses the challenge of learning to do procedural reasonin...
research
04/11/2017

Detecting Visual Relationships with Deep Relational Networks

Relationships among objects play a crucial role in image understanding. ...

Please sign up or login with your details

Forgot password? Click here to reset