Visual Relationship Detection with Relative Location Mining

11/02/2019
by   Hao Zhou, et al.
15

Visual relationship detection, as a challenging task used to find and distinguish the interactions between object pairs in one image, has received much attention recently. In this work, we propose a novel visual relationship detection framework by deeply mining and utilizing relative location of object-pair in every stage of the procedure. In both the stages, relative location information of each object-pair is abstracted and encoded as auxiliary feature to improve the distinguishing capability of object-pairs proposing and predicate recognition, respectively; Moreover, one Gated Graph Neural Network(GGNN) is introduced to mine and measure the relevance of predicates using relative location. With the location-based GGNN, those non-exclusive predicates with similar spatial position can be clustered firstly and then be smoothed with close classification scores, thus the accuracy of top n recall can be increased further. Experiments on two widely used datasets VRD and VG show that, with the deeply mining and exploiting of relative location information, our proposed model significantly outperforms the current state-of-the-art.

READ FULL TEXT

page 1

page 3

page 8

research
03/11/2020

VSGNet: Spatial Attention Network for Detecting Human Object Interactions Using Graph Convolutions

Comprehensive visual understanding requires detection frameworks that ca...
research
11/29/2021

Agent-Centric Relation Graph for Object Visual Navigation

Object visual navigation aims to steer an agent towards a target object ...
research
05/05/2019

On Exploring Undetermined Relationships for Visual Relationship Detection

In visual relationship detection, human-notated relationships can be reg...
research
08/11/2021

Mining the Benefits of Two-stage and One-stage HOI Detection

Two-stage methods have dominated Human-Object Interaction (HOI) detectio...
research
04/09/2020

Spatial Priming for Detecting Human-Object Interactions

The relative spatial layout of a human and an object is an important cue...
research
12/09/2015

Window-Object Relationship Guided Representation Learning for Generic Object Detections

In existing works that learn representation for object detection, the re...
research
05/28/2018

Visual Relationship Detection Based on Guided Proposals and Semantic Knowledge Distillation

A thorough comprehension of image content demands a complex grasp of the...

Please sign up or login with your details

Forgot password? Click here to reset