AVR: Attention based Salient Visual Relationship Detection

03/16/2020
by   Jianming Lv, et al.
0

Visual relationship detection aims to locate objects in images and recognize the relationships between objects. Traditional methods treat all observed relationships in an image equally, which causes a relatively poor performance in the detection tasks on complex images with abundant visual objects and various relationships. To address this problem, we propose an attention based model, namely AVR, to achieve salient visual relationships based on both local and global context of the relationships. Specifically, AVR recognizes relationships and measures the attention on the relationships in the local context of an input image by fusing the visual features, semantic and spatial information of the relationships. AVR then applies the attention to assign important relationships with larger salient weights for effective information filtering. Furthermore, AVR is integrated with the priori knowledge in the global context of image datasets to improve the precision of relationship prediction, where the context is modeled as a heterogeneous graph to measure the priori probability of relationships based on the random walk algorithm. Comprehensive experiments are conducted to demonstrate the effectiveness of AVR in several real-world image datasets, and the results show that AVR outperforms state-of-the-art visual relationship detection methods significantly by up to 87.5% in terms of recall.

READ FULL TEXT

page 1

page 3

page 8

research
03/08/2021

Relationship-based Neural Baby Talk

Understanding interactions between objects in an image is an important e...
research
09/24/2020

Local Context Attention for Salient Object Segmentation

Salient object segmentation aims at distinguishing various salient objec...
research
03/20/2019

On Class Imbalance and Background Filtering in Visual Relationship Detection

In this paper we investigate the problems of class imbalance and irrelev...
research
12/01/2019

Interpreting Context of Images using Scene Graphs

Understanding a visual scene incorporates objects, relationships, and co...
research
09/23/2019

Deep Local Global Refinement Network for Stent Analysis in IVOCT Images

Implantation of stents into coronary arteries is a common treatment opti...
research
03/17/2023

High Accurate and Explainable Multi-Pill Detection Framework with Graph Neural Network-Assisted Multimodal Data Fusion

Due to the significant resemblance in visual appearance, pill misuse is ...
research
04/11/2017

Detecting Visual Relationships with Deep Relational Networks

Relationships among objects play a crucial role in image understanding. ...

Please sign up or login with your details

Forgot password? Click here to reset