Knowledge Guided Bidirectional Attention Network for Human-Object Interaction Detection

07/16/2022
by   Jingjia Huang, et al.

Human-Object Interaction (HOI) detection is a challenging task that requires distinguishing the interaction within a human-object pair. Attention-based relation parsing is a popular and effective strategy in HOI detection. However, current methods perform relation parsing in a "bottom-up" manner. We argue that using the bottom-up parsing strategy on its own is counter-intuitive and can lead to the diffusion of attention. Therefore, we introduce a novel knowledge-guided top-down attention into HOI detection and propose to model relation parsing as a "look and search" process: first perform scene-context modeling (i.e., look), and then, given the knowledge of the target pair, search for visual clues that discriminate the interaction between the pair. We implement this process by unifying bottom-up and top-down attention in a single encoder-decoder-based model. The experimental results show that our model achieves competitive performance on the V-COCO and HICO-DET datasets.
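
The "look and search" idea can be illustrated with a toy sketch. This is not the authors' implementation: it assumes plain scaled dot-product attention as a stand-in for both stages, and the names `look`, `search`, `scene`, and `pair_query`, as well as the feature values, are hypothetical. The bottom-up step lets every scene feature attend to all others; the top-down step lets a query derived from the target pair attend to the resulting context.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def attend(query, keys, values):
    """Scaled dot-product attention for a single query vector."""
    scale = math.sqrt(len(query))
    weights = softmax([dot(query, k) / scale for k in keys])
    dim = len(values[0])
    out = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]
    return out, weights


def look(features):
    """Bottom-up step (assumed form): self-attention over all scene features."""
    return [attend(f, features, features)[0] for f in features]


def search(pair_query, context):
    """Top-down step (assumed form): a pair-conditioned query attends to the context."""
    return attend(pair_query, context, context)


# Toy 2-D scene features and a hypothetical embedding of the target human-object pair.
scene = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
pair_query = [2.0, 0.0]

context = look(scene)                        # "look": scene-context modeling
clue, weights = search(pair_query, context)  # "search": pair-guided attention
print(weights)
```

In this sketch the pair query decides where attention lands in the second stage, which is the property the abstract contrasts with purely bottom-up parsing.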

research
08/26/2020

DRG: Dual Relation Graph for Human-Object Interaction Detection

We tackle the challenging problem of human-object interaction (HOI) dete...
research
08/02/2021

GTNet:Guided Transformer Network for Detecting Human-Object Interactions

The human-object interaction (HOI) detection task refers to localizing h...
research
03/11/2020

VSGNet: Spatial Attention Network for Detecting Human Object Interactions Using Graph Convolutions

Comprehensive visual understanding requires detection frameworks that ca...
research
03/19/2021

ClawCraneNet: Leveraging Object-level Relation for Text-based Video Segmentation

Text-based video segmentation is a challenging task that segments out th...
research
07/15/2021

What and When to Look?: Temporal Span Proposal Network for Video Visual Relation Detection

Identifying relations between objects is central to understanding the sc...
research
04/11/2023

Relational Context Learning for Human-Object Interaction Detection

Recent state-of-the-art methods for HOI detection typically build on tra...
research
02/08/2021

In-Order Chart-Based Constituent Parsing

We propose a novel in-order chart-based model for constituent parsing. C...
