GID-Net: Detecting Human-Object Interaction with Global and Instance Dependency

03/11/2020
by   Dongming Yang, et al.
6

Since detecting and recognizing individual human or object are not adequate to understand the visual world, learning how humans interact with surrounding objects becomes a core technology. However, convolution operations are weak in depicting visual interactions between the instances since they only build blocks that process one local neighborhood at a time. To address this problem, we learn from human perception in observing HOIs to introduce a two-stage trainable reasoning mechanism, referred to as GID block. GID block breaks through the local neighborhoods and captures long-range dependency of pixels both in global-level and instance-level from the scene to help detecting interactions between instances. Furthermore, we conduct a multi-stream network called GID-Net, which is a human-object interaction detection framework consisting of a human branch, an object branch and an interaction branch. Semantic information in global-level and local-level are efficiently reasoned and aggregated in each of the branches. We have compared our proposed GID-Net with existing state-of-the-art methods on two public benchmarks, including V-COCO and HICO-DET. The results have showed that GID-Net outperforms the existing best-performing methods on both the above two benchmarks, validating its efficacy in detecting human-object interactions.

READ FULL TEXT

page 5

page 19

page 24

page 25

research
06/30/2022

GLD-Net: Improving Monaural Speech Enhancement by Learning Global and Local Dependency Features with GLD Block

For monaural speech enhancement, contextual information is important for...
research
09/18/2019

Pose-aware Multi-level Feature Network for Human Object Interaction Detection

Reasoning human object interactions is a core problem in human-centric s...
research
07/14/2020

A Graph-based Interactive Reasoning for Human-Object Interaction Detection

Human-Object Interaction (HOI) detection devotes to learn how humans int...
research
01/09/2023

Parallel Reasoning Network for Human-Object Interaction Detection

Human-Object Interaction (HOI) detection aims to learn how human interac...
research
08/30/2018

iCAN: Instance-Centric Attention Network for Human-Object Interaction Detection

Recent years have witnessed rapid progress in detecting and recognizing ...
research
08/29/2018

Interact as You Intend: Intention-Driven Human-Object Interaction Detection

The recent advances in instance-level detection tasks lay strong foundat...
research
04/11/2023

Relational Context Learning for Human-Object Interaction Detection

Recent state-of-the-art methods for HOI detection typically build on tra...

Please sign up or login with your details

Forgot password? Click here to reset