Mining Conditional Part Semantics with Occluded Extrapolation for Human-Object Interaction Detection

07/19/2023
by Guangzhi Wang, et al.

Human-Object Interaction (HOI) detection is a crucial aspect of human-centric scene understanding, with important applications in various domains. Despite recent progress in this field, recognizing subtle and detailed interactions remains challenging. Existing methods use human-related cues to alleviate the difficulty, but they rely heavily on external annotations or knowledge, which limits their practical applicability in real-world scenarios. In this work, we propose a novel Part Semantic Network (PSN) to solve this problem. The core of PSN is a Conditional Part Attention (CPA) mechanism: a cross-attention computation in which human features serve as keys and values and the object feature serves as the query. In this way, our model learns to automatically focus on the most informative human parts conditioned on the involved object, generating more semantically meaningful features for interaction recognition. Additionally, we propose an Occluded Part Extrapolation (OPE) strategy to facilitate interaction recognition under occlusion, which teaches the model to extrapolate detailed features from partially occluded ones. Our method consistently outperforms prior approaches on the V-COCO and HICO-DET datasets, without external data or extra annotations. Additional ablation studies validate the effectiveness of each component of our proposed method.
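
The cross-attention pattern behind CPA can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes plain scaled dot-product attention with a single head and no learned projections, and the names `cross_attend`, `object_query`, and `human_parts` are hypothetical. It only shows how one object feature, used as the query, produces a weighting over human part features that act as both keys and values.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attend(object_query, human_parts):
    """Scaled dot-product cross-attention (illustrative assumption only).

    object_query: feature vector of the object, used as the query.
    human_parts:  list of human part feature vectors, used as keys and values.
    Returns the attended feature and the attention weight per human part.
    """
    dim = len(object_query)
    # Similarity between the object query and each human part (key).
    scores = [
        sum(q * k for q, k in zip(object_query, part)) / math.sqrt(dim)
        for part in human_parts
    ]
    weights = softmax(scores)
    # Weighted sum of the human part features (values).
    attended = [
        sum(w * part[i] for w, part in zip(weights, human_parts))
        for i in range(dim)
    ]
    return attended, weights


# Toy features: three hypothetical human parts in a 2-D feature space.
object_query = [1.0, 0.0]
human_parts = [[2.0, 0.0], [0.0, 2.0], [-2.0, 0.0]]
attended, weights = cross_attend(object_query, human_parts)
print(weights)
print(attended)
```

In this toy setup the part whose feature is most aligned with the object query receives the largest weight, which is the sense in which the attention is conditioned on the object. A full model would typically add learned query, key, and value projections and multiple heads; those details are not given in the abstract.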
