Reasoning About Human-Object Interactions Through Dual Attention Networks

09/10/2019
by   Tete Xiao, et al.
3

Objects are entities we act upon, where the functionality of an object is determined by how we interact with it. In this work we propose a Dual Attention Network model which reasons about human-object interactions. The dual-attentional framework weights the important features for objects and actions respectively. As a result, the recognition of objects and actions mutually benefit each other. The proposed model shows competitive classification performance on the human-object interaction dataset Something-Something. Besides, it can perform weak spatiotemporal localization and affordance segmentation, despite being trained only with video-level labels. The model not only finds when an action is happening and which object is being manipulated, but also identifies which part of the object is being interacted with. Project page: <https://dual-attention-network.github.io/>.

READ FULL TEXT

page 4

page 7

page 8

research
06/03/2019

Grounded Human-Object Interaction Hotspots from Video (Extended Abstract)

Learning how to interact with objects is an important step towards embod...
research
03/19/2018

Attention-GAN for Object Transfiguration in Wild Images

This paper studies the object transfiguration problem in wild images. Th...
research
04/20/2022

THORN: Temporal Human-Object Relation Network for Action Recognition

Most action recognition models treat human activities as unitary events....
research
12/11/2018

Grounded Human-Object Interaction Hotspots from Video

Learning how to interact with objects is an important step towards embod...
research
01/07/2020

Visual-Semantic Graph Attention Network for Human-Object Interaction Detection

In scene understanding, machines benefit from not only detecting individ...
research
10/05/2017

A self-organizing neural network architecture for learning human-object interactions

The visual recognition of transitive actions comprising human-object int...
research
06/11/2022

Precise Affordance Annotation for Egocentric Action Video Datasets

Object affordance is an important concept in human-object interaction, p...

Please sign up or login with your details

Forgot password? Click here to reset