Hand-Object Interaction and Precise Localization in Transitive Action Recognition

11/12/2015
by   Amir Rosenfeld, et al.
0

Action recognition in still images has seen major improvement in recent years due to advances in human pose estimation, object recognition and stronger feature representations produced by deep neural networks. However, there are still many cases in which performance remains far from that of humans. A major difficulty arises in distinguishing between transitive actions in which the overall actor pose is similar, and recognition therefore depends on details of the grasp and the object, which may be largely occluded. In this paper we demonstrate how recognition is improved by obtaining precise localization of the action-object and consequently extracting details of the object shape together with the actor-object interaction. To obtain exact localization of the action object and its interaction with the actor, we employ a coarse-to-fine approach which combines semantic segmentation and contextual features, in successive stages. We focus on (but are not limited) to face-related actions, a set of actions that includes several currently challenging categories. We present an average relative improvement of 35 validate through experimentation the effectiveness of our approach.

READ FULL TEXT

page 1

page 2

page 4

page 5

page 7

research
01/17/2016

Face-space Action Recognition by Face-Object Interactions

Action recognition in still images has seen major improvement in recent ...
research
07/20/2023

MSQNet: Actor-agnostic Action Recognition with Multi-modal Query

Existing action recognition methods are typically actor-specific due to ...
research
10/07/2021

A Multi-viewpoint Outdoor Dataset for Human Action Recognition

Advancements in deep neural networks have contributed to near perfect re...
research
04/19/2022

ActAR: Actor-Driven Pose Embeddings for Video Action Recognition

Human action recognition (HAR) in videos is one of the core tasks of vid...
research
06/22/2019

Baidu-UTS Submission to the EPIC-Kitchens Action Recognition Challenge 2019

In this report, we present the Baidu-UTS submission to the EPIC-Kitchens...
research
09/13/2016

Lie-X: Depth Image Based Articulated Object Pose Estimation, Tracking, and Action Recognition on Lie Groups

Pose estimation, tracking, and action recognition of articulated objects...
research
09/12/2023

Grounded Language Acquisition From Object and Action Imagery

Deep learning approaches to natural language processing have made great ...

Please sign up or login with your details

Forgot password? Click here to reset