Skeleton-based Action Recognition of People Handling Objects

by Sunoh Kim, et al.
Seoul National University

In visual surveillance systems, it is necessary to recognize the behavior of people handling objects such as a phone, a cup, or a plastic bag. To address this problem, we propose a new framework that recognizes object-related human actions with graph convolutional networks, using both human and object poses. In this framework, we construct skeletal graphs of reliable human poses by selectively sampling the informative frames of a video, namely those that contain human joints with high confidence scores from pose estimation. The skeletal graphs generated from the sampled frames represent human poses in relation to the object position in both the spatial and temporal domains, and they serve as inputs to the graph convolutional networks. Experiments on an open benchmark and on our own data sets show that our method outperforms the state-of-the-art method for skeleton-based action recognition.
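The two ideas in the abstract, confidence-based frame sampling and a skeletal graph that includes the object, can be illustrated with a small Python sketch. It is not the authors' implementation. The abstract does not say how frames are scored or how the object enters the graph, so the mean-confidence score, the top-k selection, the single object node, and its links to the hand joints are all assumptions made for illustration. The function names and the toy 4-joint skeleton are hypothetical.

```python
from typing import List, Sequence, Tuple


def select_informative_frames(
    confidences: Sequence[Sequence[float]], num_frames: int
) -> List[int]:
    """Return indices of the `num_frames` frames with the highest mean
    joint confidence, in temporal order.

    Assumption: mean confidence and top-k selection stand in for the
    paper's sampling rule, which the abstract does not specify.
    """
    scores = [sum(frame) / len(frame) for frame in confidences]
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:num_frames])


def build_adjacency(
    num_joints: int,
    bones: Sequence[Tuple[int, int]],
    object_links: Sequence[int],
) -> List[List[int]]:
    """Build a symmetric adjacency matrix with self-loops for the human
    joints plus one extra object node (index `num_joints`).

    Assumption: the object is modeled as a single node connected to the
    joints listed in `object_links` (for example, the hands).
    """
    n = num_joints + 1
    object_node = num_joints
    adj = [[0] * n for _ in range(n)]
    for i in range(n):
        adj[i][i] = 1  # self-loop, as is common in GCN formulations
    for a, b in bones:
        adj[a][b] = adj[b][a] = 1
    for j in object_links:
        adj[j][object_node] = adj[object_node][j] = 1
    return adj


# Toy data (hypothetical): 4 frames, 4 joints per frame.
# Joints: 0 = head, 1 = torso, 2 = left hand, 3 = right hand.
toy_confidences = [
    [0.9, 0.8, 0.7, 0.9],
    [0.2, 0.3, 0.1, 0.4],  # poorly estimated frame
    [0.8, 0.9, 0.9, 0.7],
    [0.5, 0.4, 0.6, 0.5],
]
toy_bones = [(0, 1), (1, 2), (1, 3)]

kept_frames = select_informative_frames(toy_confidences, num_frames=2)
adjacency = build_adjacency(num_joints=4, bones=toy_bones, object_links=[2, 3])

print("kept frames:", kept_frames)
for row in adjacency:
    print(row)
```

In this sketch the spatial graph is shared by all kept frames. A temporal model would additionally connect each node to itself across consecutive kept frames, which is omitted here for brevity.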


Learning Connectivity with Graph Convolutional Networks for Skeleton-based Action Recognition

Learning graph convolutional networks (GCNs) is an emerging field which ...

Generalized Graph Convolutional Networks for Skeleton-based Action Recognition

With the prevalence of accessible depth sensors, dynamic human body skel...

Centrality Graph Convolutional Networks for Skeleton-based Action Recognition

The topological structure of skeleton data plays a significant role in h...

REGINA - Reasoning Graph Convolutional Networks in Human Action Recognition

It is known that the kinematics of the human body skeleton reveals valua...

Action Recognition via Pose-Based Graph Convolutional Networks with Intermediate Dense Supervision

Pose-based action recognition has drawn considerable attention recently....

GPRAR: Graph Convolutional Network based Pose Reconstruction and Action Recognition for Human Trajectory Prediction

Prediction with high accuracy is essential for various applications such...

Semantic Labeling of Human Action For Visually Impaired And Blind People Scene Interaction

The aim of this work is to contribute to the development of a tactile de...