Integrating Human Parsing and Pose Network for Human Action Recognition

07/16/2023
by   Runwei Ding, et al.
0

Human skeletons and RGB sequences are both widely-adopted input modalities for human action recognition. However, skeletons lack appearance features and color data suffer large amount of irrelevant depiction. To address this, we introduce human parsing feature map as a novel modality, since it can selectively retain spatiotemporal features of the body parts, while filtering out noises regarding outfits, backgrounds, etc. We propose an Integrating Human Parsing and Pose Network (IPP-Net) for action recognition, which is the first to leverage both skeletons and human parsing feature maps in dual-branch approach. The human pose branch feeds compact skeletal representations of different modalities in graph convolutional network to model pose features. In human parsing branch, multi-frame body-part parsing features are extracted with human detector and parser, which is later learnt using a convolutional backbone. A late ensemble of two branches is adopted to get final predictions, considering both robust keypoints and rich semantic body-part features. Extensive experiments on NTU RGB+D and NTU RGB+D 120 benchmarks consistently verify the effectiveness of the proposed IPP-Net, which outperforms the existing action recognition methods. Our code is publicly available at https://github.com/liujf69/IPP-Net-Parsing .

READ FULL TEXT

page 2

page 6

page 9

research
11/28/2019

Action Recognition via Pose-Based Graph Convolutional Networks with Intermediate Dense Supervision

Pose-based action recognition has drawn considerable attention recently....
research
10/07/2021

A Baseline Framework for Part-level Action Parsing and Action Recognition

This technical report introduces our 2nd place solution to Kinetics-TPS ...
research
12/18/2022

2D Pose Estimation based Child Action Recognition

We present a graph convolutional network with 2D pose estimation for the...
research
05/04/2020

Correlating Edge, Pose with Parsing

According to existing studies, human body edge and pose are two benefici...
research
12/15/2019

Multi-task Deep Learning for Real-Time 3D Human Pose Estimation and Action Recognition

Human pose estimation and action recognition are related tasks since bot...
research
11/05/2021

Technical Report: Disentangled Action Parsing Networks for Accurate Part-level Action Parsing

Part-level Action Parsing aims at part state parsing for boosting action...
research
07/28/2014

A discussion on the validation tests employed to compare human action recognition methods using the MSR Action3D dataset

This paper aims to determine which is the best human action recognition ...

Please sign up or login with your details

Forgot password? Click here to reset