Stronger, Faster and More Explainable: A Graph Convolutional Baseline for Skeleton-based Action Recognition

10/20/2020
by   Yi-Fan Song, et al.
0

One essential problem in skeleton-based action recognition is how to extract discriminative features over all skeleton joints. However, the complexity of the State-Of-The-Art (SOTA) models of this task tends to be exceedingly sophisticated and over-parameterized, where the low efficiency in model training and inference has obstructed the development in the field, especially for large-scale action datasets. In this work, we propose an efficient but strong baseline based on Graph Convolutional Network (GCN), where three main improvements are aggregated, i.e., early fused Multiple Input Branches (MIB), Residual GCN (ResGCN) with bottleneck structure and Part-wise Attention (PartAtt) block. Firstly, an MIB is designed to enrich informative skeleton features and remain compact representations at an early fusion stage. Then, inspired by the success of the ResNet architecture in Convolutional Neural Network (CNN), a ResGCN module is introduced in GCN to alleviate computational costs and reduce learning difficulties in model training while maintain the model accuracy. Finally, a PartAtt block is proposed to discover the most essential body parts over a whole action sequence and obtain more explainable representations for different skeleton action sequences. Extensive experiments on two large-scale datasets, i.e., NTU RGB+D 60 and 120, validate that the proposed baseline slightly outperforms other SOTA models and meanwhile requires much fewer parameters during training and inference procedures, e.g., at most 34 times less than DGNN, which is one of the best SOTA methods.

READ FULL TEXT

page 3

page 5

research
06/29/2021

Constructing Stronger and Faster Baselines for Skeleton-based Action Recognition

One essential problem in skeleton-based action recognition is how to ext...
research
08/09/2020

Richly Activated Graph Convolutional Network for Robust Skeleton-based Action Recognition

Current methods for skeleton-based human action recognition usually work...
research
07/29/2020

Dynamic GCN: Context-enriched Topology Learning for Skeleton-based Action Recognition

Graph Convolutional Networks (GCNs) have attracted increasing interests ...
research
10/23/2020

Temporal Attention-Augmented Graph Convolutional Network for Efficient Skeleton-Based Human Action Recognition

Graph convolutional networks (GCNs) have been very successful in modelin...
research
12/09/2018

A Structured Model For Action Detection

A dominant paradigm for learning-based approaches in computer vision is ...
research
11/11/2019

Learning Graph Convolutional Network for Skeleton-based Human Action Recognition by Neural Searching

Human action recognition from skeleton data, fueled by the Graph Convolu...
research
09/21/2022

Adaptive Local-Component-aware Graph Convolutional Network for One-shot Skeleton-based Action Recognition

Skeleton-based action recognition receives increasing attention because ...

Please sign up or login with your details

Forgot password? Click here to reset