Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition

03/31/2020
by   Ziyu Liu, et al.
4

Spatial-temporal graphs have been widely used by skeleton-based action recognition algorithms to model human action dynamics. To capture robust movement patterns from these graphs, long-range and multi-scale context aggregation and spatial-temporal dependency modeling are critical aspects of a powerful feature extractor. However, existing methods have limitations in achieving (1) unbiased long-range joint relationship modeling under multi-scale operators and (2) unobstructed cross-spacetime information flow for capturing complex spatial-temporal dependencies. In this work, we present (1) a simple method to disentangle multi-scale graph convolutions and (2) a unified spatial-temporal graph convolutional operator named G3D. The proposed multi-scale aggregation scheme disentangles the importance of nodes in different neighborhoods for effective long-range modeling. The proposed G3D module leverages dense cross-spacetime edges as skip connections for direct information propagation across the spatial-temporal graph. By coupling these proposals, we develop a powerful feature extractor named MS-G3D based on which our model outperforms previous state-of-the-art methods on three large-scale datasets: NTU RGB+D 60, NTU RGB+D 120, and Kinetics Skeleton 400.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/27/2022

Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition

Graph convolutional networks have been widely used for skeleton-based ac...
research
07/14/2023

One-Shot Action Recognition via Multi-Scale Spatial-Temporal Skeleton Matching

One-shot skeleton action recognition, which aims to learn a skeleton act...
research
08/18/2022

Spatial Temporal Graph Attention Network for Skeleton-Based Action Recognition

It's common for current methods in skeleton-based action recognition to ...
research
05/30/2020

Complex Sequential Understanding through the Awareness of Spatial and Temporal Concepts

Understanding sequential information is a fundamental task for artificia...
research
11/07/2021

Multi-Scale Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition

Skeleton data is of low dimension. However, there is a trend of using ve...
research
05/31/2019

Graph WaveNet for Deep Spatial-Temporal Graph Modeling

Spatial-temporal graph modeling is an important task to analyze the spat...
research
01/12/2022

Semantic Labeling of Human Action For Visually Impaired And Blind People Scene Interaction

The aim of this work is to contribute to the development of a tactile de...

Please sign up or login with your details

Forgot password? Click here to reset