Spatial-temporal Transformer-guided Diffusion based Data Augmentation for Efficient Skeleton-based Action Recognition

02/26/2023
by   Yifan Jiang, et al.
0

Recently, skeleton-based human action has become a hot research topic because the compact representation of human skeletons brings new blood to this research domain. As a result, researchers began to notice the importance of using RGB or other sensors to analyze human action by extracting skeleton information. Leveraging the rapid development of deep learning (DL), a significant number of skeleton-based human action approaches have been presented with fine-designed DL structures recently. However, a well-trained DL model always demands high-quality and sufficient data, which is hard to obtain without costing high expenses and human labor. In this paper, we introduce a novel data augmentation method for skeleton-based action recognition tasks, which can effectively generate high-quality and diverse sequential actions. In order to obtain natural and realistic action sequences, we propose denoising diffusion probabilistic models (DDPMs) that can generate a series of synthetic action sequences, and their generation process is precisely guided by a spatial-temporal transformer (ST-Trans). Experimental results show that our method outperforms the state-of-the-art (SOTA) motion generation approaches on different naturality and diversity metrics. It proves that its high-quality synthetic data can also be effectively deployed to existing action recognition models with significant performance improvement.

READ FULL TEXT
research
07/14/2023

One-Shot Action Recognition via Multi-Scale Spatial-Temporal Skeleton Matching

One-shot skeleton action recognition, which aims to learn a skeleton act...
research
01/30/2023

Action Capsules: Human Skeleton Action Recognition

Due to the compact and rich high-level representations offered, skeleton...
research
07/20/2022

An Efficient Framework for Few-shot Skeleton-based Temporal Action Segmentation

Temporal action segmentation (TAS) aims to classify and locate actions i...
research
01/24/2023

Bipartite Graph Diffusion Model for Human Interaction Generation

The generation of natural human motion interactions is a hot topic in co...
research
04/09/2017

Modeling Temporal Dynamics and Spatial Configurations of Actions Using Two-Stream Recurrent Neural Networks

Recently, skeleton based action recognition gains more popularity due to...
research
10/26/2021

IIP-Transformer: Intra-Inter-Part Transformer for Skeleton-Based Action Recognition

Recently, Transformer-based networks have shown great promise on skeleto...
research
04/14/2023

Skeleton-based action analysis for ADHD diagnosis

Attention Deficit Hyperactivity Disorder (ADHD) is a common neurobehavio...

Please sign up or login with your details

Forgot password? Click here to reset