Language Knowledge-Assisted Representation Learning for Skeleton-Based Action Recognition

05/21/2023
by   Haojun Xu, et al.
0

How humans understand and recognize the actions of others is a complex neuroscientific problem that involves a combination of cognitive mechanisms and neural networks. Research has shown that humans have brain areas that recognize actions that process top-down attentional information, such as the temporoparietal association area. Also, humans have brain regions dedicated to understanding the minds of others and analyzing their intentions, such as the medial prefrontal cortex of the temporal lobe. Skeleton-based action recognition creates mappings for the complex connections between the human skeleton movement patterns and behaviors. Although existing studies encoded meaningful node relationships and synthesized action representations for classification with good results, few of them considered incorporating a priori knowledge to aid potential representation learning for better performance. LA-GCN proposes a graph convolution network using large-scale language models (LLM) knowledge assistance. First, the LLM knowledge is mapped into a priori global relationship (GPR) topology and a priori category relationship (CPR) topology between nodes. The GPR guides the generation of new "bone" representations, aiming to emphasize essential node information from the data level. The CPR mapping simulates category prior knowledge in human brain regions, encoded by the PC-AC module and used to add additional supervision-forcing the model to learn class-distinguishable features. In addition, to improve information transfer efficiency in topology modeling, we propose multi-hop attention graph convolution. It aggregates each node's k-order neighbor simultaneously to speed up model convergence. LA-GCN reaches state-of-the-art on NTU RGB+D, NTU RGB+D 120, and NW-UCLA datasets.

READ FULL TEXT
research
07/29/2020

Dynamic GCN: Context-enriched Topology Learning for Skeleton-based Action Recognition

Graph Convolutional Networks (GCNs) have attracted increasing interests ...
research
04/26/2019

Actional-Structural Graph Convolutional Networks for Skeleton-based Action Recognition

Action recognition with skeleton data has recently attracted much attent...
research
08/30/2023

Topology-aware MLP for Skeleton-based Action Recognition

Graph convolution networks (GCNs) have achieved remarkable performance i...
research
03/31/2022

SpatioTemporal Focus for Skeleton-based Action Recognition

Graph convolutional networks (GCNs) are widely adopted in skeleton-based...
research
04/23/2023

TSGCNeXt: Dynamic-Static Multi-Graph Convolution for Efficient Skeleton-Based Action Recognition with Long-term Learning Potential

Skeleton-based action recognition has achieved remarkable results in hum...
research
03/07/2023

Learning Discriminative Representations for Skeleton Based Action Recognition

Human action recognition aims at classifying the category of human actio...
research
08/03/2021

Skeleton Split Strategies for Spatial Temporal Graph Convolution Networks

A skeleton representation of the human body has been proven to be effect...

Please sign up or login with your details

Forgot password? Click here to reset