Language Supervised Training for Skeleton-based Action Recognition

08/10/2022
by Wangmeng Xiang, et al.

Skeleton-based action recognition has drawn considerable attention for its computational efficiency and robustness to lighting conditions. Existing skeleton-based action recognition methods are typically formulated as one-hot classification tasks and do not fully utilize the semantic relations between actions. For example, "make victory sign" and "thumb up" are both hand gestures whose major difference lies in the movement of the hands. This information is absent from the categorical one-hot encoding of action classes but can be revealed by the language descriptions of the actions. Utilizing action language descriptions in training could therefore benefit representation learning. In this work, we propose a Language Supervised Training (LST) approach for skeleton-based action recognition. More specifically, we employ a large-scale language model as a knowledge engine to provide text descriptions of the body-part movements of actions, and propose a multi-modal training scheme in which a text encoder generates feature vectors for different body parts and supervises the skeleton encoder for action representation learning. Experiments show that the proposed LST method achieves noticeable improvements over various baseline models without extra computation cost at inference. LST achieves new state-of-the-art results on popular skeleton-based action recognition benchmarks, including NTU RGB+D, NTU RGB+D 120 and NW-UCLA. The code can be found at https://github.com/MartinXM/LST.
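
The multi-modal training scheme described above can be illustrated with a minimal sketch. The abstract does not state the loss function, so this example assumes a simple cross-entropy over temperature-scaled cosine similarities between a skeleton feature for one body part and the per-class text features for that part. The function names (`part_text_loss`, `combined_loss`), the temperature, and the loss weight are hypothetical and are not taken from the LST code.

```python
import math

def l2_normalize(v):
    # Scale a vector to unit length (zero vectors are left unchanged).
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]

def log_softmax(row):
    # Numerically stable log-softmax over a list of logits.
    m = max(row)
    lse = m + math.log(sum(math.exp(x - m) for x in row))
    return [x - lse for x in row]

def part_text_loss(skel_feats, text_feats, labels, temperature=0.1):
    """Assumed alignment loss for ONE body part (hypothetical helper).

    skel_feats: one skeleton-encoder feature vector per sample
    text_feats: one text-encoder feature vector per action class,
                encoding the description of this part's movement
    labels:     ground-truth class index per sample
    """
    skel = [l2_normalize(v) for v in skel_feats]
    text = [l2_normalize(v) for v in text_feats]
    total = 0.0
    for s, y in zip(skel, labels):
        # Cosine similarity to every class description, scaled by temperature.
        logits = [sum(a * b for a, b in zip(s, t)) / temperature for t in text]
        # Cross-entropy: push the sample toward its own class description.
        total -= log_softmax(logits)[y]
    return total / len(skel)

def combined_loss(cls_loss, part_losses, weight=1.0):
    # Assumed total objective: classification loss plus weighted text supervision.
    return cls_loss + weight * sum(part_losses)

# Toy example: two classes, 2-D features for a single "hand" part.
text_hand = [[1.0, 0.0], [0.0, 1.0]]
skel_hand = [[0.9, 0.1], [0.2, 0.8]]
aligned = part_text_loss(skel_hand, text_hand, labels=[0, 1])
swapped = part_text_loss(skel_hand, text_hand, labels=[1, 0])
print(aligned < swapped)
```

In this sketch the text features appear only inside the training loss. A model trained this way would run the skeleton encoder and classifier alone at test time, which is consistent with the abstract's claim of no extra computation cost at inference.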


