HAA4D: Few-Shot Human Atomic Action Recognition via 3D Spatio-Temporal Skeletal Alignment

02/15/2022
by   Mu-Ruei Tseng, et al.
1

Human actions involve complex pose variations and their 2D projections can be highly ambiguous. Thus 3D spatio-temporal or 4D (i.e., 3D+T) human skeletons, which are photometric and viewpoint invariant, are an excellent alternative to 2D+T skeletons/pixels to improve action recognition accuracy. This paper proposes a new 4D dataset HAA4D which consists of more than 3,300 RGB videos in 300 human atomic action classes. HAA4D is clean, diverse, class-balanced where each class is viewpoint-balanced with the use of 4D skeletons, in which as few as one 4D skeleton per class is sufficient for training a deep recognition model. Further, the choice of atomic actions makes annotation even easier, because each video clip lasts for only a few seconds. All training and testing 3D skeletons in HAA4D are globally aligned, using a deep alignment model to the same global space, making each skeleton face the negative z-direction. Such alignment makes matching skeletons more stable by reducing intraclass variations and thus with fewer training samples per class needed for action recognition. Given the high diversity and skeletal alignment in HAA4D, we construct the first baseline few-shot 4D human atomic action recognition network without bells and whistles, which produces comparable or higher performance than relevant state-of-the-art techniques relying on embedded space encoding without explicit skeletal alignment, using the same small number of training samples of unseen classes.

READ FULL TEXT

page 1

page 2

page 5

page 6

research
11/17/2020

Semi-Supervised Few-Shot Atomic Action Recognition

Despite excellent progress has been made, the performance on action reco...
research
09/01/2020

View-invariant action recognition

Human action recognition is an important problem in computer vision. It ...
research
09/11/2020

HAA500: Human-Centric Atomic Action Dataset with Curated Videos

We contribute HAA500, a manually annotated human-centric atomic action d...
research
12/02/2019

Skeleton based Activity Recognition by Fusing Part-wise Spatio-temporal and Attention Driven Residues

There exist a wide range of intra class variations of the same actions a...
research
12/09/2019

Synthetic Humans for Action Recognition from Unseen Viewpoints

Our goal in this work is to improve the performance of human action reco...
research
07/12/2022

Compound Prototype Matching for Few-shot Action Recognition

Few-shot action recognition aims to recognize novel action classes using...
research
09/15/2017

Viewpoint Invariant Action Recognition using RGB-D Videos

In video-based action recognition, viewpoint variations often pose major...

Please sign up or login with your details

Forgot password? Click here to reset