A Two-stream Neural Network for Pose-based Hand Gesture Recognition

01/22/2021
by   Chuankun Li, et al.
6

Pose based hand gesture recognition has been widely studied in the recent years. Compared with full body action recognition, hand gesture involves joints that are more spatially closely distributed with stronger collaboration. This nature requires a different approach from action recognition to capturing the complex spatial features. Many gesture categories, such as "Grab" and "Pinch", have very similar motion or temporal patterns posing a challenge on temporal processing. To address these challenges, this paper proposes a two-stream neural network with one stream being a self-attention based graph convolutional network (SAGCN) extracting the short-term temporal information and hierarchical spatial information, and the other being a residual-connection enhanced bidirectional Independently Recurrent Neural Network (RBi-IndRNN) for extracting long-term temporal information. The self-attention based graph convolutional network has a dynamic self-attention mechanism to adaptively exploit the relationships of all hand joints in addition to the fixed topology and local feature extraction in the GCN. On the other hand, the residual-connection enhanced Bi-IndRNN extends an IndRNN with the capability of bidirectional processing for temporal modelling. The two streams are fused together for recognition. The Dynamic Hand Gesture dataset and First-Person Hand Action dataset are used to validate its effectiveness, and our method achieves state-of-the-art performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 7

page 9

page 10

research
06/25/2021

HAN: An Efficient Hierarchical Self-Attention Network for Skeleton-Based Gesture Recognition

Previous methods for skeleton-based gesture recognition mostly arrange t...
research
12/17/2021

Self-attention based anchor proposal for skeleton-based action recognition

Skeleton sequences are widely used for action recognition task due to it...
research
02/25/2023

Temporal-Channel Topology Enhanced Network for Skeleton-Based Action Recognition

Skeleton-based action recognition has become popular in recent years due...
research
04/19/2018

Motion Fused Frames: Data Level Fusion Strategy for Hand Gesture Recognition

Acquiring spatio-temporal states of an action is the most crucial step f...
research
03/19/2018

Attention-based Temporal Weighted Convolutional Neural Network for Action Recognition

Research in human action recognition has accelerated significantly since...
research
06/05/2015

Beyond Temporal Pooling: Recurrence and Temporal Convolutions for Gesture Recognition in Video

Recent studies have demonstrated the power of recurrent neural networks ...

Please sign up or login with your details

Forgot password? Click here to reset