Pose-based Sign Language Recognition using GCN and BERT

12/01/2020
by   Anirudh Tunga, et al.
0

Sign language recognition (SLR) plays a crucial role in bridging the communication gap between the hearing and vocally impaired community and the rest of the society. Word-level sign language recognition (WSLR) is the first important step towards understanding and interpreting sign language. However, recognizing signs from videos is a challenging task as the meaning of a word depends on a combination of subtle body motions, hand configurations, and other movements. Recent pose-based architectures for WSLR either model both the spatial and temporal dependencies among the poses in different frames simultaneously or only model the temporal information without fully utilizing the spatial information. We tackle the problem of WSLR using a novel pose-based approach, which captures spatial and temporal information separately and performs late fusion. Our proposed architecture explicitly captures the spatial interactions in the video using a Graph Convolutional Network (GCN). The temporal dependencies between the frames are captured using Bidirectional Encoder Representations from Transformers (BERT). Experimental results on WLASL, a standard word-level sign language recognition dataset show that our model significantly outperforms the state-of-the-art on pose-based methods by achieving an improvement in the prediction accuracy by up to 5

READ FULL TEXT

page 4

page 6

page 7

research
01/31/2019

Spatial-Temporal Graph Convolutional Networks for Sign Language Recognition

The recognition of sign language is a challenging task with an important...
research
10/24/2019

Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison

Vision-based sign language recognition aims at helping the hearing-impai...
research
04/11/2023

Multi-Graph Convolution Network for Pose Forecasting

Recently, there has been a growing interest in predicting human motion, ...
research
09/21/2023

SlowFast Network for Continuous Sign Language Recognition

The objective of this work is the effective extraction of spatial and dy...
research
12/21/2022

SLGTformer: An Attention-Based Approach to Sign Language Recognition

Sign language is the preferred method of communication of deaf or mute p...
research
01/11/2020

Towards Generalizable Surgical Activity Recognition Using Spatial Temporal Graph Convolutional Networks

Modeling and recognition of surgical activities poses an interesting res...
research
12/04/2019

Trajectory-Based Recognition of Dynamic Persian Sign Language Using Hidden Markov Model

Sign Language Recognition (SLR) is an important step in facilitating the...

Please sign up or login with your details

Forgot password? Click here to reset