Multi-View Spatial-Temporal Network for Continuous Sign Language Recognition

04/19/2022
by   Ronghui Li, et al.
0

Sign language is a beautiful visual language and is also the primary language used by speaking and hearing-impaired people. However, sign language has many complex expressions, which are difficult for the public to understand and master. Sign language recognition algorithms will significantly facilitate communication between hearing-impaired people and normal people. Traditional continuous sign language recognition often uses a sequence learning method based on Convolutional Neural Network (CNN) and Long Short-Term Memory Network (LSTM). These methods can only learn spatial and temporal features separately, which cannot learn the complex spatial-temporal features of sign language. LSTM is also difficult to learn long-term dependencies. To alleviate these problems, this paper proposes a multi-view spatial-temporal continuous sign language recognition network. The network consists of three parts. The first part is a Multi-View Spatial-Temporal Feature Extractor Network (MSTN), which can directly extract the spatial-temporal features of RGB and skeleton data; the second is a sign language encoder network based on Transformer, which can learn long-term dependencies; the third is a Connectionist Temporal Classification (CTC) decoder network, which is used to predict the whole meaning of the continuous sign language. Our algorithm is tested on two public sign language datasets SLR-100 and PHOENIX-Weather 2014T (RWTH). As a result, our method achieves excellent performance on both datasets. The word error rate on the SLR-100 dataset is 1.9 dataset is 22.8

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 10

page 11

page 12

research
07/28/2021

Egyptian Sign Language Recognition Using CNN and LSTM

Sign language is a set of gestures that deaf people use to communicate. ...
research
12/25/2022

StepNet: Spatial-temporal Part-aware Network for Sign Language Recognition

Sign language recognition (SLR) aims to overcome the communication barri...
research
02/08/2020

Spatial-Temporal Multi-Cue Network for Continuous Sign Language Recognition

Despite the recent success of deep learning in continuous sign language ...
research
09/21/2023

SlowFast Network for Continuous Sign Language Recognition

The objective of this work is the effective extraction of spatial and dy...
research
01/31/2019

Spatial-Temporal Graph Convolutional Networks for Sign Language Recognition

The recognition of sign language is a challenging task with an important...
research
11/07/2022

Temporal superimposed crossover module for effective continuous sign language

The ultimate goal of continuous sign language recognition(CSLR) is to fa...
research
02/22/2023

Multi-View Bangla Sign Language(MV-BSL) Dataset and Continuous BSL Recognition

Being able to express our thoughts, feelings, and ideas to one another i...

Please sign up or login with your details

Forgot password? Click here to reset