SlowFast Network for Continuous Sign Language Recognition

09/21/2023
by Junseok Ahn, et al.

The objective of this work is the effective extraction of spatial and dynamic features for Continuous Sign Language Recognition (CSLR). To accomplish this, we utilise a two-pathway SlowFast network, in which the pathways operate at distinct temporal resolutions to separately capture spatial (hand shapes, facial expressions) and dynamic (movements) information. In addition, we introduce two feature fusion methods designed for the characteristics of CSLR: (1) Bi-directional Feature Fusion (BFF), which transfers dynamic semantics into spatial semantics and vice versa; and (2) Pathway Feature Enhancement (PFE), which enriches dynamic and spatial representations through auxiliary subnetworks without adding inference time. As a result, our model strengthens spatial and dynamic representations in parallel. We demonstrate that the proposed framework outperforms the current state of the art on popular CSLR datasets, including PHOENIX14, PHOENIX14-T, and CSL-Daily.
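
To make the two-pathway idea concrete, the following is a minimal toy sketch, not the authors' implementation. It assumes a hypothetical temporal stride `alpha` between the pathways, represents each frame as a plain list of floats, and substitutes simple element-wise addition for the learned fusion layers a real model would use. It also assumes both pathways share the same feature width, which a real network need not do. All function names are illustrative.

```python
from typing import List, Tuple

Features = List[List[float]]  # one feature vector per frame (toy representation)


def sample_pathways(frames: Features, alpha: int = 4) -> Tuple[Features, Features]:
    """Slow pathway keeps every alpha-th frame; fast pathway keeps all frames."""
    slow = frames[::alpha]
    fast = list(frames)
    return slow, fast


def fast_to_slow(slow: Features, fast: Features, alpha: int = 4) -> Features:
    """Inject dynamic information into the slow pathway.

    The fast features are strided in time to match the slow frame rate,
    then added element-wise (an assumed stand-in for a learned fusion).
    """
    strided = fast[::alpha]
    return [[s + f for s, f in zip(sv, fv)] for sv, fv in zip(slow, strided)]


def slow_to_fast(slow: Features, fast: Features, alpha: int = 4) -> Features:
    """Inject spatial information into the fast pathway.

    Each fast frame t receives the slow feature at index t // alpha
    (nearest-neighbour upsampling in time), added element-wise.
    """
    return [[f + s for f, s in zip(fv, slow[t // alpha])] for t, fv in enumerate(fast)]


def bidirectional_fusion(
    slow: Features, fast: Features, alpha: int = 4
) -> Tuple[Features, Features]:
    """Apply both transfer directions using the pre-fusion features of each pathway."""
    return fast_to_slow(slow, fast, alpha), slow_to_fast(slow, fast, alpha)


if __name__ == "__main__":
    frames = [[float(t)] for t in range(8)]  # 8 frames, 1-dimensional features
    slow, fast = sample_pathways(frames, alpha=4)
    new_slow, new_fast = bidirectional_fusion(slow, fast, alpha=4)
    print(len(slow), len(fast))
    print(new_slow)
    print(new_fast)
```

The point of the sketch is only the shape bookkeeping: the slow pathway sees fewer frames, so transfer in one direction needs temporal downsampling and transfer in the other needs temporal upsampling.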

research
04/19/2022

Multi-View Spatial-Temporal Network for Continuous Sign Language Recognition

Sign language is a beautiful visual language and is also the primary lan...
research
10/12/2021

Sign Language Recognition via Skeleton-Aware Multi-Model Ensemble

Sign language is commonly used by deaf or mute people to communicate but...
research
12/01/2020

Pose-based Sign Language Recognition using GCN and BERT

Sign language recognition (SLR) plays a crucial role in bridging the com...
research
01/31/2019

Spatial-Temporal Graph Convolutional Networks for Sign Language Recognition

The recognition of sign language is a challenging task with an important...
research
12/26/2022

Improving Continuous Sign Language Recognition with Consistency Constraints and Signer Removal

Most deep-learning-based continuous sign language recognition (CSLR) mod...
research
07/18/2022

Temporal Lift Pooling for Continuous Sign Language Recognition

Pooling methods are necessities for modern neural networks for increasin...
