Towards the extraction of robust sign embeddings for low resource sign language recognition

06/30/2023
by Mathieu De Coster, et al.

Isolated Sign Language Recognition (SLR) has mostly been applied to relatively large datasets containing signs executed slowly and clearly by a limited group of signers. In real-world scenarios, however, we are met with challenging visual conditions, coarticulated signing, small datasets, and the need for signer-independent models. To tackle this difficult problem, we require a robust feature extractor to process the sign language videos. One could expect human pose estimators to be ideal candidates. However, due to a domain mismatch with their training sets and the challenging poses in sign language, they lack robustness on sign language data, and image-based models often still outperform keypoint-based models. Furthermore, whereas the common practice of transfer learning with image-based models yields even higher accuracy, keypoint-based models are typically trained from scratch on every SLR dataset. These factors limit their usefulness for SLR. From the existing literature, it is also not clear which pose estimator, if any, performs best for SLR. We compare the three most popular pose estimators for SLR: OpenPose, MMPose and MediaPipe. We show that through keypoint normalization, missing keypoint imputation, and learning a pose embedding, we can obtain significantly better results and enable transfer learning. We show that keypoint-based embeddings contain cross-lingual features: they can transfer between sign languages and achieve competitive performance even when only the classifier layer of an SLR model is fine-tuned on a target sign language. Furthermore, fine-tuned transferred embeddings achieve better performance than models trained only on the target sign language. The application of these embeddings could prove particularly useful for low-resource sign languages in the future.
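
The abstract names keypoint normalization and missing keypoint imputation but does not say how either is done. The sketch below is one plausible reading, not the authors' method: it assumes 2D keypoints, fills gaps by linear interpolation over time, and normalizes each frame by centring on the shoulder midpoint and dividing by the shoulder width. The function names `impute_missing` and `normalize`, and the choice of shoulders as reference points, are hypothetical.

```python
import math
from typing import List, Optional, Tuple

# None marks a keypoint the pose estimator failed to detect (assumed convention).
Point = Optional[Tuple[float, float]]
Frame = List[Point]


def impute_missing(frames: List[Frame]) -> List[Frame]:
    """Fill missing keypoints by linear interpolation over time, per keypoint.

    Gaps at the start or end of a clip copy the nearest observed value.
    A keypoint that is never observed stays None.
    """
    if not frames:
        return []
    n_frames, n_points = len(frames), len(frames[0])
    out = [list(f) for f in frames]
    for k in range(n_points):
        seen = [t for t in range(n_frames) if frames[t][k] is not None]
        if not seen:
            continue
        for t in range(n_frames):
            if frames[t][k] is not None:
                continue
            prev = max((s for s in seen if s < t), default=None)
            nxt = min((s for s in seen if s > t), default=None)
            if prev is None:
                out[t][k] = frames[nxt][k]
            elif nxt is None:
                out[t][k] = frames[prev][k]
            else:
                w = (t - prev) / (nxt - prev)
                (x0, y0), (x1, y1) = frames[prev][k], frames[nxt][k]
                out[t][k] = (x0 + w * (x1 - x0), y0 + w * (y1 - y0))
    return out


def normalize(frame: Frame, left_shoulder: int, right_shoulder: int) -> Frame:
    """Centre a frame on the shoulder midpoint and scale by shoulder width."""
    ls, rs = frame[left_shoulder], frame[right_shoulder]
    if ls is None or rs is None:
        return list(frame)  # no reference points, leave the frame unchanged
    cx, cy = (ls[0] + rs[0]) / 2, (ls[1] + rs[1]) / 2
    width = math.hypot(ls[0] - rs[0], ls[1] - rs[1])
    if width == 0:
        return list(frame)  # degenerate pose, avoid dividing by zero
    return [
        None if p is None else ((p[0] - cx) / width, (p[1] - cy) / width)
        for p in frame
    ]


# Toy clip: keypoints 0 and 1 are the shoulders, keypoint 2 is a wrist
# that the estimator missed in the middle frame.
clip = [
    [(100.0, 100.0), (200.0, 100.0), (150.0, 200.0)],
    [(100.0, 100.0), (200.0, 100.0), None],
    [(100.0, 100.0), (200.0, 100.0), (170.0, 220.0)],
]
filled = impute_missing(clip)
normalized = [normalize(f, 0, 1) for f in filled]
```

Imputing before normalizing means a frame whose shoulder keypoints were missed can still be normalized once they have been interpolated. Normalizing by a body-relative scale removes the signer's position and distance from the camera, which is one reason such features might transfer across datasets.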

research
10/12/2021

OpenHands: Making Sign Language Recognition Accessible with Pose-based Pretrained Models across Languages

AI technologies for Natural Languages have made tremendous progress rece...
research
06/03/2020

Transfer Learning for British Sign Language Modelling

Automatic speech recognition and spoken dialogue systems have made great...
research
08/18/2023

Learnt Contrastive Concept Embeddings for Sign Recognition

In natural language processing (NLP) of spoken languages, word embedding...
research
03/22/2023

CiCo: Domain-Aware Sign Language Retrieval via Cross-Lingual Contrastive Learning

This work focuses on sign language retrieval-a recently proposed task fo...
research
08/21/2023

Improving Continuous Sign Language Recognition with Cross-Lingual Signs

This work dedicates to continuous sign language recognition (CSLR), whic...
research
01/10/2022

Thai Finger Spelling Recognition: Investigating MediaPipe Hands Potentials

Thai Finger Spelling (TFS) sign recognition could benefit a community of...
research
05/06/2021

Pose-Guided Sign Language Video GAN with Dynamic Lambda

We propose a novel approach for the synthesis of sign language videos us...
