SignNet: Single Channel Sign Generation using Metric Embedded Learning

A true interpreting agent not only understands sign language and translates to text, but also understands text and translates to signs. Much of the AI work in sign language translation to date has focused mainly on translating from signs to text. Towards the latter goal, we propose a text-to-sign translation model, SignNet, which exploits the notion of similarity (and dissimilarity) of visual signs in translating. This module presented is only one part of a dual-learning two task process involving text-to-sign (T2S) as well as sign-to-text (S2T). We currently implement SignNet as a single channel architecture so that the output of the T2S task can be fed into S2T in a continuous dual learning framework. By single channel, we refer to a single modality, the body pose joints. In this work, we present SignNet, a T2S task using a novel metric embedding learning process, to preserve the distances between sign embeddings relative to their dissimilarity. We also describe how to choose positive and negative examples of signs for similarity testing. From our analysis, we observe that metric embedding learning-based model perform significantly better than the other models with traditional losses, when evaluated using BLEU scores. In the task of gloss to pose, SignNet performed as well as its state-of-the-art (SoTA) counterparts and outperformed them in the task of text to pose, by showing noteworthy enhancements in BLEU 1 - BLEU 4 scores (BLEU 1: 31->39;  26 improvement and BLEU 4: 10.43->11.84;  14% improvement) when tested on the popular RWTH PHOENIX-Weather-2014T benchmark dataset

READ FULL TEXT

page 2

page 3

page 4

page 6

research
04/01/2020

Sign Language Translation with Transformers

Sign Language Translation (SLT) first uses a Sign Language Recognition (...
research
12/02/2022

Tackling Low-Resourced Sign Language Translation: UPC at WMT-SLT 22

This paper describes the system developed at the Universitat Politècnica...
research
04/13/2023

Sign Language Translation from Instructional Videos

The advances in automatic sign language translation (SLT) to spoken lang...
research
10/24/2022

Clean Text and Full-Body Transformer: Microsoft's Submission to the WMT22 Shared Task on Sign Language Translation

This paper describes Microsoft's submission to the first shared task on ...
research
09/04/2023

Attention-Driven Multi-Modal Fusion: Enhancing Sign Language Recognition and Translation

In this paper, we devise a mechanism for the addition of multi-modal inf...
research
09/01/2020

Multi-channel Transformers for Multi-articulatory Sign Language Translation

Sign languages use multiple asynchronous information channels (articulat...
research
08/30/2023

SignDiff: Learning Diffusion Models for American Sign Language Production

The field of Sign Language Production (SLP) lacked a large-scale, pre-tr...

Please sign up or login with your details

Forgot password? Click here to reset