Continuous 3D Multi-Channel Sign Language Production via Progressive Transformers and Mixture Density Networks

03/11/2021
by   Ben Saunders, et al.
4

Sign languages are multi-channel visual languages, where signers use a continuous 3D space to communicate.Sign Language Production (SLP), the automatic translation from spoken to sign languages, must embody both the continuous articulation and full morphology of sign to be truly understandable by the Deaf community. Previous deep learning-based SLP works have produced only a concatenation of isolated signs focusing primarily on the manual features, leading to a robotic and non-expressive production. In this work, we propose a novel Progressive Transformer architecture, the first SLP model to translate from spoken language sentences to continuous 3D multi-channel sign pose sequences in an end-to-end manner. Our transformer network architecture introduces a counter decoding that enables variable length continuous sequence generation by tracking the production progress over time and predicting the end of sequence. We present extensive data augmentation techniques to reduce prediction drift, alongside an adversarial training regime and a Mixture Density Network (MDN) formulation to produce realistic and expressive sign pose sequences. We propose a back translation evaluation mechanism for SLP, presenting benchmark quantitative results on the challenging PHOENIX14T dataset and setting baselines for future research. We further provide a user evaluation of our SLP model, to understand the Deaf reception of our sign pose productions.

READ FULL TEXT

page 2

page 12

page 18

page 19

research
04/30/2020

Progressive Transformers for End-to-End Sign Language Production

The goal of automatic Sign Language Production (SLP) is to translate spo...
research
08/27/2020

Adversarial Training for Multi-Channel Sign Language Production

Sign Languages are rich multi-channel languages, requiring articulation ...
research
07/23/2021

Mixed SIGNals: Sign Language Production via a Mixture of Motion Primitives

It is common practice to represent spoken languages at their phonetic le...
research
08/12/2022

Non-Autoregressive Sign Language Production via Knowledge Distillation

Sign Language Production (SLP) aims to translate expressions in spoken l...
research
11/24/2022

Ham2Pose: Animating Sign Language Notation into Pose Sequences

Translating spoken languages into Sign languages is necessary for open c...
research
03/29/2022

Signing at Scale: Learning to Co-Articulate Signs for Large-Scale Photo-Realistic Sign Language Production

Sign languages are visual languages, with vocabularies as rich as their ...
research
09/21/2023

Autoregressive Sign Language Production: A Gloss-Free Approach with Discrete Representations

Gloss-free Sign Language Production (SLP) offers a direct translation of...

Please sign up or login with your details

Forgot password? Click here to reset