Mixed SIGNals: Sign Language Production via a Mixture of Motion Primitives

07/23/2021
by   Ben Saunders, et al.
16

It is common practice to represent spoken languages at their phonetic level. However, for sign languages, this implies breaking motion into its constituent motion primitives. Avatar based Sign Language Production (SLP) has traditionally done just this, building up animation from sequences of hand motions, shapes and facial expressions. However, more recent deep learning based solutions to SLP have tackled the problem using a single network that estimates the full skeletal structure. We propose splitting the SLP task into two distinct jointly-trained sub-tasks. The first translation sub-task translates from spoken language to a latent sign language representation, with gloss supervision. Subsequently, the animation sub-task aims to produce expressive sign language sequences that closely resemble the learnt spatio-temporal representation. Using a progressive transformer for the translation sub-task, we propose a novel Mixture of Motion Primitives (MoMP) architecture for sign language animation. A set of distinct motion primitives are learnt during training, that can be temporally combined at inference to animate continuous sign language sequences. We evaluate on the challenging RWTH-PHOENIX-Weather-2014T(PHOENIX14T) dataset, presenting extensive ablation studies and showing that MoMP outperforms baselines in user evaluations. We achieve state-of-the-art back translation performance with an 11 Importantly, and for the first time, we showcase stronger performance for a full translation pipeline going from spoken language to sign, than from gloss to sign.

READ FULL TEXT
research
03/11/2021

Continuous 3D Multi-Channel Sign Language Production via Progressive Transformers and Mixture Density Networks

Sign languages are multi-channel visual languages, where signers use a c...
research
04/30/2020

Progressive Transformers for End-to-End Sign Language Production

The goal of automatic Sign Language Production (SLP) is to translate spo...
research
11/19/2020

Everybody Sign Now: Translating Spoken Language to Photo Realistic Sign Language Video

To be truly understandable and accepted by Deaf communities, an automati...
research
12/06/2021

Skeletal Graph Self-Attention: Embedding a Skeleton Inductive Bias into Sign Language Production

Recent approaches to Sign Language Production (SLP) have adopted spoken ...
research
11/14/2021

Sign Language Translation with Hierarchical Spatio-TemporalGraph Neural Network

Sign language translation (SLT), which generates text in a spoken langua...
research
09/01/2020

Multi-channel Transformers for Multi-articulatory Sign Language Translation

Sign languages use multiple asynchronous information channels (articulat...
research
09/21/2023

Autoregressive Sign Language Production: A Gloss-Free Approach with Discrete Representations

Gloss-free Sign Language Production (SLP) offers a direct translation of...

Please sign up or login with your details

Forgot password? Click here to reset