Sign Language Translation with Hierarchical Spatio-Temporal Graph Neural Network

11/14/2021
by Jichao Kan, et al.

Sign language translation (SLT), which generates text in a spoken language from visual content in a sign language, is important for supporting communication in the hard-of-hearing community. Inspired by neural machine translation (NMT), most existing SLT studies adopt a general sequence-to-sequence learning strategy. However, SLT differs significantly from general NMT tasks because sign languages convey messages through multiple visual-manual aspects. Therefore, in this paper, these unique characteristics of sign languages are formulated as hierarchical spatio-temporal graph representations, comprising high-level and fine-level graphs in which each vertex characterizes a specific body part and each edge represents an interaction between body parts. In particular, high-level graphs represent patterns in regions such as the hands and face, and fine-level graphs consider the joints of the hands and the landmarks of facial regions. To learn these graph patterns, a novel deep learning architecture, the hierarchical spatio-temporal graph neural network (HST-GNN), is proposed. Graph convolutions and graph self-attentions with neighborhood context are proposed to characterize both the local and the global graph properties. Experimental results on benchmark datasets demonstrate the effectiveness of the proposed method.
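To make the graph formulation in the abstract concrete, the following is a minimal sketch in plain Python, not the authors' implementation. It builds a small high-level graph over body regions, links vertices across frames, and applies one mean-aggregation step as a stand-in for a graph convolution. The vertex names, the edge list, the function names, and the unweighted mean aggregation are all assumptions made for illustration; the abstract does not specify the actual vertex sets, edge definitions, or layer equations of HST-GNN.

```python
from typing import Dict, List, Tuple

# Hypothetical high-level vertices: the abstract mentions only "hands and face".
HIGH_LEVEL_VERTICES: List[str] = ["face", "left_hand", "right_hand"]
# Hypothetical spatial edges (interactions between regions within one frame).
HIGH_LEVEL_EDGES: List[Tuple[str, str]] = [
    ("face", "left_hand"),
    ("face", "right_hand"),
    ("left_hand", "right_hand"),
]


def build_neighbors(vertices: List[str],
                    edges: List[Tuple[str, str]]) -> List[List[int]]:
    """Return sorted neighbor index lists for an undirected graph with self-loops."""
    index: Dict[str, int] = {name: i for i, name in enumerate(vertices)}
    neighbors = [{i} for i in range(len(vertices))]  # self-loop for every vertex
    for a, b in edges:
        neighbors[index[a]].add(index[b])
        neighbors[index[b]].add(index[a])
    return [sorted(n) for n in neighbors]


def temporal_edges(num_frames: int, num_vertices: int) -> List[Tuple[int, int]]:
    """Link each vertex to itself in the next frame.

    A vertex v in frame t gets the global id t * num_vertices + v.
    """
    return [
        (t * num_vertices + v, (t + 1) * num_vertices + v)
        for t in range(num_frames - 1)
        for v in range(num_vertices)
    ]


def mean_aggregate(features: List[List[float]],
                   neighbors: List[List[int]]) -> List[List[float]]:
    """One unweighted mean-aggregation step (no learned parameters).

    This is a simplified stand-in for a graph convolution, assumed for illustration.
    """
    dim = len(features[0])
    return [
        [sum(features[j][d] for j in nbrs) / len(nbrs) for d in range(dim)]
        for nbrs in neighbors
    ]


if __name__ == "__main__":
    nbrs = build_neighbors(HIGH_LEVEL_VERTICES, HIGH_LEVEL_EDGES)
    # Toy 2-dimensional feature per region for a single frame.
    frame_features = [[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]]
    print(mean_aggregate(frame_features, nbrs))
    print(temporal_edges(num_frames=3, num_vertices=len(HIGH_LEVEL_VERTICES)))
```

A fine-level graph could be built the same way by replacing the region names with hand-joint or facial-landmark identifiers. How the two levels are coupled in HST-GNN is not described in the abstract, so it is left out of this sketch.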

research
12/06/2021

Skeletal Graph Self-Attention: Embedding a Skeleton Inductive Bias into Sign Language Production

Recent approaches to Sign Language Production (SLP) have adopted spoken ...
research
10/03/2022

Hierarchical I3D for Sign Spotting

Most of the vision-based sign language research to date has focused on I...
research
11/17/2015

Structural-RNN: Deep Learning on Spatio-Temporal Graphs

Deep Recurrent Neural Network architectures, though remarkably capable a...
research
07/23/2021

Mixed SIGNals: Sign Language Production via a Mixture of Motion Primitives

It is common practice to represent spoken languages at their phonetic le...
research
05/28/2023

An Open-Source Gloss-Based Baseline for Spoken to Signed Language Translation

Sign language translation systems are complex and require many component...
research
08/27/2020

Adversarial Training for Multi-Channel Sign Language Production

Sign Languages are rich multi-channel languages, requiring articulation ...
research
11/18/2020

Master Thesis: Neural Sign Language Translation by Learning Tokenization

In this thesis, we propose a multitask learning based method to improve ...
