Gloss-Free End-to-End Sign Language Translation

05/22/2023
by   Kezhou Lin, et al.

In this paper, we tackle the problem of sign language translation (SLT) without gloss annotations. Although intermediate representations such as glosses have proven effective, gloss annotations are hard to acquire, especially in large quantities. This limits the domain coverage of translation datasets and thus handicaps real-world applications. To mitigate this problem, we design the Gloss-Free End-to-end sign language translation framework (GloFE). Our method improves SLT performance in the gloss-free setting by exploiting the semantics shared between signs and their corresponding spoken-language translations. Common concepts are extracted from the text and used as a weak form of intermediate representation. The global embedding of these concepts is used as a query for cross-attention to find the corresponding information within the learned visual features. In a contrastive manner, we encourage similarity between the query results of samples that contain a given concept and discourage similarity with the query results of samples that do not. We obtain state-of-the-art results on large-scale datasets, including OpenASL and How2Sign. The code and model will be available at https://github.com/HenryLittle/GloFE.
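
The concept-query idea in the abstract can be illustrated with a small, dependency-free sketch. This is not the authors' implementation: the function names (`cross_attend`, `concept_contrastive_loss`), the single-head attention, the cosine similarity, the temperature value, and the InfoNCE-style form of the loss are all assumptions chosen for illustration, since the abstract only states that a concept embedding queries the visual features by cross-attention and that the query results are trained "in a contrastive manner".

```python
import math


def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cosine(a, b):
    # Cosine similarity with a small epsilon to avoid division by zero.
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)) + 1e-8)


def cross_attend(query, frames):
    # Single-head scaled dot-product attention (an assumed simplification):
    # one concept query attends over the per-frame visual features of a video.
    scale = math.sqrt(len(query))
    weights = softmax([dot(query, f) / scale for f in frames])
    dim = len(frames[0])
    return [sum(w * f[i] for w, f in zip(weights, frames)) for i in range(dim)]


def concept_contrastive_loss(query, videos, has_concept, temperature=0.1):
    # InfoNCE-style loss for ONE concept over a batch (assumed formulation).
    # Anchors are samples whose translation contains the concept; positives
    # are the other samples that contain it, negatives are those that do not.
    results = [cross_attend(query, v) for v in videos]
    losses = []
    for a, anchor_has in enumerate(has_concept):
        if not anchor_has:
            continue
        others = [j for j in range(len(videos)) if j != a]
        if not any(has_concept[j] for j in others):
            continue  # no positive pair for this anchor
        probs = softmax([cosine(results[a], results[j]) / temperature
                         for j in others])
        p_pos = sum(p for p, j in zip(probs, others) if has_concept[j])
        losses.append(-math.log(p_pos + 1e-12))
    return sum(losses) / len(losses) if losses else 0.0


# Toy batch: three "videos", each a list of 2-D frame features.
concept_query = [1.0, 0.0]
batch = [
    [[5.0, 0.0], [0.0, 1.0]],  # contains a frame aligned with the concept
    [[4.0, 0.0], [0.0, 1.0]],  # contains a frame aligned with the concept
    [[0.0, 1.0], [0.0, 2.0]],  # no frame aligned with the concept
]
loss = concept_contrastive_loss(concept_query, batch, [True, True, False])
```

In this toy batch the first two videos yield similar query results and the third does not, so the loss is small when the concept labels match that grouping and larger when they do not. A real system would use learned embeddings, batched tensors, and many concepts per batch.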

research
07/27/2023

Gloss-free Sign Language Translation: Improving from Visual-Language Pretraining

Sign Language Translation (SLT) is a challenging task due to its cross-d...
research
07/14/2023

Gloss Attention for Gloss-free Sign Language Translation

Most sign language translation (SLT) methods to date require the use of ...
research
05/25/2022

Open-Domain Sign Language Translation Learned from Online Video

Existing work on sign language translation–that is, translation from sig...
research
04/21/2023

Better Sign Language Translation with Monolingual Data

Sign language translation (SLT) systems, which are often decomposed into...
research
05/02/2023

SLTUNET: A Simple Unified Model for Sign Language Translation

Despite recent successes with neural models for sign language translatio...
research
10/12/2020

TSPNet: Hierarchical Feature Learning via Temporal Semantic Pyramid for Sign Language Translation

Sign language translation (SLT) aims to interpret sign video sequences i...
research
05/23/2023

BM25 Query Augmentation Learned End-to-End

Given BM25's enduring competitiveness as an information retrieval baseli...
