Automatic dense annotation of large-vocabulary sign language videos

08/04/2022
by Liliane Momeni, et al.

Recently, sign language researchers have turned to sign-language-interpreted TV broadcasts, comprising (i) a video of continuous signing and (ii) subtitles corresponding to the audio content, as a readily available and large-scale source of training data. One key challenge in the usability of such data is the lack of sign annotations. Previous work exploiting such weakly-aligned data found only sparse correspondences between keywords in the subtitle and individual signs. In this work, we propose a simple, scalable framework to vastly increase the density of automatic annotations. Our contributions are the following: (1) we significantly improve previous annotation methods by making use of synonyms and subtitle-signing alignment; (2) we show the value of pseudo-labelling from a sign recognition model as a way of sign spotting; (3) we propose a novel approach for increasing our annotations of known and unknown classes based on in-domain exemplars; (4) on the BOBSL British Sign Language (BSL) corpus, we increase the number of confident automatic annotations from 670K to 5M. We make these annotations publicly available to support the sign language research community.
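
As a rough illustration of contribution (2), pseudo-labelling can be sketched as keeping only those predictions of a sign recognition model whose confidence clears a threshold. The sketch below is not the authors' implementation: the function name `pseudo_label`, the per-frame probability layout, and the threshold of 0.9 are all assumptions made for the example.

```python
from typing import List, Sequence, Tuple


def pseudo_label(
    frame_probs: Sequence[Sequence[float]],
    vocab: Sequence[str],
    threshold: float = 0.9,  # assumed value, not taken from the paper
) -> List[Tuple[int, str, float]]:
    """Return (frame_index, sign, confidence) for confident predictions.

    frame_probs[t][c] is a hypothetical recognition model's probability
    that class c (named vocab[c]) is being signed at time step t.
    """
    annotations = []
    for t, probs in enumerate(frame_probs):
        # Index of the most probable class at this time step.
        best = max(range(len(probs)), key=probs.__getitem__)
        # Keep the prediction only if the model is confident enough.
        if probs[best] >= threshold:
            annotations.append((t, vocab[best], probs[best]))
    return annotations


if __name__ == "__main__":
    probs = [[0.05, 0.95], [0.60, 0.40], [0.92, 0.08]]
    print(pseudo_label(probs, vocab=["tree", "house"]))
```

Raising the threshold trades annotation density for precision, which is the tension the abstract's "confident automatic annotations" refers to.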

research
05/06/2021

Aligning Subtitles in Sign Language Videos

The goal of this work is to temporally align asynchronous subtitles in s...
research
08/08/2023

Gloss Alignment Using Word Embeddings

Capturing and annotating Sign language datasets is a time consuming and ...
research
11/16/2022

Weakly-supervised Fingerspelling Recognition in British Sign Language Videos

The goal of this work is to detect and recognize sequences of letters si...
research
04/28/2021

Sign Segmentation with Changepoint-Modulated Pseudo-Labelling

The objective of this work is to find temporal boundaries between signs ...
research
03/21/2023

Self-Sufficient Framework for Continuous Sign Language Recognition

The goal of this work is to develop self-sufficient framework for Contin...
research
07/23/2020

BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues

Recent progress in fine-grained gesture and action classification, and m...
research
04/14/2022

Open Source HamNoSys Parser for Multilingual Sign Language Encoding

This paper presents our recent developments in the field of automatic pr...