Sign Language Video Retrieval with Free-Form Textual Queries

01/07/2022
by   Amanda Duarte, et al.
16

Systems that can efficiently search collections of sign language videos have been highlighted as a useful application of sign language technology. However, the problem of searching videos beyond individual keywords has received limited attention in the literature. To address this gap, in this work we introduce the task of sign language retrieval with free-form textual queries: given a written query (e.g., a sentence) and a large collection of sign language videos, the objective is to find the signing video in the collection that best matches the written query. We propose to tackle this task by learning cross-modal embeddings on the recently introduced large-scale How2Sign dataset of American Sign Language (ASL). We identify that a key bottleneck in the performance of the system is the quality of the sign video embedding which suffers from a scarcity of labeled training data. We, therefore, propose SPOT-ALIGN, a framework for interleaving iterative rounds of sign spotting and feature alignment to expand the scope and scale of available training data. We validate the effectiveness of SPOT-ALIGN for learning a robust sign video embedding through improvements in both sign recognition and the proposed video retrieval task.

READ FULL TEXT
research
11/05/2021

BBC-Oxford British Sign Language Dataset

In this work, we introduce the BBC-Oxford British Sign Language (BOBSL) ...
research
03/22/2023

CiCo: Domain-Aware Sign Language Retrieval via Cross-Lingual Contrastive Learning

This work focuses on sign language retrieval-a recently proposed task fo...
research
03/24/2022

Searching for fingerspelled content in American Sign Language

Natural language processing for sign language video - including tasks li...
research
05/06/2021

Aligning Subtitles in Sign Language Videos

The goal of this work is to temporally align asynchronous subtitles in s...
research
05/23/2023

Slovo: Russian Sign Language Dataset

One of the main challenges of the sign language recognition task is the ...
research
10/11/2020

Boosting Continuous Sign Language Recognition via Cross Modality Augmentation

Continuous sign language recognition (SLR) deals with unaligned video-te...
research
08/08/2023

Gloss Alignment Using Word Embeddings

Capturing and annotating Sign language datasets is a time consuming and ...

Please sign up or login with your details

Forgot password? Click here to reset