Composed Image Retrieval (CoIR) has recently gained popularity as a task...
In this paper, we present TMR, a simple yet effective approach for text ...
Our goal is to synthesize 3D human motions given textual inputs describi...
Large-scale pre-trained Vision Language (VL) models have shown remar...
The objective of this paper is an automatic Audio Description (AD) model...
The goal of this work is to detect and recognize sequences of letters si...
Given a series of natural language descriptions, our task is to generate...
Recently, sign language researchers have turned to sign language interpr...
Our goal in this paper is the adaptation of image-text models for long v...
The focus of this work is sign spotting - given a video of an
isolated s...
We address the problem of generating diverse 3D human motions from textu...
Systems that can efficiently search collections of sign language videos ...
In this work, we introduce the BBC-Oxford British Sign Language (BOBSL)
...
Our work aims to obtain 3D reconstruction of hands and manipulated objec...
The goal of this work is to temporally align asynchronous subtitles in s...
The objective of this work is to find temporal boundaries between signs ...
We tackle the problem of action-conditioned generation of realistic and
...
Our objective in this work is video-text retrieval - in particular a joi...
The objective of this work is to annotate sign instances across a broad
...
The objective of this work is to determine the location of temporal
boun...
The focus of this work is sign spotting - given a video of an isolated s...
Recent progress in fine-grained gesture and action classification, and
m...
Our goal in this work is to improve the performance of human action
reco...
Estimating hand-object manipulations is essential for interpreting and
i...
Human shape estimation is an important task for video editing, animation...
Estimating human pose, shape, and motion from images and videos are
fund...
Typical human actions last several seconds and exhibit characteristic
sp...
Computer vision has a great potential to help our daily lives by searchi...