Video activity localization aims at understanding the semantic content i...
Few-shot (FS) and zero-shot (ZS) learning are two different approaches f...
This paper deals with the problem of localizing objects in image and vid...
The recently released Ego4D dataset and benchmark significantly scales a...
This report presents the technical details of our submission to the EPIC...
Recently, few-shot learning has received increasing interest. Existing e...
This paper is on video recognition using Transformers. Very recent attem...
Temporal action localization (TAL) is a fundamental yet challenging task...
Many video analysis tasks require temporal localization thus detection o...
We present the submission of Samsung AI Centre Cambridge to the CVPR2020...
Attentive video modeling is essential for action recognition in unconstr...
Most existing object detection methods rely on the availability of abund...
We tackle the problem of finding good architectures for multimodal class...
This paper addresses the difficult problem of finding an optimal neural ...
Rotoscoping, the detailed delineation of scene elements through a video ...