Deep learning has made significant strides in video understanding tasks,...
We introduce Sketch-based Video Object Localization (SVOL), a new task a...
Standard multi-modal models assume the use of the same modalities in tra...
In multi-modal action recognition, it is important to consider not only ...
We present a new paradigm named explore-and-match for video grounding, w...
Online action detection, which aims to identify an ongoing action from a...
Visible-infrared person re-identification (VI-ReID) is an important task...
Recent temporal action proposal generation approaches have suggested
int...
Most video person re-identification (re-ID) methods are mainly based on
...
Detecting fashion landmarks is a fundamental technique for visual clothi...