This paper addresses the temporal sentence grounding (TSG). Although exi...
Given an untrimmed video, temporal sentence localization (TSL) aims to
l...
Temporal sentence grounding (TSG) aims to identify the temporal boundary...
New architecture GPUs like A100 are now equipped with multi-instance GPU...
Crowd counting is a regression task that estimates the number of people ...
Temporal video grounding (TVG) aims to localize a target segment in a vi...
Temporal sentence grounding (TSG) is crucial and fundamental for video
u...
Lymph node station (LNS) delineation from computed tomography (CT) scans...
Recent advances in deep convolutional neural networks (DCNNs) have shown...
Synthesis of face images from visual attributes is an important problem ...
Thermal face imagery, which captures the naturally emitted heat from the...
Thermal-to-visible face verification is a challenging problem due to the...
Automatic synthesis of faces from visual attributes is an important prob...
Polarimetric thermal to visible face verification entails matching two i...
Thermal to visible face verification is a challenging problem due to the...
Automatic synthesis of faces from visual attributes is an important prob...
Facial landmarks constitute the most compressed representation of faces ...