We investigate the potential of learning visual representations using
sy...
Contrastive Language-Image Pre-training (CLIP) stands as one of the most...
Loss of packets in video conferencing often results in poor quality and ...
Humans possess a versatile mechanism for extracting structured
represent...
Contrastive learning (CL) can learn generalizable feature representation...
This paper aims to caption daily life –i.e., to create a textual descrip...
Person Re-Identification (ReID) aims to recognize a person-of-interest a...
Understanding people's actions and interactions typically depends on see...
The recent advances in deep learning have made it possible to generate
p...
Despite the recent success of end-to-end learned representations,
hand-c...
Our experiment adapts several popular deep learning methods as well as s...