Video captioning aims to describe the content of videos using natural
la...
Large-scale pre-trained multi-modal models (e.g., CLIP) demonstrate stro...
This paper describes our champion solution for the CVPR2022 Generic Even...
This report presents the algorithm used in the submission of Generic Eve...
Generic Event Boundary Detection (GEBD) aims to detect moments where hum...
Generic event boundary detection aims to localize the generic, taxonomy-...
Automatic security inspection using computer vision technology is a
chal...
Crowd counting on the drone platform is an interesting topic in computer...
This report presents the approach used in the submission of Generic Even...
To promote the developments of object detection, tracking and counting
a...
Pig counting is a crucial task for large-scale pig farming, which is usu...
Unsupervised domain adaptation is critical in various computer vision ta...
The convention standard for object detection uses a bounding box to repr...
Drone equipped with cameras can dynamically track the target in the air ...
Drones, or general UAVs, equipped with cameras have been fast deployed w...
The majority of existing human parsing methods formulate the task as sem...
In this paper, we present a novel siamese motion-aware network (SiamMan)...
This paper proposes a space-time multi-scale attention network (STANet) ...
Fine-grained visual categorization (FGVC) is an important but challengin...
Object detection and counting are related but challenging problems,
espe...
Multi-view subspace clustering aims to discover the inherent structure b...
The ChaLearn large-scale gesture recognition challenge has been run twic...
Automatic Check-Out (ACO) receives increased interests in recent years. ...
In this paper, we present a unified, end-to-end trainable spatiotemporal...
The majority of Multi-Object Tracking (MOT) algorithms based on the
trac...
Video style transfer is a useful component for applications such as augm...
Current state-of-the-art object objectors are fine-tuned from the
off-th...
Pedestrian detection in crowded scenes is a challenging problem since th...
In this paper we present a large-scale visual object detection and track...
For object detection, the two-stage approach (e.g., Faster R-CNN) has be...
Inconsistency in contrast enhancement can be used to expose image forger...
Graph based representation is widely used in visual tracking field by fi...
In recent years, numerous effective multi-object tracking (MOT) methods ...