Domain adaptation (DA) aims to transfer knowledge from a fully labeled s...
Recent advances in Scene Graph Generation (SGG) typically model the rela...
A network-based intrusion detection system (NIDS) monitors network traffic...
Dynamic scene graphs generated from video clips could help enhance the s...
We propose a new general model called IPNN - Indeterminate Probability N...
Video-Text Pre-training (VTP) aims to learn transferable representations...
Recent works on unsupervised domain adaptation (UDA) focus on the select...
Graph convolutional networks (GCNs) are widely adopted in skeleton-based...
Unsupervised video representation learning has made remarkable achieveme...
Video grounding aims to localize the temporal segment corresponding to a...
Weakly-Supervised Temporal Action Localization (WSTAL) aims to localize...
Efficient long-short temporal modeling is key for enhancing the performa...
Temporal action detection (TAD) is a challenging task which aims to temp...
Arbitrary-shaped text detection is a challenging task since curved texts...
Human-Object Interaction (HOI) detection is devoted to learning how humans int...
Weakly-supervised temporal action localization (WS-TAL) aims to localize...
RGB-Infrared person re-identification (RGB-IR Re-ID) aims to match perso...
Efficiently modeling dynamic motion information in videos is crucial for...
Foreseeing the future is one of the key factors of intelligence. It invo...