Dominant Person Search methods aim to localize and recognize query perso...
Despite significant progress in Text-to-Image (T2I) generative models, e...
Multi-modality fusion and multi-task learning are becoming trendy in 3D
...
As a fundamental aspect of human life, two-person interactions contain
m...
International maritime crime is becoming increasingly sophisticated, oft...
Medical artificial general intelligence (MAGI) enables one foundation mo...
Cycling is a healthy and sustainable mode of transport. However, interac...
Automatic radiology reporting has great clinical potential to relieve
ra...
The goal of Image-to-image (I2I) translation is to transfer an image fro...
Text-guided 3D object generation aims to generate 3D objects described b...
The task of Compositional Zero-Shot Learning (CZSL) is to recognize imag...
Automatic data augmentation (AutoAugment) strategies are indispensable i...
Modeling the ideological perspectives of political actors is an essentia...
We introduce ViLPAct, a novel vision-language benchmark for human activi...
Open-set semi-supervised learning (OSSL) has attracted growing interest,...
The task of action detection aims at deducing both the action category a...
Multivariate long sequence time-series forecasting (M-LSTF) is a practic...
Unsupervised domain adaptation (UDA) methods have been broadly utilized ...
Automatic generation of ophthalmic reports using data-driven neural netw...
Cooperative multi-agent reinforcement learning (MARL) is making rapid
pr...
Due to the superior performance of Graph Neural Networks (GNNs) in vario...
Neural architecture search (NAS) aims to automate architecture engineeri...
Recent advances in vision Transformers (ViTs) have come with a voracious...
Recently, weakly supervised person search has been proposed to discard
human-a...
Detecting forgery videos is highly desirable due to the abuse of deepfak...
Knowledge Distillation has shown very promising ability in transferring...
Zero-Shot Learning (ZSL) aims to transfer classification capability from...
Differentiable Architecture Search (DARTS) has received massive attentio...
Zero-Shot Learning (ZSL) aims to transfer learned knowledge from observe...
We propose a novel approach for visual representation learning called
Si...
Recently, tremendous human-designed and automatically searched neural
ne...
Multimedia event detection is the task of detecting a specific event of
...
Dynamic networks have shown their promising capability in reducing
theor...
Political stance detection has become an important task due to the
incre...
Identifying political perspective in news media has become an important ...
By leveraging weight-sharing and continuous relaxation to enable
g...
Vision-language Navigation (VLN) tasks require an agent to navigate
step...
Person search has drawn increasing attention due to its real-world
appli...
The ability to navigate like a human towards a language-guided target fr...
Current dynamic networks and dynamic pruning methods have shown their
pr...
A myriad of recent breakthroughs in hand-crafted neural architectures fo...
Recent advances in multi-agent reinforcement learning have been largely
...
To reduce the human efforts in neural network design, Neural Architectur...
Linear discriminant analysis (LDA) is a popular technique to learn the m...
Active learning (AL) attempts to maximize the performance gain of the mo...
Most existing tracking methods are based on using a classifier and
multi...
In this paper, we focus on the task of multi-view multi-source
geo-local...
Existing studies for automated melanoma diagnosis are based on single-ti...
Beyond the common difficulties faced in natural image captioning, me...
Deep learning has made major breakthroughs and progress in many fields. ...