Make-up temporal video grounding (MTVG) aims to localize the target vide...
Multi-sensor modal fusion has demonstrated strong advantages in 3D objec...
In this paper, we present our solution to the MuSe-Personalisation
sub-c...
Remote photoplethysmography (rPPG) based physiological measurement is an...
The video grounding (VG) task aims to locate the queried action or event...
In this paper, we present the solution of our team HFUT-VUT for the
Mult...
Building a multi-modality multi-task neural network toward accurate and
...
In this paper, we briefly introduce the solution of our team HFUT-VUT fo...
Video-based human pose transfer is a video-to-video generation task that...
Deep learning technology has made great achievements in the field of ima...
Interactive image segmentation aims to segment the target from the backg...
This paper presents our 2nd place solution for the NuPlan Challenge 2023...
Recent learning-based approaches have achieved significant progress in l...
Malware detection models based on deep learning have been widely used, b...
Building end-to-end task bots and maintaining their integration with new...
The lack of freely available standardized datasets represents an aggrava...
As robotics technology advances, dense point cloud maps are increasingly...
Recently, cross-source point cloud registration from different sensors h...
Naturally controllable human-scene interaction (HSI) generation has an
i...
Stencil computation is one of the most important kernels in various
scie...
In this manuscript (ms), we propose causal inference based single-branch...
Visual question answering (VQA) is an important and challenging multimod...
Image-based multi-person reconstruction in wide-field large scenes is
cr...
Cross-matching operation, which is to find corresponding data for the sa...
The electrification of shared mobility has become popular across the glo...
This letter attempts to design a surveillance scheme by adopting an acti...
Nowadays, online screen sharing and remote cooperation are becoming
ubiq...
In modern SD-WAN networks, a global controller continuously optimizes
ap...
This paper investigates the physical-layer security in a Virtual Antenna...
A non-intrusive model order reduction (MOR) method for solving parameter...
The advent of deep learning has led to significant progress in monocular...
In this paper, we introduce HDhuman, a method that addresses the challen...
Emotion recognition is a challenging and actively-studied research area ...
There have been two streams in the 3D detection from point clouds:
singl...
Nowadays, there is an explosive growth of screen contents due to the wid...
Clustering of hyperspectral images is a fundamental but challenging task...
The big data about music history contains information about time and use...
Channel knowledge map (CKM) is an emerging technique to enable
environme...
In this report, we introduce our winning solution to the Real-time 3D
De...
We investigate the effectiveness of different machine learning methodolo...
Second language (L2) English learners often find it difficult to improve...
User queries for a real-world dialog system may sometimes fall outside t...
Intent understanding plays an important role in dialog systems, and is
t...
As the data size in Machine Learning fields grows exponentially, it is
i...
This paper is concerned with the design of a non-intrusive model order
r...
Stencil computation is one of the most important kernels in various
scie...
Stencil computation is one of the most important kernels in various
scie...
Person image synthesis, e.g., pose transfer, is a challenging problem du...
Human pose transfer, which aims at transferring the appearance of a give...
Human pose transfer, as a misaligned image generation task, is very
chal...