Contrastive learning, which is a powerful technique for learning image-l...
Recent semi-supervised learning (SSL) methods typically include a filter...
This paper aims to remove specular highlights from a single object-level...
Current 3D open-vocabulary scene understanding methods mostly utilize
we...
With the exponential surge in diverse multi-modal data, traditional uni-...
In the real world, image degradations caused by rain often exhibit a
com...
Federated Learning (FL) enables multiple clients to collaboratively lear...
While multi-modal learning has been widely used for MRI reconstruction, ...
Robot-assisted surgery has made significant progress, with instrument
se...
End-to-end weakly supervised semantic segmentation aims at optimizing a
...
Large language models (LLMs) have shown the potential to be integrated i...
Federated learning (FL) facilitates collaborative learning among multipl...
Recent advancements in large language models (LLMs) have transformed the...
Federated learning enables multiple hospitals to cooperatively learn a s...
For robot-assisted surgery, an accurate surgical report reflects clinica...
High-speed optical wireless communication can address the exponential gr...
Cloth-changing person reidentification (ReID) is a newly emerging resear...
Cross-domain Recommendation (CR) has been extensively studied in recent ...
Multi-behavior recommendation, which exploits auxiliary behaviors (e.g.,...
In this paper, we present a Neural Preset technique to address the
limit...
When capturing and storing images, devices inevitably introduce noise.
R...
The Diffusion Probabilistic Model (DPM) has emerged as a highly effectiv...
Diffusion Probabilistic Models have recently shown remarkable performanc...
Masked image modeling (MIM) with transformer backbones has recently been...
In recent years, Denoising Diffusion Models have demonstrated remarkable...
Video dehazing aims to recover haze-free frames with high visibility and...
Shadow removal in a single image has received increasing attention in re...
As the core building block of vision transformers, attention is a powerf...
Current point cloud segmentation architectures suffer from limited long-...
Over the past decade, domain adaptation has become a widely studied bran...
The performance of learning-based denoising largely depends on clean
sup...
Intersections are essential road infrastructures for traffic in modern
m...
With the rapid development of cloud computing, virtual machine schedulin...
With the popularity of cryptocurrencies and the remarkable development o...
Shadows in videos are difficult to detect because of the large shadow
de...
Automated detecting lung infections from computed tomography (CT) data p...
3D Multi-object tracking (MOT) ensures consistency during continuous dyn...
Preoperative and noninvasive prediction of the meningioma grade is impor...
Multi-modal MR imaging is routinely used in clinical practice to diagnos...
Classification activation map (CAM), utilizing the classification struct...
Recent works on image harmonization solve the problem as a pixel-wise im...
Breast lesion detection in ultrasound is critical for breast cancer
diag...
Shared-account Cross-domain Sequential Recommendation (SCSR) task aims t...
One compelling application of artificial intelligence is to generate a v...
Federated semi-supervised learning (FSSL) aims to derive a global model ...
The isocitrate dehydrogenase (IDH) gene mutation is an essential biomark...
In this paper, we propose a novel semi-supervised learning (SSL) framewo...
Weakly supervised object localization (WSOL) focuses on localizing objec...
The security research on Windows has received little attention in the
ac...
Motion prediction is a classic problem in computer vision, which aims at...