Large Vision-Language Models (LVLMs) such as MiniGPT-4 and LLaVA have
de...
To comprehensively assess optical fiber communication system conditions,...
Federated learning (FL) enables collaborative model training across
dist...
Intel processors utilize the retirement to orderly retire the micro-ops ...
Mobile edge computing (MEC) is essential for next-generation mobile netw...
The recently proposed segment anything model (SAM) has made a significan...
In the modern CPU architecture, enhancements such as the Line Fill Buffe...
Caches are used to reduce the speed differential between the CPU and mem...
Recent researches indicate that utilizing the frequency information of i...
Background subtraction (BGS) aims to extract all moving objects in the v...
As the Internet of Things (IoT) continues to evolve, smartphones have be...
Friction-induced vibration (FIV) is very common in engineering areas.
An...
Inspired by masked language modeling (MLM) in natural language processin...
Human parsing is a key topic in image processing with many applications,...
In this paper, we design the first residual type a posteriori error esti...
In this paper, a new iterative two-level algorithm is presented for solv...
Visual tasks vary a lot in their output formats and concerned contents,
...
Information freshness, characterized by age of information (AoI), is
imp...
Previous unsupervised domain adaptation methods did not handle the
cross...
Clustering-based methods, which alternate between the generation of pseu...
Self-supervised learning (SSL) holds promise in leveraging large amounts...
In person re-identification (ReID), very recent researches have validate...
In this paper, we design and analysis a modified weak Galerkin (MWG) fin...
Structural neural network pruning aims to remove the redundant channels ...
3D human pose and shape recovery from a monocular RGB image is a challen...
Transformer has achieved great success in computer vision, while how to ...
Transformer has been widely used for self-supervised pre-training in Nat...
Many recent works have reconstructed distinctive 3D face shapes by
aggre...
Transformer is showing its superiority over convolutional architectures ...
To address the problem of long-tail distribution for the large vocabular...
Existing alignment-based methods have to employ the pretrained human par...
360-degree video streaming provides users with immersive experience by
l...
In mobile edge computing systems, an edge node may have a high load when...
Bike Sharing Systems (BSSs) have been adopted in many major cities of th...
We present a novel and easy-to-implement training framework for visual
t...
Recently, deep learning based facial landmark detection has achieved gre...
This paper reports the demonstration of high-speed PAM-4 transmission us...
Tactile Internet often requires (i) the ultra-reliable and ultra-respons...
Correlation filter (CF) based trackers are currently ranked top in terms...
Recently, correlation filter based trackers (CF trackers) have attracted...
Adaptive bitrate (ABR) streaming enables video users to adapt the playin...
Adaptive bitrate streaming enables video users to adapt their playing
bi...
Pedestrian detection has achieved great improvements in recent years, wh...
In recent years, two types of trackers, namely correlation filter based
...
Image matting plays an important role in image and video editing. Howeve...
Foreground segmentation in video sequences is a classic topic in compute...