Multi-modal Large Language Model (MLLM) refers to a model expanded from ...
Knowledge base question answering (KBQA) is a critical yet challenging t...
Point-cloud-based 3D perception has attracted great attention in various...
Split learning of deep neural networks (SplitNN) has provided a promisin...
Question Answering (QA) is the task of automatically answering questions...
In this work, we consider the deployment of reconfigurable intelligent
s...
Structured tabular data exist across nearly all fields. Reasoning task o...
3D object detection from multiple image views is a fundamental and
chall...
Document-level relation extraction (DocRE) aims to identify semantic lab...
Monocular depth estimation is an essential task in the computer vision
c...
Point clouds and RGB images are two general perceptional sources in
auto...
Entity linking aims to link ambiguous mentions to their corresponding
en...
Monocular 3D object detection (Mono3D) has achieved tremendous improveme...
Distributed sparse deep learning has been widely used in many internet-s...
Monocular 3D object detection (Mono3D) has achieved unprecedented succes...
3D object detection from multiple image views is a fundamental and
chall...
Monocular depth estimation is a fundamental task in computer vision and ...
This paper aims to address the problem of supervised monocular depth
est...
To address the security risk caused by fixed offset mapping and the limi...
Object detection through either RGB images or the LiDAR point clouds has...
The rapid progress of photorealistic synthesis techniques has reached a
...
Pre-training has become a standard paradigm in many computer vision task...
Robotic three-dimensional (3D) ultrasound (US) imaging has been employed...
BBRv2, proposed by Google, aims at addressing BBR's shortcomings of
unfa...
In-network computation has been widely used to accelerate data-intensive...
Ultrasound (US) imaging is widely employed for diagnosis and staging of
...
MPTCP is a new transport protocol that enables mobile devices to use mul...
Since the SARS outbreak in 2003, a lot of predictive epidemiological mod...
Non-invasive continuous alcohol monitoring has potential applications in...
Packet classification according to multi-field ruleset is a key componen...
In this paper, an evolutionary many-objective optimization algorithm bas...
3D steganalysis aims to identify subtle invisible changes produced in
gr...