In this study, we address the challenge of 3D scene structure recovery f...
Cross-domain pedestrian detection aims to generalize pedestrian detector...
Domain shifts such as sensor type changes and geographical situation
var...
Fish feeding intensity assessment (FFIA) aims to evaluate the intensity
...
3D dense captioning requires a model to translate its understanding of a...
Accurate segmentation of clustered microcalcifications in mammography is...
Recent research understands the residual networks from a new perspective...
Given the long textual product information and the product image, Multi-...
Structural re-parameterization is a general training scheme for Convolut...
Reinforcement learning is time-consuming for complex tasks due to the ne...
Due to the successful development of deep image generation technology, v...
Exploration and reward specification are fundamental and intertwined
cha...
Few-shot video object segmentation (FSVOS) aims to segment dynamic objec...
Though the advancement of pre-trained large language models unfolds, the...
Reinforcement learning (RL) has shown promise in creating robust policie...
Predicting the performance of highly configurable software systems is th...
It is a long-term vision for Autonomous Driving (AD) community that the
...
The recent advancements in image-text diffusion models have stimulated
r...
Adversarial attack is commonly regarded as a huge threat to neural netwo...
The transformer extends its success from the language to the vision doma...
Weakly supervised semantic segmentation (WSSS) models relying on class
a...
Residual networks have shown great success and become indispensable in r...
Semi-supervised semantic segmentation aims to learn from a small amount ...
Point cloud based 3D deep model has wide applications in many applicatio...
Medical image segmentation is a challenging task with inherent ambiguity...
In recent years, research on few-shot learning (FSL) has been fast-growi...
Due to the emergence of powerful computing resources and large-scale
ann...
In this work, we aim to learn dexterous manipulation of deformable objec...
We study the problem of object retrieval in scenarios where visual sensi...
Global channel pruning (GCP) aims to remove a subset of channels (filter...
Current 3D object detection models follow a single dataset-specific trai...
Unsupervised Domain Adaptation (UDA) technique has been explored in 3D
c...
Object detection on VHR remote sensing images plays a vital role in
appl...
Building 3D maps of the environment is central to robot navigation, plan...
Few-shot semantic segmentation is the task of learning to locate each pi...
Neural Radiance Fields (NeRF) has achieved impressive results in single
...
Neural Architecture Search has attracted increasing attention in recent
...
Configurable software systems can be tuned for better performance. Lever...
3D dense captioning aims to generate multiple captions localized with th...
Medical Visual Question Answering (Medical-VQA) aims to to answer clinic...
State-of-the-art 3D semantic segmentation models are trained on the
off-...
In this paper, we introduce a novel approach for ground plane normal
est...
We study a challenging task, conditional human motion generation, which
...
The linear ensemble based strategy, i.e., averaging ensemble, has been
p...
There are still many challenges of emotion recognition using physiologic...
Implicit neural 3D representation has achieved impressive results in sur...
Retrieval augmentation has shown promising improvements in different tas...
Residual networks have shown great success and become indispensable in
t...
Logical rules, both transferable and explainable, are widely used as wea...
Non-functional bugs (e.g., performance- or accuracy-related bugs) in Dee...