We study the task of zero-shot vision-and-language navigation (ZS-VLN), ...
Vision-and-language navigation (VLN) requires an embodied agent to navig...
We study the task of weakly-supervised point cloud semantic segmentation...
Neural Radiance Fields (NeRF) is a revolutionary approach for rendering
...
Adversarial detection aims to determine whether a given sample is an
adv...
Source-free Unsupervised Domain Adaptation (SF-UDA) aims to adapt a
well...
Vision foundation models exhibit impressive power, benefiting from the
e...
Zero-shot quantization (ZSQ) is promising for compressing and accelerati...
Test-time adaptation (TTA) has shown to be effective at tackling distrib...
Neural Architecture Search (NAS) aims to automatically find effective
ar...
Designing feasible and effective architectures under diverse computation...
We address a practical yet challenging problem of training robot agents ...
Getting robots to navigate to multiple objects autonomously is essential...
We study self-supervised video representation learning that seeks to lea...
Deep learning (DL) has made significant progress in angle closure
classi...
On the medical images, many of the tissues/lesions may be ambiguous. Tha...
Deep Metric Learning (DML) serves to learn an embedding function to proj...
This paper studies a new, practical but challenging problem, called
Clas...
Deep neural networks have exhibited remarkable performance in image
supe...
Test-time adaptation (TTA) seeks to tackle potential distribution shifts...
Conventional deep models predict a test sample with a single forward
pro...
Temporal action localization has long been researched in computer vision...
Existing Voice Cloning (VC) tasks aim to convert a paragraph text to a s...
Instance segmentation in 3D scenes is fundamental in many applications o...
We study a new challenging problem of efficient deployment for diverse t...
In real-world applications, data often come in a growing manner, where t...
Convolutional Neural Networks (CNNs) have achieved great success due to ...
Deep neural networks (DNNs) are vulnerable to adversarial examples that ...
Deep neural networks (DNNs) are vulnerable to adversarial examples with ...
One of the key steps in Neural Architecture Search (NAS) is to estimate ...
Designing feasible and effective architectures under diverse computation...
Designing effective architectures is one of the key factors behind the
s...
View synthesis aims to produce unseen views from a set of views captured...
We present Automatic Bit Sharing (ABS) to automatically search for optim...
Reading and writing research papers is one of the most privileged abilit...
Batch Normalization (BN) has been a standard component in designing deep...
Person re-identification (Re-ID) via gait features within 3D skeleton
se...
Gait-based person re-identification (Re-ID) is valuable for safety-criti...
We addressed the challenging task of video question answering, which req...
Generative adversarial networks (GANs) have shown remarkable success in
...
The last decade has witnessed remarkable progress in the image captionin...
Network quantization aims to lower the bitwidth of weights and activatio...
We focus on the task of generating sound from natural videos, and the so...
Deep neural networks have achieved remarkable success for video-based ac...
Angle closure glaucoma (ACG) is a more aggressive disease than open-angl...
The outbreak of novel coronavirus disease 2019 (COVID-19) has already
in...
We address the problem of video grounding from natural language queries....
Recently, deep neural networks (DNNs) have made great progress on automa...
Neural architecture search (NAS) has gained increasing attention in the
...
Deep neural networks have exhibited promising performance in image
super...