Expecting intelligent machines to efficiently work in real world require...
This paper focuses on addressing the practical yet challenging problem o...
Multiple object tracking (MOT) has been successfully investigated in com...
Zero-shot skeleton-based action recognition aims to recognize actions of...
Multi-view 3D object detection is becoming popular in autonomous driving...
Understanding and modeling the 3D scene from a single image is a practic...
Vision-language models have achieved tremendous progress far beyond what...
Recent advances in detecting arbitrary objects in the real world are tra...
One-to-one label assignment in object detection has successfully obviate...
Referring image segmentation segments an image from a language expressio...
Federated learning (FL) enables multiple clients to train models
collabo...
Self-supervised learning has demonstrated remarkable capability in
repre...
Real-world data contains a vast amount of multimodal information, among ...
Recent advances in modeling 3D objects mostly rely on synthetic datasets...
Semi-Supervised Semantic Segmentation aims at training the segmentation ...
Retinal fundus images have been applied for the diagnosis and screening ...
Graph convolution networks (GCN) have been widely used in skeleton-based...
When learning from sensitive data, care must be taken to ensure that tra...
Neural surface reconstruction aims to reconstruct accurate 3D surfaces b...
A key challenge to visualization authoring is the process of getting fam...
Although existing machine reading comprehension models are making rapid
...
End-to-end object detection is rapidly progressed after the emergence of...
We present PYSKL: an open-source toolbox for skeleton-based action
recog...
Learning from a few training samples is a desirable ability of an object...
Referring image segmentation is a fundamental vision-language task that ...
Object detection has achieved substantial progress in the last decade.
H...
Federated Learning has shown great potentials for the distributed data
u...
Federated Semi-Supervised Learning (FedSSL) has gained rising attention ...
Most recent semantic segmentation methods adopt a U-Net framework with a...
E-commerce sites strive to provide users the most timely relevant inform...
COVID-19 has hugely changed our lives, work, and interactions with peopl...
Feature reassembly, i.e. feature downsampling and upsampling, is a key
o...
The competition of extracting COVID-19 events from Twitter is to develop...
Patch-based methods and deep networks have been employed to tackle image...
Entity-based semantic search has been widely adopted in modern search en...
This report presents the approach used in the submission of the LVIS
Cha...
We present MMFashion, a comprehensive, flexible and user-friendly open-s...
Promising federated learning coupled with Mobile Edge Computing (MEC) is...
Current object detection frameworks mainly rely on bounding box regressi...
Obtaining the state of the art performance of deep learning models impos...
We present MMDetection, an object detection toolbox that contains a rich...
Feature upsampling is a key operation in a number of modern convolutiona...
Cross-domain sentiment classification (CDSC) is an importance task in do...
Cascade is a classic yet powerful architecture that has boosted performa...
Region anchors are the cornerstone of modern object detection techniques...
High-performance object detection relies on expensive convolutional netw...