Wanli Ouyang

research

∙ 08/31/2023

Improving Multiple Sclerosis Lesion Segmentation Across Clinical Sites: A Federated Learning Approach with Noise-Resilient Training

Accurately measuring the evolution of Multiple Sclerosis (MS) with magne...

0 Lei Bai, et al. ∙

research

∙ 08/29/2023

DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

We present DiffBIR, which leverages pretrained text-to-image diffusion m...

0 Xinqi Lin, et al. ∙

research

∙ 08/26/2023

Boosting Residual Networks with Group Knowledge

Recent research understands the residual networks from a new perspective...

0 Shengji Tang, et al. ∙

research

∙ 08/21/2023

STEERER: Resolving Scale Variations for Counting and Localization via Selective Inheritance Learning

Scale variation is a deep-rooted problem in object counting, which has n...

0 Tao Han, et al. ∙

research

∙ 08/14/2023

Masked Motion Predictors are Strong 3D Action Representation Learners

In 3D human action recognition, limited supervised data makes it challen...

0 Yunyao Mao, et al. ∙

research

∙ 08/11/2023

Experts Weights Averaging: A New General Training Scheme for Vision Transformers

Structural re-parameterization is a general training scheme for Convolut...

0 Yongqi Huang, et al. ∙

research

∙ 08/06/2023

MCTformer+: Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation

This paper proposes a novel transformer-based framework that aims to enh...

0 Lian Xu, et al. ∙

research

∙ 07/20/2023

Meta-Transformer: A Unified Framework for Multimodal Learning

Multimodal learning aims to build models that can process and relate inf...

0 Yiyuan Zhang, et al. ∙

research

∙ 06/19/2023

MotionGPT: Finetuned LLMs are General-Purpose Motion Generators

Generating realistic human motion from given action descriptions has exp...

0 Yaqi Zhang, et al. ∙

research

∙ 06/19/2023

UniG3D: A Unified 3D Object Generation Dataset

The field of generative AI has a transformative impact on various areas,...

0 Qinghong Sun, et al. ∙

research

∙ 06/15/2023

Adaptive Hierarchical SpatioTemporal Network for Traffic Forecasting

Accurate traffic forecasting is vital to intelligent transportation syst...

0 Yirong Chen, et al. ∙

research

∙ 06/13/2023

Retrieve Anyone: A General-purpose Person Re-identification Task with Instructions

Human intelligence can retrieve any person according to both visual and ...

0 Weizhen He, et al. ∙

research

∙ 06/11/2023

LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark

Large language models have become a potential pathway toward achieving a...

0 Zhenfei Yin, et al. ∙

research

∙ 06/02/2023

Bi-LRFusion: Bi-Directional LiDAR-Radar Fusion for 3D Dynamic Object Detection

LiDAR and Radar are two complementary sensing approaches in that LiDAR s...

0 Yingjie Wang, et al. ∙

research

∙ 05/10/2023

Clothes-Invariant Feature Learning by Causal Intervention for Clothes-Changing Person Re-identification

Clothes-invariant feature extraction is critical to the clothes-changing...

0 Xulin Li, et al. ∙

research

∙ 05/04/2023

Stimulative Training++: Go Beyond The Performance Limits of Residual Networks

Residual networks have shown great success and become indispensable in r...

0 Peng Ye, et al. ∙

research

∙ 04/25/2023

Seeing is not always believing: A Quantitative Study on Human Perception of AI-Generated Images

Photos serve as a way for humans to record what they experience in their...

0 zeyu-lu, et al. ∙

research

∙ 04/06/2023

FengWu: Pushing the Skillful Global Medium-range Weather Forecast beyond 10 Days Lead

We present FengWu, an advanced data-driven global medium-range weather f...

0 Kang Chen, et al. ∙

research

∙ 03/22/2023

Automatically Predict Material Properties with Microscopic Image Example Polymer Compatibility

Many material properties are manifested in the morphological appearance ...

0 Zhilong Liang, et al. ∙

research

∙ 03/10/2023

HumanBench: Towards General Human-centric Perception with Projector Assisted Pretraining

Human-centric perceptions include a variety of vision tasks, which have ...

0 Shixiang Tang, et al. ∙

research

∙ 03/06/2023

UniHCP: A Unified Model for Human-Centric Perceptions

Human-centric perceptions (e.g., pose estimation, human parsing, pedestr...

0 Yuanzheng Ci, et al. ∙

research

∙ 03/03/2023

Multi-Scale Control Signal-Aware Transformer for Motion Synthesis without Phase

Synthesizing controllable motion for a character using deep learning has...

0 Lintao Wang, et al. ∙

research

∙ 02/22/2023

Saliency Guided Contrastive Learning on Scene Images

Self-supervised learning holds promise in leveraging large numbers of un...

0 Meilin Chen, et al. ∙

research

∙ 01/29/2023

Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline

Recently, perception task based on Bird's-Eye View (BEV) representation ...

0 Yangguang Li, et al. ∙

research

∙ 01/16/2023

β-DARTS++: Bi-level Regularization for Proxy-robust Differentiable Architecture Search

Neural Architecture Search has attracted increasing attention in recent ...

0 Peng Ye, et al. ∙

research

∙ 12/31/2022

Ponder: Point Cloud Pre-training via Neural Rendering

We propose a novel approach to self-supervised learning of point cloud r...

0 Di Huang, et al. ∙

research

∙ 12/20/2022

MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency

Masked Modeling (MM) has demonstrated widespread success in various visi...

0 Mingye Xu, et al. ∙

research

∙ 12/17/2022

3D Point Cloud Pre-training with Knowledge Distillation from 2D Images

The recent success of pre-trained 2D vision models is mostly attributabl...

0 Yuan Yao, et al. ∙

research

∙ 12/08/2022

Frozen CLIP Model is An Efficient Point Cloud Backbone

The pretraining-finetuning paradigm has demonstrated great success in NL...

0 Xiaoshui Huang, et al. ∙

research

∙ 12/06/2022

GD-MAE: Generative Decoder for MAE Pre-training on LiDAR Point Clouds

Despite the tremendous progress of Masked Autoencoders (MAE) in developi...

0 Honghui Yang, et al. ∙

research

∙ 11/30/2022

Reconstructing Hand-Held Objects from Monocular Video

This paper presents an approach that reconstructs a hand-held object fro...

0 Di Huang, et al. ∙

research

∙ 11/17/2022

3D-QueryIS: A Query-based Framework for 3D Instance Segmentation

Previous top-performing methods for 3D instance segmentation often maint...

0 Jiaheng Liu, et al. ∙

research

∙ 11/14/2022

Boosting Semi-Supervised 3D Object Detection with Semi-Sampling

Current 3D object detection methods heavily rely on an enormous amount o...

0 Xiaopei Wu, et al. ∙

research

∙ 10/11/2022

The Equalization Losses: Gradient-Driven Training for Long-tailed Object Recognition

Long-tail distribution is widely spread in real-world applications. Due ...

8 Jingru Tan, et al. ∙

research

∙ 10/09/2022

Stimulative Training of Residual Networks: A Social Psychology Perspective of Loafing

Residual networks have shown great success and become indispensable in t...

6 Peng Ye, et al. ∙

research

∙ 10/03/2022

CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth Pre-training

Pre-training across 3D vision and language remains under development bec...

20 Tianyu Huang, et al. ∙

research

∙ 09/23/2022

Towards Frame Rate Agnostic Multi-Object Tracking

Multi-Object Tracking (MOT) is one of the most fundamental computer visi...

10 Weitao Feng, et al. ∙

research

∙ 08/23/2022

ZoomNAS: Searching for Whole-body Human Pose Estimation in the Wild

This paper investigates the task of 2D whole-body human pose estimation,...

6 Lumin Xu, et al. ∙

research

∙ 08/15/2022

An Empirical Study of Pseudo-Labeling for Image-based 3D Object Detection

Image-based 3D detection is an indispensable component of the perception...

5 Xinzhu Ma, et al. ∙

research

∙ 07/29/2022

Fine-grained Retrieval Prompt Tuning

Fine-grained object retrieval aims to learn discriminative representatio...

8 Shijie Wang, et al. ∙

research

∙ 07/22/2022

3D Interacting Hand Pose Estimation by Hand De-occlusion and Removal

Estimating 3D interacting hand pose from a single RGB image is essential...

7 Hao Meng, et al. ∙

research

∙ 07/21/2022

Pose for Everything: Towards Category-Agnostic Pose Estimation

Existing works on 2D pose estimation mainly focus on a certain category,...

6 Lumin Xu, et al. ∙

research

∙ 07/17/2022

Fast-MoCo: Boost Momentum-based Contrastive Learning with Combinatorial Patches

Contrastive-based self-supervised learning methods achieved great succes...

8 Yuanzheng Ci, et al. ∙

research

∙ 06/14/2022

TransVG++: End-to-End Visual Grounding with Language Conditioned Vision Transformer

In this work, we explore neat yet effective Transformer-based frameworks...

19 Jiajun Deng, et al. ∙

research

∙ 06/13/2022

Better Teacher Better Student: Dynamic Prior Knowledge for Knowledge Distillation

Knowledge distillation (KD) has shown very promising capabilities in tra...

37 Zengyu Qiu, et al. ∙

research

∙ 05/10/2022

Domain Invariant Masked Autoencoders for Self-supervised Learning from Multi-domains

Generalizing learned representations across significantly different visu...

13 Haiyang Yang, et al. ∙

research

∙ 05/03/2022

MS Lesion Segmentation: Revisiting Weighting Mechanisms for Federated Learning

Federated learning (FL) has been widely employed for medical image analy...

7 Dongnan Liu, et al. ∙

research

∙ 04/19/2022

Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer

Vision transformers have achieved great successes in many computer visio...

9 Wang Zeng, et al. ∙

research

∙ 04/04/2022

Unsupervised Learning of Accurate Siamese Tracking

Unsupervised learning has been popular in various computer vision tasks,...

16 Qiuhong Shen, et al. ∙

research

∙ 03/25/2022

SeCo: Separating Unknown Musical Visual Sounds with Consistency Guidance

Recent years have witnessed the success of deep learning on the visual s...

8 Xinchi Zhou, et al. ∙

Wanli Ouyang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro