Xiaogang Wang

research

∙ 08/21/2023

CoNe: Contrast Your Neighbours for Supervised Image Classification

Image classification is a longstanding problem in computer vision and ma...

0 Mingkai Zheng, et al. ∙

research

∙ 06/08/2023

ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process

Image recognition and generation have long been developed independently ...

1 Changyao Tian, et al. ∙

research

∙ 06/08/2023

FlowFormer: A Transformer Architecture and Its Masked Cost Volume Autoencoding for Optical Flow

This paper introduces a novel transformer-based network architecture, Fl...

0 Zhaoyang Huang, et al. ∙

research

∙ 05/31/2023

A Unified Conditional Framework for Diffusion-based Image Restoration

Diffusion Probabilistic Models (DPMs) have recently shown remarkable per...

0 Yi Zhang, et al. ∙

research

∙ 05/25/2023

Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory

The captivating realm of Minecraft has attracted substantial research in...

0 Xizhou Zhu, et al. ∙

research

∙ 03/29/2023

Real-time Controllable Denoising for Image and Video

Controllable image denoising aims to generate clean samples with human p...

0 Zhaoyang Zhang, et al. ∙

research

∙ 03/06/2023

KBNet: Kernel Basis Network for Image Restoration

How to aggregate spatial information plays an essential role in learning...

0 Yi Zhang, et al. ∙

research

∙ 11/17/2022

Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks

Despite the remarkable success of foundation models, their task-specific...

30 Hao Li, et al. ∙

research

∙ 11/17/2022

Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information

To effectively exploit the potential of large-scale models, various pre-...

0 Weijie Su, et al. ∙

research

∙ 11/10/2022

Demystify Transformers & Convolutions in Modern Image Deep Networks

Recent success of vision transformers has inspired a series of vision ba...

0 Jifeng Dai, et al. ∙

research

∙ 11/10/2022

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Compared to the great progress of large-scale vision transformers (ViTs)...

0 Wenhai Wang, et al. ∙

research

∙ 09/19/2022

Magnetic Resonance Fingerprinting with compressed sensing and distance metric learning

Magnetic Resonance Fingerprinting (MRF) is a novel technique that simult...

7 Zhe Wang, et al. ∙

research

∙ 08/23/2022

ZoomNAS: Searching for Whole-body Human Pose Estimation in the Wild

This paper investigates the task of 2D whole-body human pose estimation,...

6 Lumin Xu, et al. ∙

research

∙ 08/10/2022

Learning Degradation Representations for Image Deblurring

In various learning-based image restoration tasks, such as image denoisi...

2 Dasong Li, et al. ∙

research

∙ 08/06/2022

Frozen CLIP Models are Efficient Video Learners

Video recognition has been dominated by the end-to-end learning paradigm...

0 Ziyi Lin, et al. ∙

research

∙ 08/03/2022

Online decentralized tracking for nonlinear time-varying optimal power flow of coupled transmission-distribution grids

The coordinated alternating current optimal power flow (ACOPF) for coupl...

0 Wentian Lu, et al. ∙

research

∙ 07/28/2022

A Hybrid Complex-valued Neural Network Framework with Applications to Electroencephalogram (EEG)

In this article, we present a new EEG signal classification framework by...

0 Hang Du, et al. ∙

research

∙ 07/21/2022

Pose for Everything: Towards Category-Agnostic Pose Estimation

Existing works on 2D pose estimation mainly focus on a certain category,...

6 Lumin Xu, et al. ∙

research

∙ 07/07/2022

Not All Models Are Equal: Predicting Model Transferability in a Self-challenging Fisher Space

This paper addresses an important problem of ranking the pre-trained dee...

0 Wenqi Shao, et al. ∙

research

∙ 06/25/2022

BIMS-PU: Bi-Directional and Multi-Scale Point Cloud Upsampling

The learning and aggregation of multi-scale features are essential in em...

10 Yechao Bai, et al. ∙

research

∙ 06/22/2022

No Attention is Needed: Grouped Spatial-temporal Shift for Simple and Efficient Video Restorers

Video restoration, aiming at restoring clear frames from degraded videos...

0 Dasong Li, et al. ∙

research

∙ 06/19/2022

3D Object Detection for Autonomous Driving: A Review and New Outlooks

Autonomous driving, in recent years, has been receiving increasing atten...

0 Jiageng Mao, et al. ∙

research

∙ 06/09/2022

Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs

To build an artificial neural network like the biological intelligence s...

0 Jinguo Zhu, et al. ∙

research

∙ 06/02/2022

Siamese Image Modeling for Self-Supervised Vision Representation Learning

Self-supervised learning (SSL) has delivered superior performance on a v...

0 Chenxin Tao, et al. ∙

research

∙ 05/10/2022

Efficient Burst Raw Denoising with Variance Stabilization and Multi-frequency Denoising Network

With the growing popularity of smartphones, capturing high-quality image...

0 Dasong Li, et al. ∙

research

∙ 04/19/2022

Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer

Vision transformers have achieved great successes in many computer visio...

9 Wang Zeng, et al. ∙

research

∙ 03/29/2022

Learning a Structured Latent Space for Unsupervised Point Cloud Completion

Unsupervised point cloud completion aims at estimating the corresponding...

0 Yingjie Cai, et al. ∙

research

∙ 03/25/2022

Point2Seq: Detecting 3D Objects as Sequences

We present a simple and effective framework, named Point2Seq, for 3D obj...

0 Yujing Xue, et al. ∙

research

∙ 03/24/2022

RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization

6-DoF object pose estimation from a monocular image is challenging, and ...

0 Yan Xu, et al. ∙

research

∙ 03/16/2022

Relational Self-Supervised Learning

Self-supervised Learning (SSL) including the mainstream contrastive lear...

0 Mingkai Zheng, et al. ∙

research

∙ 02/27/2022

Robust Self-Supervised LiDAR Odometry via Representative Structure Discovery and 3D Inherent Error Modeling

The correct ego-motion estimation basically relies on the understanding ...

0 Yan Xu, et al. ∙

research

∙ 01/13/2022

Learning Semantic Abstraction of Shape via 3D Region of Interest

In this paper, we focus on the two tasks of 3D shape abstraction and sem...

14 Haiyue Fang, et al. ∙

research

∙ 12/05/2021

Dynamic Token Normalization Improves Vision Transformer

Vision Transformer (ViT) and its variants (e.g., Swin, PVT) have achieve...

13 Wenqi Shao, et al. ∙

research

∙ 12/02/2021

Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot Tasks

Biological intelligence systems of animals perceive the world by integra...

4 Xizhou Zhu, et al. ∙

research

∙ 11/29/2021

IDR: Self-Supervised Image Denoising via Iterative Data Refinement

The lack of large-scale noisy-clean image pairs restricts supervised den...

2 Yi Zhang, et al. ∙

research

∙ 11/26/2021

VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition

Deep learning-based models encounter challenges when processing long-tai...

0 Changyao Tian, et al. ∙

research

∙ 11/24/2021

GreedyNASv2: Greedier Search with a Greedy Path Filter

Training a good supernet in one-shot NAS methods is difficult since the ...

0 Tao Huang, et al. ∙

research

∙ 10/10/2021

Weakly Supervised Contrastive Learning

Unsupervised visual representation learning has gained much attention fr...

0 Mingkai Zheng, et al. ∙

research

∙ 10/10/2021

Rethinking Noise Synthesis and Modeling in Raw Denoising

The lack of large-scale real raw image denoising dataset gives rise to c...

0 Yi Zhang, et al. ∙

research

∙ 09/07/2021

FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting

Transformer, as a strong and flexible architecture for modelling long-ra...

8 Rui Liu, et al. ∙

research

∙ 08/23/2021

Voxel-based Network for Shape Completion by Leveraging Edge Generation

Deep learning technique has yielded significant improvements in point cl...

0 Xiaogang Wang, et al. ∙

research

∙ 08/18/2021

LIGA-Stereo: Learning LiDAR Geometry Aware Representations for Stereo-based 3D Detector

Stereo-based 3D detection aims at detecting 3D object bounding boxes fro...

0 Xiaoyang Guo, et al. ∙

research

∙ 07/20/2021

ReSSL: Relational Self-Supervised Learning with Weak Augmentation

Self-supervised Learning (SSL) including the mainstream contrastive lear...

0 Mingkai Zheng, et al. ∙

research

∙ 06/25/2021

Vision Transformer Architecture Search

Recently, transformers have shown great superiority in solving computer ...

5 Xiu Su, et al. ∙

research

∙ 06/04/2021

Scalable Transformers for Neural Machine Translation

Transformer has been widely adopted in Neural Machine Translation (NMT) ...

0 Peng Gao, et al. ∙

research

∙ 05/21/2021

ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search

Human pose estimation has achieved significant progress in recent years....

3 Lumin Xu, et al. ∙

research

∙ 04/22/2021

Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation

While accurate lip synchronization has been achieved for arbitrary-subje...

13 Hang Zhou, et al. ∙

research

∙ 04/14/2021

Decoupled Spatial-Temporal Transformer for Video Inpainting

Video inpainting aims to fill the given spatiotemporal holes with realis...

27 Rui Liu, et al. ∙

research

∙ 04/13/2021

Visually Informed Binaural Audio Generation without Binaural Audios

Stereophonic audio, especially binaural audio, plays an essential role i...

0 Xudong Xu, et al. ∙

research

∙ 04/08/2021

Semantic Scene Completion via Integrating Instances and Scene in-the-Loop

Semantic Scene Completion aims at reconstructing a complete 3D scene wit...

0 Yingjie Cai, et al. ∙
