Lewei Lu

research

∙ 06/08/2023

ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process

Image recognition and generation have long been developed independently ...

1 Changyao Tian, et al. ∙

research

∙ 06/05/2023

Scene as Occupancy

Human driver can easily describe the complex traffic scene by visual sys...

0 Wenwen Tong, et al. ∙

research

∙ 05/25/2023

Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory

The captivating realm of Minecraft has attracted substantial research in...

0 Xizhou Zhu, et al. ∙

research

∙ 03/18/2023

3D Data Augmentation for Driving Scenes on Camera

Driving scenes are extremely diverse and complicated that it is impossib...

1 Wenwen Tong, et al. ∙

research

∙ 03/14/2023

Modeling Continuous Motion for 3D Point Cloud Object Tracking

The task of 3D single object tracking (SOT) with LiDAR point clouds is c...

0 Zhipeng Luo, et al. ∙

research

∙ 12/20/2022

Goal-oriented Autonomous Driving

Modern autonomous driving system is characterized as modular tasks in se...

0 Yihan Hu, et al. ∙

research

∙ 11/18/2022

BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision

We present a novel bird's-eye-view (BEV) detector with perspective super...

0 Chenyu Yang, et al. ∙

research

∙ 11/17/2022

Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information

To effectively exploit the potential of large-scale models, various pre-...

0 Weijie Su, et al. ∙

research

∙ 11/10/2022

Demystify Transformers & Convolutions in Modern Image Deep Networks

Recent success of vision transformers has inspired a series of vision ba...

0 Jifeng Dai, et al. ∙

research

∙ 11/10/2022

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Compared to the great progress of large-scale vision transformers (ViTs)...

0 Wenhai Wang, et al. ∙

research

∙ 09/07/2021

FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting

Transformer, as a strong and flexible architecture for modelling long-ra...

8 Rui Liu, et al. ∙

research

∙ 04/14/2021

Decoupled Spatial-Temporal Transformer for Video Inpainting

Video inpainting aims to fill the given spatiotemporal holes with realis...

27 Rui Liu, et al. ∙

research

∙ 10/08/2020

Deformable DETR: Deformable Transformers for End-to-End Object Detection

DETR has been recently proposed to eliminate the need for many hand-desi...

10 Xizhou Zhu, et al. ∙

research

∙ 09/03/2020

1st Place Solution of LVIS Challenge 2020: A Good Box is not a Guarantee of a Good Mask

This article introduces the solutions of the team lvisTraveler for LVIS ...

1 Jingru Tan, et al. ∙

research

∙ 08/22/2019

VL-BERT: Pre-training of Generic Visual-Linguistic Representations

We introduce a new pre-trainable generic representation for visual-lingu...

8 Weijie Su, et al. ∙
