Cross Modal Transformer via Coordinates Encoding for 3D Object Detection

01/03/2023
by Junjie Yan, et al.

In this paper, we propose a robust 3D detector, named Cross Modal Transformer (CMT), for end-to-end 3D multi-modal detection. Without explicit view transformation, CMT takes image and point cloud tokens as inputs and directly outputs accurate 3D bounding boxes. The spatial alignment of the multi-modal tokens is performed implicitly, by encoding the 3D points into the multi-modal features. The core design of CMT is quite simple, yet its performance is impressive. CMT obtains 73.0 [...]. Moreover, CMT remains robust even when the LiDAR input is missing. Code will be released at https://github.com/junjie18/CMT.
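The idea of aligning tokens implicitly through coordinates can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not say how the 3D points are encoded, so a plain sinusoidal encoding is swapped in here, and the function names `sinusoidal_encode` and `add_coordinate_encoding`, the token dimension, and the example points are all hypothetical. The sketch only shows that tokens from either modality receive the same additive encoding when they are tied to the same 3D point, so they can share one token sequence without an explicit view transformation.

```python
import math
import random


def sinusoidal_encode(point, num_freqs=4):
    """Map a 3D point (x, y, z) to sin/cos features (an assumed stand-in encoder)."""
    features = []
    for coord in point:
        for k in range(num_freqs):
            freq = 2.0 ** k
            features.append(math.sin(freq * coord))
            features.append(math.cos(freq * coord))
    return features  # length is 3 * 2 * num_freqs


def add_coordinate_encoding(tokens, points, num_freqs=4):
    """Add the encoding of each token's 3D point to that token's feature vector."""
    encoded = []
    for token, point in zip(tokens, points):
        pe = sinusoidal_encode(point, num_freqs)
        if len(pe) != len(token):
            raise ValueError("token dimension must equal 6 * num_freqs")
        encoded.append([t + p for t, p in zip(token, pe)])
    return encoded


# Hypothetical toy data: two image tokens and two point cloud tokens.
rng = random.Random(0)
dim = 3 * 2 * 4
image_tokens = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(2)]
lidar_tokens = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(2)]

# Assumed 3D points tied to each token; the first image token and the
# first point cloud token refer to the same location in space.
image_points = [(1.0, 2.0, 0.5), (4.0, -1.0, 0.2)]
lidar_points = [(1.0, 2.0, 0.5), (-3.0, 0.0, 1.0)]

# Both modalities end up in one sequence that a transformer could consume.
sequence = (add_coordinate_encoding(image_tokens, image_points)
            + add_coordinate_encoding(lidar_tokens, lidar_points))
```

In this toy setup the first image token and the first point cloud token are shifted by the same vector because they share a 3D point, which is the sense in which the alignment is implicit rather than computed by projecting one modality into the other's view.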


