Voxel Field Fusion for 3D Object Detection

05/31/2022
by   Yanwei Li, et al.

In this work, we present a conceptually simple yet effective framework for cross-modality 3D object detection, named voxel field fusion. The proposed approach aims to maintain cross-modality consistency by representing and fusing augmented image features as rays in the voxel field. To this end, a learnable sampler is first designed to sample vital features from the image plane; these features are projected to the voxel grid in a point-to-ray manner, which keeps the feature representation consistent with its spatial context. In addition, ray-wise fusion is conducted to fuse features with the supplemental context in the constructed voxel field. We further develop a mixed augmentor to align feature-variant transformations, which bridges the modality gap in data augmentation. The proposed framework is demonstrated to achieve consistent gains on various benchmarks and outperforms previous fusion-based methods on the KITTI and nuScenes datasets. Code is made available at https://github.com/dvlab-research/VFF.
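
To make the point-to-ray idea concrete, the following is a minimal geometric sketch rather than the authors' implementation. It assumes a pinhole camera without skew, a voxel grid expressed directly in the camera frame, and uniform depth samples; the function name `pixel_to_ray_voxels` and all of its parameters are hypothetical. The learnable sampler and the ray-wise fusion described in the abstract are not modelled here; the sketch only shows how a single image point maps to a set of voxels along its viewing ray.

```python
import numpy as np

def pixel_to_ray_voxels(pixel, fx, fy, cx, cy, depths,
                        grid_min, voxel_size, grid_shape):
    """Return the voxel indices crossed by the camera ray through `pixel`.

    Hypothetical helper: assumes a skew-free pinhole camera and a voxel
    grid defined in the camera coordinate frame.
    """
    u, v = pixel
    # Back-project the pixel to a ray direction with unit depth (z = 1).
    direction = np.array([(u - cx) / fx, (v - cy) / fy, 1.0])
    # Sample 3D points along the ray, one per depth value: shape (D, 3).
    points = np.asarray(depths, dtype=float)[:, None] * direction[None, :]
    # Convert metric coordinates to integer voxel indices.
    idx = np.floor((points - np.asarray(grid_min)) / voxel_size).astype(int)
    # Discard samples that fall outside the grid.
    inside = np.all((idx >= 0) & (idx < np.asarray(grid_shape)), axis=1)
    # Several depth samples may land in the same voxel; keep each once.
    return np.unique(idx[inside], axis=0)

# Example: the principal-point pixel maps to a straight column of voxels.
ray_voxels = pixel_to_ray_voxels(
    pixel=(50, 50), fx=100.0, fy=100.0, cx=50.0, cy=50.0,
    depths=np.arange(0.5, 5.0, 1.0),
    grid_min=(-2.5, -2.5, 0.0), voxel_size=1.0, grid_shape=(5, 5, 4),
)
```

In a fusion pipeline, an image feature sampled at `pixel` could then be written into every voxel in `ray_voxels`, so that a single image point contributes along its whole ray instead of at one estimated depth.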

research
06/01/2022

Unifying Voxel-based Representation with Transformer for 3D Object Detection

In this work, we present a unified framework for multi-modality 3D objec...
research
07/18/2023

MLF-DET: Multi-Level Fusion for Cross-Modal 3D Object Detection

In this paper, we propose a novel and effective Multi-Level Fusion netwo...
research
08/08/2021

From Voxel to Point: IoU-guided 3D Object Detection for Point Cloud with Voxel-to-Point Decoder

In this paper, we present an Intersection-over-Union (IoU) guided two-st...
research
03/07/2023

LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion

LiDAR-camera fusion methods have shown impressive performance in 3D obje...
research
04/19/2023

MMDR: A Result Feature Fusion Object Detection Approach for Autonomous System

Object detection has been extensively utilized in autonomous systems in ...
research
07/17/2020

EPNet: Enhancing Point Features with Image Semantics for 3D Object Detection

In this paper, we aim at addressing two critical issues in the 3D detect...
research
11/01/2021

VPFNet: Voxel-Pixel Fusion Network for Multi-class 3D Object Detection

Many LiDAR-based methods for detecting large objects, single-class objec...
