Towards Domain Generalization for Multi-view 3D Object Detection in Bird-Eye-View

03/03/2023
by Shuo Wang, et al.

Multi-view 3D object detection (MV3D-Det) in Bird-Eye-View (BEV) has drawn extensive attention due to its low cost and high efficiency. Although new algorithms for camera-only 3D object detection are continuously being proposed, most of them risk drastic performance degradation when the domain of the input images differs from that of the training data. In this paper, we first analyze the causes of the domain gap for the MV3D-Det task. Based on the covariate shift assumption, we find that the gap is mainly attributable to the feature distribution of BEV, which is determined by the quality of both the depth estimation and the 2D image feature representation. To acquire a robust depth prediction, we propose to decouple the depth estimation from the intrinsic parameters of the camera (i.e., the focal length) by converting the prediction of metric depth to that of scale-invariant depth, and to perform dynamic perspective augmentation that uses homography to increase the diversity of the extrinsic parameters (i.e., the camera poses). Moreover, we modify the focal length values to create multiple pseudo-domains and construct an adversarial training loss to encourage the feature representation to be more domain-agnostic. Without bells and whistles, our approach, named DG-BEV, alleviates the performance drop on the unseen target domain without impairing accuracy on the source domain. Extensive experiments on various public datasets, including Waymo, nuScenes, and Lyft, demonstrate the generalization and effectiveness of our approach. To the best of our knowledge, this is the first systematic study to explore a domain generalization method for MV3D-Det.
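The two depth-related ideas in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation, and the abstract does not give their exact formulas. The sketch assumes that scale-invariant depth is metric depth rescaled by a reference focal length (`REF_FOCAL`, a hypothetical constant), and that the perspective augmentation is a pure camera rotation, for which the induced homography is the standard `H = K R K^-1`. All function names are illustrative.

```python
import numpy as np

REF_FOCAL = 1000.0  # hypothetical reference focal length in pixels (assumption)


def metric_to_scale_invariant(depth_m, focal_px, ref_focal=REF_FOCAL):
    """Rescale metric depth so the target does not depend on focal length.

    An object's pixel size is proportional to focal / depth, so depth / focal
    is what the image appearance actually constrains.
    """
    return np.asarray(depth_m, dtype=float) * ref_focal / focal_px


def scale_invariant_to_metric(depth_si, focal_px, ref_focal=REF_FOCAL):
    """Invert the rescaling using the known focal length of the camera."""
    return np.asarray(depth_si, dtype=float) * focal_px / ref_focal


def rotation_homography(K, pitch_rad):
    """Homography induced by rotating the camera about its x-axis."""
    c, s = np.cos(pitch_rad), np.sin(pitch_rad)
    R = np.array([[1.0, 0.0, 0.0],
                  [0.0, c, -s],
                  [0.0, s, c]])
    return K @ R @ np.linalg.inv(K)


def warp_points(H, pts):
    """Apply a 3x3 homography to an (N, 2) array of pixel coordinates."""
    pts = np.asarray(pts, dtype=float)
    pts_h = np.hstack([pts, np.ones((len(pts), 1))])  # to homogeneous coords
    out = pts_h @ H.T
    return out[:, :2] / out[:, 2:3]


# Illustrative intrinsics: focal length 1000 px, principal point (800, 450).
K = np.array([[1000.0, 0.0, 800.0],
              [0.0, 1000.0, 450.0],
              [0.0, 0.0, 1.0]])

# The same object looks equally large at 20 m with f=1000 and 40 m with f=2000,
# so both map to the same scale-invariant depth.
d_a = metric_to_scale_invariant(20.0, 1000.0)
d_b = metric_to_scale_invariant(40.0, 2000.0)

# A small random pitch perturbation shifts the principal point vertically.
H = rotation_homography(K, np.deg2rad(2.0))
shifted = warp_points(H, [[800.0, 450.0]])
```

Under these assumptions, a network that predicts the rescaled depth can be trained on cameras with different focal lengths and its output converted back to metres with `scale_invariant_to_metric` at inference time. In practice the same homography would be applied to the whole image, for example with an image-warping routine, rather than to individual points.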

research
09/13/2022

A Benchmark and a Baseline for Robust Multi-view Depth Estimation

Recent deep learning approaches for multi-view depth estimation are empl...
research
04/25/2022

Graph-DETR3D: Rethinking Overlapping Regions for Multi-View 3D Object Detection

3D object detection from multiple image views is a fundamental and chall...
research
01/13/2023

OA-BEV: Bringing Object Awareness to Bird's-Eye-View Representation for Multi-Camera 3D Object Detection

The recent trend for multi-camera 3D object detection is through the uni...
research
05/23/2022

Towards Model Generalization for Monocular 3D Object Detection

Monocular 3D object detection (Mono3D) has achieved tremendous improveme...
research
10/31/2022

Multi-Camera Calibration Free BEV Representation for 3D Object Detection

In advanced paradigms of autonomous driving, learning Bird's Eye View (B...
research
11/03/2022

Progressive Transformation Learning For Leveraging Virtual Images in Training

To effectively interrogate UAV-based images for detecting objects of int...
research
01/25/2023

On the Adversarial Robustness of Camera-based 3D Object Detection

In recent years, camera-based 3D object detection has gained widespread ...
