NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection

07/27/2023
by Chenfeng Xu, et al.

We present NeRF-Det, a novel method for indoor 3D detection with posed RGB images as input. Unlike existing indoor 3D detection methods that struggle to model scene geometry, our method makes novel use of NeRF in an end-to-end manner to explicitly estimate 3D geometry, thereby improving 3D detection performance. Specifically, to avoid the significant extra latency associated with per-scene optimization of NeRF, we introduce sufficient geometry priors to enhance the generalizability of the NeRF-MLP. Furthermore, we subtly connect the detection and NeRF branches through a shared MLP, enabling an efficient adaptation of NeRF to detection and yielding geometry-aware volumetric representations for 3D detection. Our method outperforms state-of-the-art methods by 3.9 mAP and 3.1 mAP on the ScanNet and ARKitScenes benchmarks, respectively. We provide extensive analysis to shed light on how NeRF-Det works. As a result of our joint-training design, NeRF-Det generalizes well to unseen scenes for object detection, view synthesis, and depth estimation without requiring per-scene optimization. Code is available at <https://github.com/facebookresearch/NeRF-Det>.
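
The idea of a shared MLP that serves both branches can be illustrated with a toy example. The sketch below is not the authors' implementation, and the abstract does not specify the architecture. It assumes that the shared MLP predicts a per-point density, that the NeRF branch alpha-composites this density along a ray to estimate depth (standard volume rendering), and that the detection branch scales voxel features by the resulting opacity. All names (`make_mlp`, `density`, `render_depth`, `geometry_aware_volume`) are hypothetical, and the weights are random rather than trained.

```python
import math
import random


def make_mlp(in_dim, hidden_dim, seed=0):
    """Build a tiny one-hidden-layer MLP with a scalar output (random, untrained weights)."""
    rng = random.Random(seed)
    w1 = [[rng.uniform(-1, 1) for _ in range(in_dim)] for _ in range(hidden_dim)]
    w2 = [rng.uniform(-1, 1) for _ in range(hidden_dim)]
    return w1, w2


def density(mlp, feature):
    """Shared MLP (assumed role): map a per-point feature to a non-negative density."""
    w1, w2 = mlp
    hidden = [max(0.0, sum(w * x for w, x in zip(row, feature))) for row in w1]  # ReLU
    raw = sum(w * h for w, h in zip(w2, hidden))
    return math.log1p(math.exp(raw))  # softplus keeps the density >= 0


def opacity(sigma, step):
    """Convert a density into an opacity in [0, 1) over a sample of length `step`."""
    return 1.0 - math.exp(-sigma * step)


def render_depth(mlp, ray_features, depths, step):
    """NeRF branch (assumed): alpha-composite the sample depths along one ray."""
    transmittance, expected = 1.0, 0.0
    for feat, d in zip(ray_features, depths):
        a = opacity(density(mlp, feat), step)
        expected += transmittance * a * d  # weight = transmittance * opacity
        transmittance *= 1.0 - a
    return expected


def geometry_aware_volume(mlp, voxel_features, step):
    """Detection branch (assumed): scale each voxel feature by opacity from the same MLP."""
    return [[opacity(density(mlp, f), step) * x for x in f] for f in voxel_features]


# Toy data: five samples that double as ray samples and as voxel features.
mlp = make_mlp(in_dim=4, hidden_dim=8)
features = [[0.1 * i, 0.2, -0.3, 0.5] for i in range(5)]
depths = [0.5, 1.0, 1.5, 2.0, 2.5]

depth = render_depth(mlp, features, depths, step=0.5)
volume = geometry_aware_volume(mlp, features, step=0.5)
```

Because both functions call the same `density`, any training signal that improves the rendered depth would also change the weights applied to the detection features, which is one plausible reading of how joint training could yield a geometry-aware volume.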
