VR3Dense: Voxel Representation Learning for 3D Object Detection and Monocular Dense Depth Reconstruction

04/13/2021
by Shubham Shrivastava, et al.

3D object detection and dense depth estimation are among the most vital tasks in autonomous driving. Multiple sensor modalities can jointly contribute to better robot perception, and to that end, we introduce a method for jointly training 3D object detection and monocular dense depth reconstruction neural networks. During inference, it takes a LiDAR point cloud and a single RGB image as inputs and produces object pose predictions as well as a densely reconstructed depth map. The LiDAR point cloud is converted into a set of voxels, and its features are extracted using 3D convolution layers, from which we regress object pose parameters. Corresponding RGB image features are extracted using a separate 2D convolutional neural network. We further use these combined features to predict a dense depth map. While our object detection is trained in a supervised manner, the depth prediction network is trained with both self-supervised and supervised loss functions. We also introduce a loss function, the edge-preserving smooth loss, and show that it results in better depth estimation than the edge-aware smooth loss function frequently used in depth prediction works.
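
As a rough illustration of the voxelization step mentioned in the abstract, the sketch below groups LiDAR points into cells of a regular 3D grid. It is only a minimal sketch under stated assumptions, not the authors' implementation: the function name `voxelize`, the parameters `voxel_size`, `range_min`, and `range_max`, and the choice to drop out-of-range points are all hypothetical, and the abstract does not specify grid resolution or per-voxel features.

```python
import math
from collections import defaultdict


def voxelize(points, voxel_size, range_min, range_max):
    """Group (x, y, z) points into voxels on a regular grid.

    Hypothetical helper for illustration only. Returns a dict mapping an
    integer voxel index (i, j, k) to the list of points inside that voxel.
    Points outside [range_min, range_max) are dropped (an assumption).
    """
    voxels = defaultdict(list)
    for p in points:
        # Skip points that fall outside the region of interest.
        if any(c < lo or c >= hi for c, lo, hi in zip(p, range_min, range_max)):
            continue
        # Integer cell index along each axis.
        idx = tuple(
            int(math.floor((c - lo) / s))
            for c, lo, s in zip(p, range_min, voxel_size)
        )
        voxels[idx].append(p)
    return dict(voxels)


# Toy example: four points, 1 m voxels, a 2 m x 2 m x 1 m region.
points = [(0.1, 0.1, 0.1), (0.2, 0.3, 0.1), (1.5, 0.2, 0.9), (5.0, 0.0, 0.0)]
grid = voxelize(points, (1.0, 1.0, 1.0), (0.0, 0.0, 0.0), (2.0, 2.0, 1.0))
for index, members in sorted(grid.items()):
    print(index, len(members))
```

In a detection pipeline, a sparse mapping like this would typically be converted into a dense occupancy or feature tensor before being passed to 3D convolution layers; how that is done in the paper is not stated in the abstract.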

research
10/29/2022

Boosting Monocular 3D Object Detection with Object-Centric Auxiliary Depth Supervision

Recent advances in monocular 3D detection leverage a depth estimation ne...
research
11/06/2020

Learning a Geometric Representation for Data-Efficient Depth Estimation via Gradient Field and Contrastive Loss

Estimating a depth map from a single RGB image has been investigated wid...
research
04/12/2020

Toward Hierarchical Self-Supervised Monocular Absolute Depth Estimation for Autonomous Driving Applications

In recent years, self-supervised methods for monocular depth estimation ...
research
04/07/2020

End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection

Reliable and accurate 3D object detection is a necessity for safe autono...
research
11/24/2021

SM3D: Simultaneous Monocular Mapping and 3D Detection

Mapping and 3D detection are two major issues in vision-based robotics, ...
research
06/07/2020

CubifAE-3D: Monocular Camera Space Cubification on Autonomous Vehicles for Auto-Encoder based 3D Object Detection

We introduce a method for 3D object detection using a single monocular i...
research
06/19/2023

Understanding Depth Map Progressively: Adaptive Distance Interval Separation for Monocular 3d Object Detection

Monocular 3D object detection aims to locate objects in different scenes...
