OCTraN: 3D Occupancy Convolutional Transformer Network in Unstructured Traffic Scenarios

07/20/2023
by   Aditya Nalgunda Ganesh, et al.

Modern approaches to vision-centric environment perception for autonomous navigation make extensive use of self-supervised monocular depth estimation algorithms that output disparity maps. However, when a disparity map is projected into 3D space, the errors in disparity are magnified, resulting in a depth estimation error that increases quadratically with distance from the camera. Though Light Detection and Ranging (LiDAR) can solve this issue, it is expensive and not feasible for many applications. To address the challenge of accurate ranging with low-cost sensors, we propose OCTraN, a transformer architecture that uses iterative attention to convert 2D image features into 3D occupancy features and uses convolution and transposed convolution to operate efficiently on spatial information. We also develop a self-supervised training pipeline that generalizes the model to any scene: it eliminates the need for LiDAR ground truth by substituting pseudo-ground-truth labels obtained from boosted monocular depth estimation.
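
The quadratic growth of depth error mentioned above can be illustrated with a minimal sketch. It assumes the standard pinhole relation depth = focal length × baseline / disparity; the focal length, baseline, and disparity error values below are illustrative assumptions, not figures from the paper, and the function names are hypothetical.

```python
# Minimal sketch (not from the paper): how a fixed disparity error turns into
# a depth error that grows with the square of the distance.
# ASSUMPTIONS: pinhole model z = f * b / d; focal_px, baseline_m and
# disparity_error_px are illustrative values chosen for this example.

def depth_from_disparity(disparity_px, focal_px=720.0, baseline_m=0.54):
    """Convert a disparity in pixels to a depth in metres."""
    return focal_px * baseline_m / disparity_px


def depth_error(depth_m, disparity_error_px=0.5, focal_px=720.0, baseline_m=0.54):
    """First-order depth error: |dz| ~= z^2 / (f * b) * |dd|."""
    return depth_m ** 2 / (focal_px * baseline_m) * disparity_error_px


if __name__ == "__main__":
    for z in (5.0, 10.0, 20.0, 40.0):
        print(f"depth {z:5.1f} m -> approx. error {depth_error(z):.3f} m")
```

Under these assumptions, doubling the distance quadruples the depth error for the same disparity error, which is why distant structure is the hardest to range from disparity alone.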

research
03/31/2020

Self-supervised Monocular Trained Depth Estimation using Self-attention and Discrete Disparity Volume

Monocular depth estimation has become one of the most studied applicatio...
research
09/08/2021

LiDARTouch: Monocular metric depth estimation with a few-beam LiDAR

Vision-based depth estimation is a key feature in autonomous systems, wh...
research
02/20/2023

Self-Supervised Monocular Depth Estimation with Self-Reference Distillation and Disparity Offset Refinement

Monocular depth estimation plays a fundamental role in computer vision. ...
research
05/15/2020

Exploring the Capabilities and Limits of 3D Monocular Object Detection – A Study on Simulation and Real World Data

3D object detection based on monocular camera data is a key enabler for ...
research
12/31/2021

Sparse LiDAR Assisted Self-supervised Stereo Disparity Estimation

Deep stereo matching has made significant progress in recent years. Howe...
research
12/15/2019

BatVision: Learning to See 3D Spatial Layout with Two Ears

Virtual camera images showing the correct layout of a space ahead can be...
research
04/07/2020

Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation

In this paper, we propose a novel system named Disp R-CNN for 3D object ...
