Time-to-Label: Temporal Consistency for Self-Supervised Monocular 3D Object Detection

03/04/2022
by   Issa Mouawad, et al.
0

Monocular 3D object detection continues to attract attention due to the cost benefits and wider availability of RGB cameras. Despite the recent advances and the ability to acquire data at scale, annotation cost and complexity still limit the size of 3D object detection datasets in the supervised settings. Self-supervised methods, on the other hand, aim at training deep networks relying on pretext tasks or various consistency constraints. Moreover, other 3D perception tasks (such as depth estimation) have shown the benefits of temporal priors as a self-supervision signal. In this work, we argue that the temporal consistency on the level of object poses, provides an important supervision signal given the strong prior on physical motion. Specifically, we propose a self-supervised loss which uses this consistency, in addition to render-and-compare losses, to refine noisy pose predictions and derive high-quality pseudo labels. To assess the effectiveness of the proposed method, we finetune a synthetically trained monocular 3D object detection model using the pseudo-labels that we generated on real data. Evaluation on the standard KITTI3D benchmark demonstrates that our method reaches competitive performance compared to other monocular self-supervised and supervised methods.

READ FULL TEXT

page 2

page 3

page 6

research
09/30/2020

Monocular Differentiable Rendering for Self-Supervised 3D Object Detection

3D object detection from monocular images is an ill-posed problem due to...
research
05/29/2023

View-to-Label: Multi-View Consistency for Self-Supervised 3D Object Detection

For autonomous vehicles, driving safely is highly dependent on the capab...
research
07/07/2022

Self-Supervised Velocity Estimation for Automotive Radar Object Detection Networks

This paper presents a method to learn the Cartesian velocity of objects ...
research
02/05/2021

Custom Object Detection via Multi-Camera Self-Supervised Learning

This paper proposes MCSSL, a self-supervised learning approach for build...
research
01/24/2022

Consistent 3D Hand Reconstruction in Video via self-supervised Learning

We present a method for reconstructing accurate and consistent 3D hands ...
research
12/31/2022

Tracking Passengers and Baggage Items using Multiple Overhead Cameras at Security Checkpoints

We introduce a novel framework to track multiple objects in overhead cam...
research
06/08/2023

2D Supervised Monocular 3D Object Detection by Global-to-Local 3D Reconstruction

With the advent of the big model era, the demand for data has become mor...

Please sign up or login with your details

Forgot password? Click here to reset