Kinematic 3D Object Detection in Monocular Video

07/19/2020
by Garrick Brazil, et al.

Perceiving the physical world in 3D is fundamental for self-driving applications. Although temporal motion is an invaluable resource to human vision for detection, tracking, and depth perception, such features have not been thoroughly utilized in modern 3D object detectors. In this work, we propose a novel method for monocular video-based 3D object detection which carefully leverages kinematic motion to improve the precision of 3D localization. Specifically, we first propose a novel decomposition of object orientation as well as a self-balancing 3D confidence. We show that both components are critical for our kinematic model to work effectively. Collectively, using only a single model, we efficiently leverage 3D kinematics from monocular videos to improve the overall localization precision in 3D object detection while also producing useful by-products of scene dynamics (ego-motion and per-object velocity). We achieve state-of-the-art performance on the monocular 3D object detection and Bird's Eye View tasks of the KITTI self-driving dataset.

research
12/28/2021

The Devil is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection

Low-cost monocular 3D object detection plays a fundamental role in auton...
research
07/13/2019

M3D-RPN: Monocular 3D Region Proposal Network for Object Detection

Understanding the world in 3D is a critical component of urban autonomou...
research
04/06/2021

Visual Vibration Tomography: Estimating Interior Material Properties from Monocular Video

An object's interior material properties, while invisible to the human e...
research
12/22/2022

Monocular 3D Object Detection using Multi-Stage Approaches with Attention and Slicing aided hyper inference

3D object detection is vital as it would enable us to capture objects' s...
research
04/08/2021

Geometry-based Distance Decomposition for Monocular 3D Object Detection

Monocular 3D object detection is of great significance for autonomous dr...
research
05/21/2019

aUToTrack: A Lightweight Object Detection and Tracking System for the SAE AutoDrive Challenge

The University of Toronto is one of eight teams competing in the SAE Aut...
research
04/14/2023

CAMM: Building Category-Agnostic and Animatable 3D Models from Monocular Videos

Animating an object in 3D often requires an articulated structure, e.g. ...
