CenterLoc3D: Monocular 3D Vehicle Localization Network for Roadside Surveillance Cameras

03/28/2022
by   Tang Xinyao, et al.
0

Monocular 3D vehicle localization is an important task in Intelligent Transportation System (ITS) and Cooperative Vehicle Infrastructure System (CVIS), which is usually achieved by monocular 3D vehicle detection. However, depth information cannot be obtained directly by monocular cameras due to the inherent imaging mechanism, resulting in more challenging monocular 3D tasks. Most of the current monocular 3D vehicle detection methods leverage 2D detectors and additional geometric modules, which reduces the efficiency. In this paper, we propose a 3D vehicle localization network CenterLoc3D for roadside monocular cameras, which directly predicts centroid and eight vertexes in image space, and dimension of 3D bounding boxes without 2D detectors. In order to improve the precision of 3D vehicle localization, we propose a weighted-fusion module and a loss with spatial constraints embedding in CenterLoc3D. Firstly, the transformation matrix between 2D image space and 3D world space is solved by camera calibration. Secondly, vehicle type, centroid, eight vertexes and dimension of 3D vehicle bounding boxes are obtained by CenterLoc3D. Finally, centroid in 3D world space can be obtained by camera calibration and CenterLoc3D for 3D vehicle localization. To the best of our knowledge, this is the first application of 3D vehicle localization for roadside monocular cameras. Hence, we also propose a benchmark for this application including dataset (SVLD-3D), annotation tool (LabelImg-3D) and evaluation metrics. Through experimental validation, the proposed method achieves high accuracy and real-time performance.

READ FULL TEXT

page 2

page 3

page 4

page 5

page 9

page 10

page 14

page 15

research
03/29/2021

Monocular 3D Vehicle Detection Using Uncalibrated Traffic Cameras through Homography

This paper proposes a method to extract the position and pose of vehicle...
research
03/22/2017

Deep MANTA: A Coarse-to-fine Many-Task Network for joint 2D and 3D vehicle analysis from monocular image

In this paper, we present a novel approach, called Deep MANTA (Deep Many...
research
04/22/2019

Forward Vehicle Collision Warning Based on Quick Camera Calibration

Forward Vehicle Collision Warning (FCW) is one of the most important fun...
research
09/15/2023

An Efficient Wide-Range Pseudo-3D Vehicle Detection Using A Single Camera

Wide-range and fine-grained vehicle detection plays a critical role in e...
research
03/08/2020

Monocular 3D Object Detection in Cylindrical Images from Fisheye Cameras

Detecting objects in 3D from a monocular camera has been successfully de...
research
03/07/2023

Calibration-free BEV Representation for Infrastructure Perception

Effective BEV object detection on infrastructure can greatly improve tra...
research
02/01/2017

Evolving Boxes for Fast Vehicle Detection

We perform fast vehicle detection from traffic surveillance cameras. A n...

Please sign up or login with your details

Forgot password? Click here to reset