End-to-end Learning Improves Static Object Geo-localization in Monocular Video

04/10/2020
by Mohamed Chaabane, et al.

Accurately estimating the position of static objects, such as traffic lights, from the moving camera of a self-driving car is a challenging problem. In this work, we present a system that improves the localization of static objects by jointly optimizing its components via learning. Our system comprises networks that perform: 1) 6DoF object pose estimation from a single image, 2) association of objects between pairs of frames, and 3) multi-object tracking to produce the final geo-localization of the static objects within the scene. We evaluate our approach on a publicly available data set, focusing on traffic lights due to data availability. For each component, we compare against contemporary alternatives and show significantly improved performance. We also show that joint training of the constituent models further improves end-to-end system performance.
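
To make the final geo-localization step concrete, here is a minimal sketch of how per-frame estimates of a static object might be combined once objects have been associated across frames. It is not the authors' method: the abstract describes learned networks, whereas this uses a hand-written planar transform and a plain average. The names `camera_to_world` and `fuse_tracks`, the flat 2D world frame, the `(x, y, yaw)` camera pose, and the sample observations are all assumptions made for illustration.

```python
import math
from collections import defaultdict


def camera_to_world(pose, offset):
    """Map an object offset seen from the camera into a flat world frame.

    pose   -- (x, y, yaw): hypothetical camera position (m) and heading (rad)
    offset -- (forward, left): object position relative to the camera (m)
    """
    x, y, yaw = pose
    forward, left = offset
    world_x = x + forward * math.cos(yaw) - left * math.sin(yaw)
    world_y = y + forward * math.sin(yaw) + left * math.cos(yaw)
    return (world_x, world_y)


def fuse_tracks(observations):
    """Average the world-frame estimates that share a track id.

    observations -- iterable of (track_id, pose, offset) tuples
    Returns a dict mapping track_id to the mean (x, y) world position.
    """
    sums = defaultdict(lambda: [0.0, 0.0, 0])
    for track_id, pose, offset in observations:
        world_x, world_y = camera_to_world(pose, offset)
        acc = sums[track_id]
        acc[0] += world_x
        acc[1] += world_y
        acc[2] += 1
    return {tid: (sx / n, sy / n) for tid, (sx, sy, n) in sums.items()}


# Made-up observations: the same light seen from two camera poses,
# plus a second light seen once while the camera faces +y.
observations = [
    ("light_a", (0.0, 0.0, 0.0), (20.0, 2.0)),
    ("light_a", (10.0, 0.0, 0.0), (10.0, 2.0)),
    ("light_b", (0.0, 0.0, math.pi / 2), (5.0, 0.0)),
]
fused = fuse_tracks(observations)
```

Because the object is static, every observation of one track should map to the same world point, so averaging reduces per-frame noise. A real system would also need 3D geometry, a proper geodetic frame, and some handling of association errors.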


