MO-LTR: Multiple Object Localization, Tracking, and Reconstruction from Monocular RGB Videos

12/09/2020
by Kejie Li, et al.

Semantic-aware reconstruction is more advantageous than geometry-only reconstruction for future robotic and AR/VR applications because it represents not only where things are, but also what they are. Object-centric mapping is the task of building an object-level reconstruction in which objects are separate, meaningful entities that convey both geometric and semantic information. In this paper, we present MO-LTR, a solution to object-centric mapping that uses only monocular image sequences and camera poses. It localizes, tracks, and reconstructs multiple objects online as an RGB camera captures a video of its surroundings. Given a new RGB frame, MO-LTR first applies a monocular 3D detector to localize objects of interest and extract their shape codes, which represent object shape in a learned embedding space. After data association, detections are merged into existing objects in the map. The motion state of each object (i.e., its kinematics and motion status) is tracked by a multiple-model Bayesian filter, and object shape is progressively refined by fusing multiple shape codes. We evaluate localization, tracking, and reconstruction on benchmark datasets for indoor and outdoor scenes and show superior performance over previous approaches.
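
The per-frame update described in the abstract (associate detections with mapped objects, then refine each object's state and shape code) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the names `MapObject`, `associate`, and `update_map` are hypothetical, association is a simple nearest-center gate, and a running mean stands in for both the multiple-model Bayesian filter and the shape-code fusion, whose actual formulations the abstract does not specify.

```python
from dataclasses import dataclass

import numpy as np


@dataclass
class MapObject:
    """Hypothetical map entry: a 3D center and a fused latent shape code."""
    center: np.ndarray
    shape_code: np.ndarray
    n_obs: int = 1


def associate(center, objects, max_dist):
    """Return the index of the nearest object within max_dist, else None."""
    if not objects:
        return None
    dists = [np.linalg.norm(o.center - center) for o in objects]
    best = int(np.argmin(dists))
    return best if dists[best] <= max_dist else None


def update_map(objects, detections, max_dist=1.0):
    """Merge one frame of (center, shape_code) detections into the map.

    Assumption: a running mean replaces the paper's Bayesian filter and
    its shape-code fusion; it only illustrates the incremental structure.
    """
    # Only objects that existed before this frame are association candidates.
    existing = objects[:]
    for center, code in detections:
        center = np.asarray(center, dtype=float)
        code = np.asarray(code, dtype=float)
        idx = associate(center, existing, max_dist)
        if idx is None:
            # Unmatched detection: start a new object in the map.
            objects.append(MapObject(center, code))
        else:
            obj = existing[idx]
            obj.n_obs += 1
            w = 1.0 / obj.n_obs  # weight of the newest observation
            obj.center = (1 - w) * obj.center + w * center
            obj.shape_code = (1 - w) * obj.shape_code + w * code
    return objects
```

Calling `update_map` once per frame with that frame's detections grows the map when new objects appear and averages repeated observations of the same object. A real system would replace the distance gate with a proper assignment step and the running mean with a motion-aware filter.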


research
08/09/2022

Learning to Complete Object Shapes for Object-level Mapping in Dynamic Scenes

In this paper, we propose a novel object-level mapping system that can s...
research
11/25/2020

DRACO: Weakly Supervised Dense Reconstruction And Canonicalization of Objects

We present DRACO, a method for Dense Reconstruction And Canonicalization...
research
09/24/2018

Real-Time Monocular Object-Model Aware Sparse SLAM

Simultaneous Localization And Mapping (SLAM) is a fundamental problem in...
research
11/13/2016

Semi-Dense 3D Semantic Mapping from Monocular SLAM

The bundle of geometry and appearance in computer vision has proven to b...
research
11/16/2017

3D Trajectory Reconstruction of Dynamic Objects Using Planarity Constraints

We present a method to reconstruct the three-dimensional trajectory of a...
research
03/08/2021

Localization and Mapping using Instance-specific Mesh Models

This paper focuses on building semantic maps, containing object poses an...
research
02/28/2023

Finding Things in the Unknown: Semantic Object-Centric Exploration with an MAV

Exploration of unknown space with an autonomous mobile robot is a well-s...
