Localization and Mapping using Instance-specific Mesh Models

03/08/2021
by   Qiaojun Feng, et al.

This paper focuses on building semantic maps, containing object poses and shapes, using a monocular camera. This is an important problem because robots need rich understanding of geometry and context if they are to shape the future of transportation, construction, and agriculture. Our contribution is an instance-specific mesh model of object shape that can be optimized online based on semantic information extracted from camera images. Multi-view constraints on the object shape are obtained by detecting objects and extracting category-specific keypoints and segmentation masks. We show that the errors between projections of the mesh model and the observed keypoints and masks can be differentiated in order to obtain accurate instance-specific object shapes. We evaluate the performance of the proposed approach in simulation and on the KITTI dataset by building maps of car poses and shapes.
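The differentiable keypoint term described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a single view, a simple pinhole camera, a known object pose, and keypoints that coincide with mesh vertices, and it leaves out the segmentation-mask term entirely. The names `project` and `keypoint_loss_and_grad`, and the parameters `f` (focal length) and `c` (principal point), are hypothetical and chosen only for illustration.

```python
import numpy as np


def project(points_cam, f, c):
    """Pinhole projection of N x 3 camera-frame points to N x 2 pixel coordinates."""
    z = points_cam[:, 2:3]
    return f * points_cam[:, :2] / z + c


def keypoint_loss_and_grad(vertices, R, t, f, c, observed):
    """Half squared reprojection error and its gradient w.r.t. object-frame vertices.

    vertices: N x 3 keypoint vertices in the object frame (assumed layout)
    R, t:     object-to-camera rotation (3 x 3) and translation (3,)
    observed: N x 2 detected keypoint locations in pixels
    """
    p_cam = vertices @ R.T + t                 # object frame -> camera frame
    x, y, z = p_cam[:, 0], p_cam[:, 1], p_cam[:, 2]
    residual = project(p_cam, f, c) - observed  # N x 2 pixel errors
    loss = 0.5 * np.sum(residual ** 2)

    # Projection Jacobian (transposed) applied to the residual, per point.
    g_cam = np.stack(
        [
            f * residual[:, 0] / z,
            f * residual[:, 1] / z,
            -f * (residual[:, 0] * x + residual[:, 1] * y) / z ** 2,
        ],
        axis=1,
    )
    # Chain rule through p_cam = R v + t (row-vector form).
    grad = g_cam @ R
    return loss, grad
```

Under these assumptions, a shape refinement step is a plain gradient update such as `vertices -= step * grad`, with a small `step` because the gradient scales with `(f / z) ** 2`. With several views, the per-view losses and gradients would simply be summed.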


research
10/21/2021

Multi-Category Mesh Reconstruction From Image Collections

Recently, learning frameworks have shown the capability of inferring the...
research
02/23/2023

Category-level Shape Estimation for Densely Cluttered Objects

Accurately estimating the shape of objects in dense clutters makes impor...
research
04/21/2022

Pixel2Mesh++: 3D Mesh Generation and Refinement from Multi-View Images

We study the problem of shape generation in 3D mesh representation from ...
research
06/17/2023

Multi-view 3D Object Reconstruction and Uncertainty Modelling with Neural Shape Prior

3D object reconstruction is important for semantic scene understanding. ...
research
04/22/2022

Implicit Object Mapping With Noisy Data

Modelling individual objects as Neural Radiance Fields (NeRFs) within a ...
research
08/05/2019

Pixel2Mesh++: Multi-View 3D Mesh Generation via Deformation

We study the problem of shape generation in 3D mesh representation from ...
research
12/09/2020

MO-LTR: Multiple Object Localization, Tracking, and Reconstruction from Monocular RGB Videos

Semantic aware reconstruction is more advantageous than geometric-only r...
