FroDO: From Detections to 3D Objects

05/11/2020
by Kejie Li, et al.

Object-oriented maps are important for scene understanding because they jointly capture geometry and semantics and allow individual objects to be instantiated and reasoned about. We introduce FroDO, a method for accurate 3D reconstruction of object instances from RGB video that infers object location, pose, and shape in a coarse-to-fine manner. Key to FroDO is embedding object shapes in a novel learnt space that allows seamless switching between sparse point cloud and dense DeepSDF decoding. Given an input sequence of localized RGB frames, FroDO first aggregates 2D detections to instantiate a category-aware 3D bounding box per object. An encoder network then regresses a shape code, after which shape and pose are optimized further under the learnt shape priors using both sparse and dense shape representations. The optimization uses multi-view geometric, photometric, and silhouette losses. We evaluate on real-world datasets, including Pix3D, Redwood-OS, and ScanNet, for single-view, multi-view, and multi-object reconstruction.
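
To make the first stage more concrete, here is a minimal sketch of one way per-frame 2D detections could be aggregated into a single 3D object location. It is an illustrative assumption, not FroDO's actual procedure, which the abstract does not spell out. Each detection is taken to have already been converted into a viewing ray (camera centre plus a direction through the detection centre), and the object centre is estimated as the least-squares point closest to all rays. The function names `closest_point_to_rays` and `_solve3` are hypothetical.

```python
from typing import List, Sequence, Tuple

Vec3 = Tuple[float, float, float]


def _normalize(v: Sequence[float]) -> Vec3:
    """Return v scaled to unit length."""
    n = sum(c * c for c in v) ** 0.5
    if n == 0.0:
        raise ValueError("zero-length ray direction")
    return (v[0] / n, v[1] / n, v[2] / n)


def _solve3(a: List[List[float]], b: List[float]) -> Vec3:
    """Solve a 3x3 linear system by Gauss-Jordan elimination with pivoting."""
    m = [list(row) + [rhs] for row, rhs in zip(a, b)]
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("rays are degenerate (for example, all parallel)")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(3):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return (m[0][3] / m[0][0], m[1][3] / m[1][1], m[2][3] / m[2][2])


def closest_point_to_rays(origins: Sequence[Vec3],
                          directions: Sequence[Vec3]) -> Vec3:
    """Least-squares 3D point nearest to a set of rays.

    Each ray i has origin o_i (camera centre) and direction d_i (through the
    2D detection centre). With P_i = I - d_i d_i^T, the minimiser of the summed
    squared point-to-ray distances satisfies (sum P_i) p = sum P_i o_i.
    """
    a = [[0.0] * 3 for _ in range(3)]
    b = [0.0] * 3
    for o, d in zip(origins, directions):
        d = _normalize(d)
        for i in range(3):
            for j in range(3):
                p_ij = (1.0 if i == j else 0.0) - d[i] * d[j]
                a[i][j] += p_ij
                b[i] += p_ij * o[j]
    return _solve3(a, b)


# Hypothetical example: three cameras all looking at the same object centre.
target = (1.0, 2.0, 3.0)
cams = [(0.0, 0.0, 0.0), (5.0, 0.0, 0.0), (0.0, 5.0, 0.0)]
rays = [tuple(t - c for t, c in zip(target, cam)) for cam in cams]
centre = closest_point_to_rays(cams, rays)
```

In a fuller pipeline, a centre estimated this way could seed a category-aware bounding box whose extent comes from a per-category size prior; that step is likewise an assumption and is omitted here.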


