Pixels, voxels, and views: A study of shape representations for single view 3D object shape prediction

04/17/2018
by   Daeyun Shin, et al.
0

The goal of this paper is to compare surface-based and volumetric 3D object shape representations, as well as viewer-centered and object-centered reference frames for single-view 3D shape prediction. We propose a new algorithm for predicting depth maps from multiple viewpoints, with a single depth or RGB image as input. By modifying the network and the way models are evaluated, we can directly compare the merits of voxels vs. surfaces and viewer-centered vs. object-centered for familiar vs. unfamiliar objects, as predicted from RGB or depth images. Among our findings, we show that surface-based methods outperform voxel representations for objects from novel classes and produce higher resolution outputs. We also find that using viewer-centered coordinates is advantageous for novel objects, while object-centered representations are better for more familiar objects. Interestingly, the coordinate frame significantly affects the shape representation learned, with object-centered placing more importance on implicitly recognizing the object category and viewer-centered producing shape representations with less dependence on category recognition.

READ FULL TEXT

page 4

page 6

page 7

research
02/18/2019

Multi-layer Depth and Epipolar Feature Transformers for 3D Scene Reconstruction

We tackle the problem of automatically reconstructing a complete 3D mode...
research
07/01/2019

Multiview Aggregation for Learning Category-Specific Shape Reconstruction

We investigate the problem of learning category-specific 3D surface shap...
research
05/11/2020

FroDO: From Detections to 3D Objects

Object-oriented maps are important for scene understanding since they jo...
research
07/13/2020

AUTO3D: Novel view synthesis through unsupervisely learned variational viewpoint and global 3D representation

This paper targets on learning-based novel view synthesis from a single ...
research
06/22/2014

3D ShapeNets: A Deep Representation for Volumetric Shapes

3D shape is a crucial but heavily underutilized cue in today's computer ...
research
02/01/2019

Lift-the-Flap: Context Reasoning Using Object-Centered Graphs

Children benefit from lift-the-flap books by taking on an active role in...
research
05/29/2023

Pix2Repair: Implicit Shape Restoration from Images

We present Pix2Repair, an automated shape repair approach that generates...

Please sign up or login with your details

Forgot password? Click here to reset