Lifting Object Detection Datasets into 3D

03/22/2015
by   Joao Carreira, et al.
While data has certainly taken center stage in computer vision in recent years, it can still be difficult to obtain in certain scenarios. In particular, acquiring ground truth 3D shapes of objects pictured in 2D images remains a challenging feat, and this has hampered progress in recognition-based object reconstruction from a single image. Here we propose to bypass previous solutions, such as 3D scanning or manual design, that scale poorly, and instead populate object category detection datasets semi-automatically with dense, per-object 3D reconstructions, bootstrapped from: (i) class labels, (ii) ground truth figure-ground segmentations and (iii) a small set of keypoint annotations. Our proposed algorithm first estimates camera viewpoint using rigid structure-from-motion and then reconstructs object shapes by optimizing over visual hull proposals guided by loose within-class shape similarity assumptions. The visual hull sampling process attempts to intersect an object's projection cone with the cones of minimal subsets of other similar objects among those pictured from certain vantage points. We show that our method is able to produce convincing per-object 3D reconstructions and to accurately estimate camera viewpoints on one of the most challenging existing object-category detection datasets, PASCAL VOC. We hope that our results will re-stimulate interest in joint object recognition and 3D reconstruction from a single image.
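To make the idea of intersecting projection cones concrete, the following is a minimal sketch of generic visual hull carving on a voxel grid. It is not the authors' algorithm: it assumes orthographic cameras given as 3x3 rotation matrices, binary silhouette masks already aligned to a common object-centered frame, and it omits the sampling over subsets of similar objects and the optimization over proposals described above. The function name `carve_visual_hull` and its parameters are hypothetical.

```python
import numpy as np

def carve_visual_hull(masks, rotations, grid_size=32, extent=1.0):
    """Keep the voxels whose projection lands inside every silhouette.

    Assumptions (illustrative only): orthographic cameras, each given as a
    3x3 rotation matrix, and binary masks covering [-extent, extent]^2.
    """
    # Voxel centers of a cube spanning [-extent, extent]^3.
    axis = np.linspace(-extent, extent, grid_size)
    X, Y, Z = np.meshgrid(axis, axis, axis, indexing="ij")
    points = np.stack([X.ravel(), Y.ravel(), Z.ravel()], axis=1)  # (N, 3)

    occupied = np.ones(len(points), dtype=bool)
    for mask, R in zip(masks, rotations):
        mask = np.asarray(mask, dtype=bool)
        h, w = mask.shape
        cam = points @ np.asarray(R, dtype=float).T  # rotate into camera frame
        # Orthographic projection: drop depth, map x to columns and y to rows.
        cols = np.round((cam[:, 0] + extent) / (2 * extent) * (w - 1)).astype(int)
        rows = np.round((cam[:, 1] + extent) / (2 * extent) * (h - 1)).astype(int)
        inside = (cols >= 0) & (cols < w) & (rows >= 0) & (rows < h)
        hit = np.zeros(len(points), dtype=bool)
        hit[inside] = mask[rows[inside], cols[inside]]
        # Intersecting the cones: a voxel survives only if every view agrees.
        occupied &= hit
    return occupied.reshape(grid_size, grid_size, grid_size)
```

Each additional silhouette can only remove voxels, so the result shrinks toward the object as views are added. In the setting of the abstract, the extra views would come from other similar objects of the same class rather than from the same physical instance.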

research
07/21/2020

Shape and Viewpoint without Keypoints

We present a learning framework that learns to recover the 3D shape, pos...
research
11/22/2014

Category-Specific Object Reconstruction from a Single Image

Object reconstruction from a single image -- in the wild -- is a problem...
research
09/05/2019

C3DPO: Canonical 3D Pose Networks for Non-Rigid Structure From Motion

We propose C3DPO, a method for extracting 3D models of deformable object...
research
11/22/2014

Virtual View Networks for Object Reconstruction

All that structure from motion algorithms "see" are sets of 2D points. W...
research
04/03/2018

3D Interpreter Networks for Viewer-Centered Wireframe Modeling

Understanding 3D object structure from a single image is an important bu...
research
12/01/2020

Unsupervised Part Discovery via Feature Alignment

Understanding objects in terms of their individual parts is important, b...
research
02/17/2022

Visual Ground Truth Construction as Faceted Classification

Recent work in Machine Learning and Computer Vision has provided evidenc...
