Patch2CAD: Patchwise Embedding Learning for In-the-Wild Shape Retrieval from a Single Image

08/20/2021
by   Weicheng Kuo, et al.
11

3D perception of object shapes from RGB image input is fundamental towards semantic scene understanding, grounding image-based perception in our spatially 3-dimensional real-world environments. To achieve a mapping between image views of objects and 3D shapes, we leverage CAD model priors from existing large-scale databases, and propose a novel approach towards constructing a joint embedding space between 2D images and 3D CAD models in a patch-wise fashion – establishing correspondences between patches of an image view of an object and patches of CAD geometry. This enables part similarity reasoning for retrieving similar CADs to a new image view without exact matches in the database. Our patch embedding provides more robust CAD retrieval for shape estimation in our end-to-end estimation of CAD model shape and pose for detected objects in a single input image. Experiments on in-the-wild, complex imagery from ScanNet show that our approach is more robust than state of the art in real-world scenarios without any exact CAD matches.

READ FULL TEXT

page 1

page 4

page 6

page 7

page 12

page 13

page 14

research
12/03/2021

ROCA: Robust CAD Model Retrieval and Alignment from a Single Image

We present ROCA, a novel end-to-end approach that retrieves and aligns 3...
research
07/26/2020

Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve

Object recognition has seen significant progress in the image domain, wi...
research
09/21/2020

3D-FUTURE: 3D Furniture shape with TextURE

The 3D CAD shapes in current 3D benchmarks are mostly collected from onl...
research
10/25/2017

Complete 3D Scene Parsing from Single RGBD Image

Inferring the location, shape, and class of each object in a single imag...
research
08/11/2023

U-RED: Unsupervised 3D Shape Retrieval and Deformation for Partial Point Clouds

In this paper, we propose U-RED, an Unsupervised shape REtrieval and Def...
research
05/08/2019

ShapeGlot: Learning Language for Shape Differentiation

In this work we explore how fine-grained differences between the shapes ...
research
01/19/2023

Multiview Compressive Coding for 3D Reconstruction

A central goal of visual recognition is to understand objects and scenes...

Please sign up or login with your details

Forgot password? Click here to reset