Multiview Compressive Coding for 3D Reconstruction

01/19/2023
by Chao-Yuan Wu, et al.

A central goal of visual recognition is to understand objects and scenes from a single image. 2D recognition has witnessed tremendous progress thanks to large-scale learning and general-purpose representations. Comparatively, 3D poses new challenges stemming from occlusions not depicted in the image. Prior works try to overcome these by inferring from multiple views or by relying on scarce CAD models and category-specific priors, which hinder scaling to novel settings. In this work, we explore single-view 3D reconstruction by learning generalizable representations inspired by advances in self-supervised learning. We introduce a simple framework that operates on 3D points of single objects or whole scenes, coupled with category-agnostic large-scale training from diverse RGB-D videos. Our model, Multiview Compressive Coding (MCC), learns to compress the input appearance and geometry to predict the 3D structure by querying a 3D-aware decoder. MCC's generality and efficiency allow it to learn from large-scale and diverse data sources with strong generalization to novel objects imagined by DALL·E 2 or captured in-the-wild with an iPhone.

research
09/03/2019

Few-Shot Generalization for Single-Image 3D Reconstruction via Priors

Recent work on single-view 3D reconstruction shows impressive results, b...
research
12/17/2020

Worldsheet: Wrapping the World in a 3D Sheet for View Synthesis from a Single Image

We present Worldsheet, a method for novel view synthesis using just a si...
research
08/20/2021

Patch2CAD: Patchwise Embedding Learning for In-the-Wild Shape Retrieval from a Single Image

3D perception of object shapes from RGB image input is fundamental towar...
research
07/22/2022

InfiniteNature-Zero: Learning Perpetual View Generation of Natural Scenes from Single Images

We present a method for learning to generate unbounded flythrough videos...
research
09/18/2017

Matterport3D: Learning from RGB-D Data in Indoor Environments

Access to large, diverse RGB-D datasets is critical for training RGB-D s...
research
07/18/2023

NU-MCC: Multiview Compressive Coding with Neighborhood Decoder and Repulsive UDF

Remarkable progress has been made in 3D reconstruction from single-view ...
research
07/23/2017

Compact Model Representation for 3D Reconstruction

3D reconstruction from 2D images is a central problem in computer vision...
