BUOL: A Bottom-Up Framework with Occupancy-aware Lifting for Panoptic 3D Scene Reconstruction From A Single Image

06/01/2023
by   Tao Chu, et al.

Understanding and modeling a 3D scene from a single image is a practical problem. A recent advance proposes the panoptic 3D scene reconstruction task, which performs both 3D reconstruction and 3D panoptic segmentation from a single image. Although substantial progress has been made, recent works focus only on top-down approaches that fill 2D instances into 3D voxels according to estimated depth, and their performance is hindered by two ambiguities. (1) Instance-channel ambiguity: because instance ids vary from scene to scene, filling voxel channels with 2D information is ambiguous, which confuses the subsequent 3D refinement. (2) Voxel-reconstruction ambiguity: 2D-to-3D lifting with estimated single-view depth propagates 2D information only onto the surface of 3D regions, which leads to ambiguity when reconstructing the regions behind the frontal-view surface. In this paper, we propose BUOL, a Bottom-Up framework with Occupancy-aware Lifting, to address these two issues for panoptic 3D scene reconstruction from a single image. For the instance-channel ambiguity, the bottom-up framework lifts 2D information to 3D voxels based on deterministic semantic assignments rather than arbitrary instance id assignments. The 3D voxels are then refined and grouped into 3D instances according to the predicted 2D instance centers. For the voxel-reconstruction ambiguity, the estimated multi-plane occupancy is used together with depth to fill the whole regions of things and stuff. Our method substantially outperforms state-of-the-art methods on the synthetic dataset 3D-Front and the real-world dataset Matterport3D. Code and models are available at https://github.com/chtsy/buol.
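
The two ideas in the abstract can be illustrated with a toy NumPy sketch. This is not the authors' implementation (see the linked repository for that); it assumes an orthographic camera so that pixel (row, col) maps directly to voxel (row, col, depth plane), treats label 0 as empty, and takes the instance centers as already given in voxel coordinates. The function names `lift_depth_only`, `lift_occupancy_aware`, and `group_by_centers` are hypothetical and chosen only for this illustration.

```python
import numpy as np

def lift_depth_only(semantic_2d, depth_idx, num_planes):
    """Place each pixel's semantic label on a single depth plane (surface only)."""
    h, w = semantic_2d.shape
    voxels = np.zeros((h, w, num_planes), dtype=semantic_2d.dtype)
    rows, cols = np.indices((h, w))
    voxels[rows, cols, depth_idx] = semantic_2d
    return voxels

def lift_occupancy_aware(semantic_2d, occupancy, threshold=0.5):
    """Copy each pixel's semantic label to every depth plane predicted as occupied."""
    occupied = occupancy > threshold                      # shape (H, W, D)
    return np.where(occupied, semantic_2d[:, :, None], 0)

def group_by_centers(voxels, thing_classes, centers):
    """Assign every 'thing' voxel to its nearest center (instance ids start at 1)."""
    instance_ids = np.zeros(voxels.shape, dtype=np.int64)
    thing_mask = np.isin(voxels, thing_classes)
    coords = np.argwhere(thing_mask)                      # shape (N, 3), row-major order
    centers = np.asarray(centers, dtype=float)
    if len(coords) == 0 or len(centers) == 0:
        return instance_ids
    dists = np.linalg.norm(coords[:, None, :] - centers[None, :, :], axis=-1)
    instance_ids[thing_mask] = dists.argmin(axis=1) + 1
    return instance_ids

# Toy scene: one image row, four pixels, three depth planes; class 2 is a "thing".
semantic = np.array([[2, 2, 0, 2]])
depth = np.zeros((1, 4), dtype=np.int64)                  # every surface on plane 0
occupancy = np.zeros((1, 4, 3))
occupancy[0, [0, 1, 3], :2] = 0.9                         # objects span planes 0 and 1

surface_only = lift_depth_only(semantic, depth, num_planes=3)
filled = lift_occupancy_aware(semantic, occupancy)
instances = group_by_centers(filled, [2], [[0, 0.5, 0.5], [0, 3.0, 0.5]])
```

In this toy scene the depth-only lifting labels only the front plane, while the occupancy-aware lifting also labels the voxels behind it. Because the voxel channels hold semantic classes, the channel layout stays fixed regardless of how many instances appear; instance ids emerge only at the grouping step.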

research
04/08/2021

Semantic Scene Completion via Integrating Instances and Scene in-the-Loop

Semantic Scene Completion aims at reconstructing a complete 3D scene wit...
research
10/13/2021

A Literature Review of 3D Face Reconstruction From a Single Image

This paper is a brief survey of the recent literature on 3D face reconst...
research
07/16/2023

Multi-Object Discovery by Low-Dimensional Object Motion

Recent work in unsupervised multi-object segmentation shows impressive r...
research
06/27/2023

Symphonize 3D Semantic Scene Completion with Contextual Instance Queries

3D Semantic Scene Completion (SSC) has emerged as a nascent and pivotal ...
research
09/17/2018

Seuillage par hystérésis pour le test de photo-consistance des voxels dans le cadre de la reconstruction 3D

Voxel coloring is a popular method of reconstructing a three-dimensional...
research
02/26/2019

Single-Image Piece-wise Planar 3D Reconstruction via Associative Embedding

Single-image piece-wise planar 3D reconstruction aims to simultaneously ...
research
10/04/2022

HandFlow: Quantifying View-Dependent 3D Ambiguity in Two-Hand Reconstruction with Normalizing Flow

Reconstructing two-hand interactions from a single image is a challengin...
