DepthGAN: GAN-based Depth Generation of Indoor Scenes from Semantic Layouts

03/22/2022
by   Yidi Li, et al.
0

Limited by the computational efficiency and accuracy, generating complex 3D scenes remains a challenging problem for existing generation networks. In this work, we propose DepthGAN, a novel method of generating depth maps with only semantic layouts as input. First, we introduce a well-designed cascade of transformer blocks as our generator to capture the structural correlations in depth maps, which makes a balance between global feature aggregation and local attention. Meanwhile, we propose a cross-attention fusion module to guide edge preservation efficiently in depth generation, which exploits additional appearance supervision information. Finally, we conduct extensive experiments on the perspective views of the Structured3d panorama dataset and demonstrate that our DepthGAN achieves superior performance both on quantitative results and visual effects in the depth generation task.Furthermore, 3D indoor scenes can be reconstructed by our generated depth maps with reasonable structure and spatial coherency.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

page 7

research
12/15/2021

Consistent Depth Prediction under Various Illuminations using Dilated Cross Attention

In this paper, we aim to solve the problem of consistent depth predictio...
research
04/05/2022

Pyramid Frequency Network with Spatial Attention Residual Refinement Module for Monocular Depth Estimation

Deep-learning-based approaches to depth estimation are rapidly advancing...
research
06/19/2019

Learning to Reconstruct and Understand Indoor Scenes from Sparse Views

This paper proposes a new method for simultaneous 3D reconstruction and ...
research
06/06/2023

RDFC-GAN: RGB-Depth Fusion CycleGAN for Indoor Depth Completion

The raw depth image captured by indoor depth sensors usually has an exte...
research
04/12/2022

Towards Reliable Image Outpainting: Learning Structure-Aware Multimodal Fusion with Depth Guidance

Image outpainting technology generates visually reasonable content regar...
research
03/21/2023

360BEV: Panoramic Semantic Mapping for Indoor Bird's-Eye View

Seeing only a tiny part of the whole is not knowing the full circumstanc...
research
03/28/2023

SparseNeRF: Distilling Depth Ranking for Few-shot Novel View Synthesis

Neural Radiance Field (NeRF) significantly degrades when only a limited ...

Please sign up or login with your details

Forgot password? Click here to reset