Consistent Depth Prediction under Various Illuminations using Dilated Cross Attention

12/15/2021
by   Zitian Zhang, et al.
0

In this paper, we aim to solve the problem of consistent depth prediction in complex scenes under various illumination conditions. The existing indoor datasets based on RGB-D sensors or virtual rendering have two critical limitations - sparse depth maps (NYU Depth V2) and non-realistic illumination (SUN CG, SceneNet RGB-D). We propose to use internet 3D indoor scenes and manually tune their illuminations to render photo-realistic RGB photos and their corresponding depth and BRDF maps, obtaining a new indoor depth dataset called Vari dataset. We propose a simple convolutional block named DCA by applying depthwise separable dilated convolution on encoded features to process global information and reduce parameters. We perform cross attention on these dilated features to retain the consistency of depth prediction under different illuminations. Our method is evaluated by comparing it with current state-of-the-art methods on Vari dataset and a significant improvement is observed in our experiments. We also conduct the ablation study, finetune our model on NYU Depth V2 and also evaluate on real-world data to further validate the effectiveness of our DCA block. The code, pre-trained weights and Vari dataset are open-sourced.

READ FULL TEXT

page 5

page 6

page 7

page 8

page 11

page 12

page 13

page 14

research
06/21/2019

Deep RGB-D Canonical Correlation Analysis For Sparse Depth Completion

In this paper, we propose our Correlation For Completion Network (CFCNet...
research
03/22/2022

DepthGAN: GAN-based Depth Generation of Indoor Scenes from Semantic Layouts

Limited by the computational efficiency and accuracy, generating complex...
research
01/16/2013

Indoor Semantic Segmentation using depth information

This work addresses multi-class segmentation of indoor scenes with RGB-D...
research
11/28/2020

AdaBins: Depth Estimation using Adaptive Bins

We address the problem of estimating a high quality dense depth map from...
research
04/02/2018

MegaDepth: Learning Single-View Depth Prediction from Internet Photos

Single-view depth prediction is a fundamental problem in computer vision...
research
12/12/2021

BIPS: Bi-modal Indoor Panorama Synthesis via Residual Depth-aided Adversarial Learning

Providing omnidirectional depth along with RGB information is important ...
research
07/11/2018

Deep attention-based classification network for robust depth prediction

In this paper, we present our deep attention-based classification (DABC)...

Please sign up or login with your details

Forgot password? Click here to reset