PanoFormer: Panorama Transformer for Indoor 360° Depth Estimation

03/17/2022
by   Zhijie Shen, et al.
0

Existing panoramic depth estimation methods based on convolutional neural networks (CNNs) focus on removing panoramic distortions, failing to perceive panoramic structures efficiently due to the fixed receptive field in CNNs. This paper proposes the panorama transformer (named PanoFormer) to estimate the depth in panorama images, with tangent patches from spherical domain, learnable token flows, and panorama specific metrics. In particular, we divide patches on the spherical tangent domain into tokens to reduce the negative effect of panoramic distortions. Since the geometric structures are essential for depth estimation, a self-attention module is redesigned with an additional learnable token flow. In addition, considering the characteristic of the spherical domain, we present two panorama-specific metrics to comprehensively evaluate the panoramic depth estimation models' performance. Extensive experiments demonstrate that our approach significantly outperforms the state-of-the-art (SOTA) methods. Furthermore, the proposed method can be effectively extended to solve semantic panorama segmentation, a similar pixel2pixel task. Code will be available.

READ FULL TEXT

page 6

page 8

page 11

page 12

page 13

page 17

page 18

page 19

research
08/29/2022

SphereDepth: Panorama Depth Estimation from Spherical Domain

The panorama image can simultaneously demonstrate complete information o...
research
07/14/2020

360^∘ Depth Estimation from Multiple Fisheye Images with Origami Crown Representation of Icosahedron

In this study, we present a method for all-around depth estimation from ...
research
02/20/2023

GlocalFuse-Depth: Fusing Transformers and CNNs for All-day Self-supervised Monocular Depth Estimation

In recent years, self-supervised monocular depth estimation has drawn mu...
research
03/13/2023

DEHRFormer: Real-time Transformer for Depth Estimation and Haze Removal from Varicolored Haze Scenes

Varicolored haze caused by chromatic casts poses haze removal and depth ...
research
04/16/2023

EGformer: Equirectangular Geometry-biased Transformer for 360 Depth Estimation

Estimating the depths of equirectangular (360) images (EIs) is challengi...
research
01/05/2023

All in Tokens: Unifying Output Space of Visual Tasks via Soft Token

Unlike language tasks, where the output space is usually limited to a se...
research
03/18/2022

Distortion-Tolerant Monocular Depth Estimation On Omnidirectional Images Using Dual-cubemap

Estimating the depth of omnidirectional images is more challenging than ...

Please sign up or login with your details

Forgot password? Click here to reset