SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation

06/06/2023
by   Xuewei Li, et al.
0

As an important and challenging problem in computer vision, PAnoramic Semantic Segmentation (PASS) gives complete scene perception based on an ultra-wide angle of view. Usually, prevalent PASS methods with 2D panoramic image input focus on solving image distortions but lack consideration of the 3D properties of original 360^∘ data. Therefore, their performance will drop a lot when inputting panoramic images with the 3D disturbance. To be more robust to 3D disturbance, we propose our Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation (SGAT4PASS), considering 3D spherical geometry knowledge. Specifically, a spherical geometry-aware framework is proposed for PASS. It includes three modules, i.e., spherical geometry-aware image projection, spherical deformable patch embedding, and a panorama-aware loss, which takes input images with 3D disturbance into account, adds a spherical geometry-aware constraint on the existing deformable patch embedding, and indicates the pixel density of original 360^∘ data, respectively. Experimental results on Stanford2D3D Panoramic datasets show that SGAT4PASS significantly improves performance and robustness, with approximately a 2 increase in mIoU, and when small 3D disturbances occur in the data, the stability of our performance is improved by an order of magnitude. Our code and supplementary material are available at https://github.com/TencentARC/SGAT4PASS.

READ FULL TEXT

page 1

page 3

page 6

page 12

page 13

research
07/25/2022

Behind Every Domain There is a Shift: Adapting Distortion-aware Vision Transformers for Panoramic Semantic Segmentation

In this paper, we address panoramic semantic segmentation, which provide...
research
07/14/2023

HEAL-SWIN: A Vision Transformer On The Sphere

High-resolution wide-angle fisheye images are becoming more and more imp...
research
03/02/2022

Bending Reality: Distortion-aware Transformers for Adapting to Panoramic Semantic Segmentation

Panoramic images with their 360-degree directional view encompass exhaus...
research
09/06/2017

360 Panorama Cloning on Sphere

In this paper, we address a novel problem of cloning a patch of the sour...
research
10/24/2022

SphNet: A Spherical Network for Semantic Pointcloud Segmentation

Semantic segmentation for robotic systems can enable a wide range of app...
research
01/22/2023

BallGAN: 3D-aware Image Synthesis with a Spherical Background

3D-aware GANs aim to synthesize realistic 3D scenes such that they can b...
research
04/10/2019

Spherical Regression: Learning Viewpoints, Surface Normals and 3D Rotations on n-Spheres

Many computer vision challenges require continuous outputs, but tend to ...

Please sign up or login with your details

Forgot password? Click here to reset