GitNet: Geometric Prior-based Transformation for Birds-Eye-View Segmentation

04/16/2022
by   Shi Gong, et al.
0

Birds-eye-view (BEV) semantic segmentation is critical for autonomous driving for its powerful spatial representation ability. It is challenging to estimate the BEV semantic maps from monocular images due to the spatial gap, since it is implicitly required to realize both the perspective-to-BEV transformation and segmentation. We present a novel two-stage Geometry Prior-based Transformation framework named GitNet, consisting of (i) the geometry-guided pre-alignment and (ii) ray-based transformer. In the first stage, we decouple the BEV segmentation into the perspective image segmentation and geometric prior-based mapping, with explicit supervision by projecting the BEV semantic labels onto the image plane to learn visibility-aware features and learnable geometry to translate into BEV space. Second, the pre-aligned coarse BEV features are further deformed by ray-based transformers to take visibility knowledge into account. GitNet achieves the leading performance on the challenging nuScenes and Argoverse Datasets. The code will be publicly available.

READ FULL TEXT

page 4

page 7

page 11

page 14

research
07/09/2023

Parametric Depth Based Feature Representation Learning for Object Detection and Segmentation in Bird's Eye View

Recent vision-only perception models for autonomous driving achieved pro...
research
08/06/2021

Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images

Bird's-Eye-View (BEV) maps have emerged as one of the most powerful repr...
research
03/08/2022

BEVSegFormer: Bird's Eye View Semantic Segmentation From Arbitrary Camera Rigs

Semantic segmentation in bird's eye view (BEV) is an important task for ...
research
04/11/2022

HFT: Lifting Perspective Representations via Hybrid Feature Transformation

Autonomous driving requires accurate and detailed Bird's Eye View (BEV) ...
research
03/15/2023

Implicit Ray-Transformers for Multi-view Remote Sensing Image Segmentation

The mainstream CNN-based remote sensing (RS) image semantic segmentation...
research
05/07/2023

Bi-Mapper: Holistic BEV Semantic Mapping for Autonomous Driving

A semantic map of the road scene, covering fundamental road elements, is...
research
05/31/2022

ViT-BEVSeg: A Hierarchical Transformer Network for Monocular Birds-Eye-View Segmentation

Generating a detailed near-field perceptual model of the environment is ...

Please sign up or login with your details

Forgot password? Click here to reset