EfficientPS: Efficient Panoptic Segmentation

04/05/2020
by   Rohit Mohan, et al.
16

Understanding the scene in which an autonomous robot operates is critical for its competent functioning. Such scene comprehension necessitates recognizing instances of traffic participants along with general scene semantics which can be effectively addressed by the panoptic segmentation task. In this paper, we introduce the Efficient Panoptic Segmentation (EfficientPS) architecture that consists of a shared backbone which efficiently encodes and fuses semantically rich multi-scale features. We incorporate a new semantic head that aggregates fine and contextual features coherently and a new variant of Mask R-CNN as the instance head. We also propose a novel panoptic fusion module that congruously integrates the output logits from both the heads of our EfficientPS architecture to yield the final panoptic segmentation output. Additionally, we introduce the KITTI panoptic segmentation dataset that contains panoptic annotations for the popularly challenging KITTI benchmark. Extensive evaluations on Cityscapes, KITTI, Mapillary Vistas and Indian Driving Dataset demonstrate that our proposed architecture consistently sets the new state-of-the-art on all these four benchmarks while being the most efficient and fast panoptic segmentation architecture to date.

READ FULL TEXT

page 1

page 12

page 21

page 23

research
08/23/2020

Robust Vision Challenge 2020 – 1st Place Report for Panoptic Segmentation

In this technical report, we present key details of our winning panoptic...
research
05/03/2019

Seamless Scene Segmentation

In this work we introduce a novel, CNN-based architecture that can be tr...
research
12/09/2021

7th AI Driving Olympics: 1st Place Report for Panoptic Tracking

In this technical report, we describe our EfficientLPT architecture that...
research
02/16/2021

EfficientLPS: Efficient LiDAR Panoptic Segmentation

Panoptic segmentation of point clouds is a crucial task that enables aut...
research
05/29/2022

Perceiving the Invisible: Proposal-Free Amodal Panoptic Segmentation

Amodal panoptic segmentation aims to connect the perception of the world...
research
09/19/2023

RoadFormer: Duplex Transformer for RGB-Normal Semantic Road Scene Parsing

The recent advancements in deep convolutional neural networks have shown...
research
01/29/2020

Virtual KITTI 2

This paper introduces an updated version of the well-known Virtual KITTI...

Please sign up or login with your details

Forgot password? Click here to reset