7th AI Driving Olympics: 1st Place Report for Panoptic Tracking

12/09/2021
by   Rohit Mohan, et al.
0

In this technical report, we describe our EfficientLPT architecture that won the panoptic tracking challenge in the 7th AI Driving Olympics at NeurIPS 2021. Our architecture builds upon the top-down EfficientLPS panoptic segmentation approach. EfficientLPT consists of a shared backbone with a modified EfficientNet-B5 model comprising the proximity convolution module as the encoder followed by the range-aware FPN to aggregate semantically rich range-aware multi-scale features. Subsequently, we employ two task-specific heads, the scale-invariant semantic head and hybrid task cascade with feedback from the semantic head as the instance head. Further, we employ a novel panoptic fusion module to adaptively fuse logits from each of the heads to yield the panoptic tracking output. Our approach exploits three consecutive accumulated scans to predict locally consistent panoptic tracking IDs and also the overlap between the scans to predict globally consistent panoptic tracking IDs for a given sequence. The benchmarking results from the 7th AI Driving Olympics at NeurIPS 2021 show that our model is ranked #1 for the panoptic tracking task on the Panoptic nuScenes dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/23/2020

Robust Vision Challenge 2020 – 1st Place Report for Panoptic Segmentation

In this technical report, we present key details of our winning panoptic...
research
04/05/2020

EfficientPS: Efficient Panoptic Segmentation

Understanding the scene in which an autonomous robot operates is critica...
research
05/03/2019

Seamless Scene Segmentation

In this work we introduce a novel, CNN-based architecture that can be tr...
research
07/10/2018

Towards Head Motion Compensation Using Multi-Scale Convolutional Neural Networks

Head pose estimation and tracking is useful in variety of medical applic...
research
07/19/2023

Hierarchical Semantic Perceptual Listener Head Video Generation: A High-performance Pipeline

In dyadic speaker-listener interactions, the listener's head reactions a...
research
04/13/2023

DeepSegmenter: Temporal Action Localization for Detecting Anomalies in Untrimmed Naturalistic Driving Videos

Identifying unusual driving behaviors exhibited by drivers during drivin...

Please sign up or login with your details

Forgot password? Click here to reset