STEP: Segmenting and Tracking Every Pixel

02/23/2021
by Mark Weber, et al.

In this paper, we tackle video panoptic segmentation, a task that requires assigning semantic classes and track identities to all pixels in a video. To study this important problem in a setting that requires a continuous interpretation of sensory data, we present a new benchmark: Segmenting and Tracking Every Pixel (STEP), encompassing two datasets, KITTI-STEP and MOTChallenge-STEP, together with a new evaluation metric. Our work is the first that targets this task in a real-world setting that requires dense interpretation in both spatial and temporal domains. As the ground truth for this task is difficult and expensive to obtain, existing datasets are either constructed synthetically or only sparsely annotated within short video clips. By contrast, our datasets contain long video sequences, providing challenging examples and a test-bed for studying long-term pixel-precise segmentation and tracking. To measure performance, we propose a novel evaluation metric, Segmentation and Tracking Quality (STQ), that fairly balances the semantic and tracking aspects of this task and is suitable for evaluating sequences of arbitrary length. We will make our datasets, metric, and baselines publicly available.
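
The abstract does not give the formula for STQ, so the following is only a minimal sketch of how a metric can balance a semantic term against a tracking term. It assumes the two terms are combined as a geometric mean, that the semantic term is a mean IoU over classes, and that a video is stored as an integer array of shape (frames, height, width). The names `semantic_iou` and `combine` are hypothetical, and the tracking (association) term is taken as a given number rather than computed.

```python
import numpy as np


def semantic_iou(pred_sem, gt_sem, num_classes):
    """Mean IoU over all classes that occur in the prediction or ground truth.

    pred_sem, gt_sem: integer arrays of shape (T, H, W) holding one semantic
    class per pixel for the whole video (assumed layout, not from the paper).
    """
    ious = []
    for c in range(num_classes):
        pred_mask = pred_sem == c
        gt_mask = gt_sem == c
        union = np.logical_or(pred_mask, gt_mask).sum()
        if union == 0:
            continue  # class absent from both; skip it
        intersection = np.logical_and(pred_mask, gt_mask).sum()
        ious.append(intersection / union)
    return float(np.mean(ious))


def combine(seg_quality, assoc_quality):
    """Geometric mean of the two terms (assumed form of the combination)."""
    return float(np.sqrt(seg_quality * assoc_quality))


# Toy video: one frame of 2x2 pixels, two semantic classes.
pred = np.array([[[0, 0], [1, 1]]])
gt = np.array([[[0, 1], [1, 1]]])
seg = semantic_iou(pred, gt, num_classes=2)
score = combine(seg, assoc_quality=0.5)  # 0.5 is a placeholder tracking score
```

With a geometric mean, a score of zero on either term drives the combined score to zero, so a method cannot compensate for poor tracking with good segmentation or the reverse. Because the semantic term is accumulated over all pixels of the video, the computation does not depend on the sequence length.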
