Low-Latency Video Semantic Segmentation

04/02/2018
by   Yule Li, et al.
0

Recent years have seen remarkable progress in semantic segmentation. Yet, it remains a challenging task to apply segmentation techniques to video-based applications. Specifically, the high throughput of video streams, the sheer cost of running fully convolutional networks, together with the low-latency requirements in many real-world applications, e.g. autonomous driving, present a significant challenge to the design of the video segmentation framework. To tackle this combined challenge, we develop a framework for video semantic segmentation, which incorporates two novel components: (1) a feature propagation module that adaptively fuses features over time via spatially variant convolution, thus reducing the cost of per-frame computation; and (2) an adaptive scheduler that dynamically allocate computation based on accuracy prediction. Both components work together to ensure low latency while maintaining high segmentation quality. On both Cityscapes and CamVid, the proposed framework obtained competitive performance compared to the state of the art, while substantially reducing the latency, from 360 ms to 119 ms.

READ FULL TEXT

page 5

page 8

research
06/18/2020

Video Semantic Segmentation with Distortion-Aware Feature Correction

Video semantic segmentation is active in recent years benefited from the...
research
08/11/2016

Clockwork Convnets for Video Semantic Segmentation

Recent years have seen tremendous progress in still-image segmentation; ...
research
02/20/2019

An efficient solution for semantic segmentation: ShuffleNet V2 with atrous separable convolutions

Assigning a label to each pixel in an image, namely semantic segmentatio...
research
02/11/2022

Borrowing from yourself: Faster future video segmentation with partial channel update

Semantic segmentation is a well-addressed topic in the computer vision l...
research
05/16/2022

Real-time semantic segmentation on FPGAs for autonomous vehicles with hls4ml

In this paper, we investigate how field programmable gate arrays can ser...
research
05/05/2021

A Case Study of First Person Aiming at Low Latency for Esports

Lower computer system input-to-output latency substantially reduces many...
research
03/30/2020

TapLab: A Fast Framework for Semantic Video Segmentation Tapping into Compressed-Domain Knowledge

Real-time semantic video segmentation is a challenging task due to the s...

Please sign up or login with your details

Forgot password? Click here to reset