Distortion-Aware Network Pruning and Feature Reuse for Real-time Video Segmentation

06/20/2022
by Hyunsu Rhee, et al.

Real-time video segmentation is a crucial task for many real-world applications such as autonomous driving and robot control. State-of-the-art semantic segmentation models perform impressively but are often too heavy for real-time use, so researchers have proposed lightweight architectures that reach real-time speed at the expense of accuracy. In this paper, we propose a novel framework that speeds up any architecture with skip-connections for real-time vision tasks by exploiting the temporal locality in videos. Specifically, at the arrival of each frame, we transform the features from the previous frame and reuse them at specific spatial bins. We then perform partial computation of the backbone network only on the regions of the current frame that capture temporal differences between the current and previous frames. This is done by dynamically dropping out residual blocks using a gating mechanism that decides which blocks to drop based on inter-frame distortion. We validate our Spatial-Temporal Mask Generator (STMG) on video semantic segmentation benchmarks with multiple backbone networks, and show that our method substantially speeds up inference with minimal loss of accuracy.
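To make the reuse-and-recompute idea concrete, the following is a minimal sketch and not the authors' implementation. It assumes a single-channel frame stored as a nested list, a mean-absolute-difference distortion per spatial bin, and a fixed threshold in place of the learned gating mechanism; it also skips the feature transform applied to the previous frame. The names `bin_distortion`, `make_mask`, `update_features`, `bin_size`, and `threshold` are all hypothetical.

```python
from typing import Callable, List

Grid = List[List[float]]


def bin_distortion(prev: Grid, curr: Grid, bin_size: int) -> Grid:
    """Mean absolute inter-frame difference per spatial bin (assumed measure)."""
    h, w = len(curr), len(curr[0])
    out: Grid = []
    for by in range(0, h, bin_size):
        row = []
        for bx in range(0, w, bin_size):
            cells = [(y, x)
                     for y in range(by, min(by + bin_size, h))
                     for x in range(bx, min(bx + bin_size, w))]
            total = sum(abs(curr[y][x] - prev[y][x]) for y, x in cells)
            row.append(total / len(cells))
        out.append(row)
    return out


def make_mask(distortion: Grid, threshold: float) -> List[List[bool]]:
    """True marks a bin to recompute. A fixed threshold stands in for the
    learned gate described in the abstract (assumption)."""
    return [[d > threshold for d in row] for row in distortion]


def update_features(prev_feat: Grid, curr: Grid, mask: List[List[bool]],
                    bin_size: int, block: Callable[[float], float]) -> Grid:
    """Reuse previous features in unchanged bins; elsewhere run a toy
    residual block (skip connection plus block output)."""
    out = [row[:] for row in prev_feat]  # start from the reused features
    for y in range(len(curr)):
        for x in range(len(curr[0])):
            if mask[y // bin_size][x // bin_size]:
                out[y][x] = curr[y][x] + block(curr[y][x])
    return out


# Toy usage: only the top-left 2x2 bin changes between the two frames.
prev_frame = [[0.0] * 4 for _ in range(4)]
curr_frame = [row[:] for row in prev_frame]
curr_frame[0][0] = 4.0
prev_features = [[9.0] * 4 for _ in range(4)]

mask = make_mask(bin_distortion(prev_frame, curr_frame, 2), threshold=0.5)
features = update_features(prev_features, curr_frame, mask, 2, lambda v: 2.0 * v)
```

In this toy setting the block runs on one of four bins, which is where the speed-up would come from; a real system would apply the mask to convolutional feature maps and decide per residual block whether to execute it.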


