BlockCopy: High-Resolution Video Processing with Block-Sparse Feature Propagation and Online Policies

08/20/2021
by   Thomas Verelst, et al.

In this paper we propose BlockCopy, a scheme that accelerates pretrained frame-based CNNs so that they process video more efficiently than standard frame-by-frame processing. To this end, a lightweight policy network determines important regions in an image, and operations are applied to the selected regions only, using custom block-sparse convolutions. Features of non-selected regions are simply copied from the preceding frame, reducing both the number of computations and the latency. The execution policy is trained using reinforcement learning in an online fashion, without requiring ground-truth annotations. Our universal framework is demonstrated on dense prediction tasks such as pedestrian detection, instance segmentation and semantic segmentation, using both state-of-the-art networks (Center and Scale Predictor, MGAN, SwiftNet) and standard baseline networks (Mask-RCNN, DeepLabV3+). BlockCopy achieves significant savings in FLOPs and faster inference, with minimal impact on accuracy.
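The copy-or-recompute idea in the abstract can be illustrated with a small pure-Python sketch. This is not the authors' implementation: the names `BLOCK`, `toy_features`, `toy_policy`, `process_frame` and `full_features` are hypothetical, the per-pixel function stands in for the CNN, and a simple frame-difference threshold replaces the learned policy network that the paper trains with reinforcement learning.

```python
# Illustrative sketch only: block-wise feature copying on a toy 2D "frame".
# All names below are hypothetical and not taken from the paper.

BLOCK = 2  # block size in pixels (assumption, kept small for the demo)


def toy_features(value):
    """Stand-in for the expensive per-region CNN computation."""
    return value * 2 + 1


def full_features(frame):
    """Dense baseline: compute features for every pixel of the frame."""
    return [[toy_features(v) for v in row] for row in frame]


def toy_policy(prev_frame, frame, threshold):
    """Select blocks whose summed absolute change exceeds `threshold`.

    Assumption: this heuristic replaces the paper's learned policy network.
    """
    selected = set()
    for by in range(len(frame) // BLOCK):
        for bx in range(len(frame[0]) // BLOCK):
            diff = sum(
                abs(frame[y][x] - prev_frame[y][x])
                for y in range(by * BLOCK, (by + 1) * BLOCK)
                for x in range(bx * BLOCK, (bx + 1) * BLOCK)
            )
            if diff > threshold:
                selected.add((by, bx))
    return selected


def process_frame(frame, prev_features, selected):
    """Recompute features in selected blocks; copy all others unchanged."""
    features = [row[:] for row in prev_features]  # copy from preceding frame
    for by, bx in selected:
        for y in range(by * BLOCK, (by + 1) * BLOCK):
            for x in range(bx * BLOCK, (bx + 1) * BLOCK):
                features[y][x] = toy_features(frame[y][x])
    return features


# Usage: a 4x4 frame in which only the top-left block changes.
frame0 = [[0] * 4 for _ in range(4)]
frame1 = [row[:] for row in frame0]
frame1[0][0] = 5

feats0 = full_features(frame0)
selected = toy_policy(frame0, frame1, threshold=1)
feats1 = process_frame(frame1, feats0, selected)
```

In this toy case only one of the four blocks is recomputed, yet the result matches a dense recomputation because the remaining blocks did not change. With a real CNN, copied features are only approximately valid, since receptive fields extend across block borders and scene content drifts over time; this is presumably why the abstract reports a small accuracy impact rather than none.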

research
11/24/2020

SegBlocks: Block-Based Dynamic Resolution Networks for Real-Time Segmentation

SegBlocks reduces the computational cost of existing neural networks, by...
research
03/21/2018

Fast Semantic Segmentation on Video Using Motion Vector-Based Feature Interpolation

Models optimized for accuracy on challenging, dense prediction tasks suc...
research
07/04/2018

Video Semantic Salient Instance Segmentation: Benchmark Dataset and Baseline

This paper pushes the envelope on salient regions in a video to decompos...
research
11/02/2019

Single-Shot Panoptic Segmentation

We present a novel end-to-end single-shot method that segments countable...
research
04/06/2022

End-to-End Instance Edge Detection

Edge detection has long been an important problem in the field of comput...
research
08/17/2022

Video-TransUNet: Temporally Blended Vision Transformer for CT VFSS Instance Segmentation

We propose Video-TransUNet, a deep architecture for instance segmentatio...
research
08/11/2016

Clockwork Convnets for Video Semantic Segmentation

Recent years have seen tremendous progress in still-image segmentation; ...
