DeltaCNN: End-to-End CNN Inference of Sparse Frame Differences in Videos

03/08/2022
by   Mathias Parger, et al.
0

Convolutional neural network inference on video data requires powerful hardware for real-time processing. Given the inherent coherence across consecutive frames, large parts of a video typically change little. By skipping identical image regions and truncating insignificant pixel updates, computational redundancy can in theory be reduced significantly. However, these theoretical savings have been difficult to translate into practice, as sparse updates hamper computational consistency and memory access coherence; which are key for efficiency on real hardware. With DeltaCNN, we present a sparse convolutional neural network framework that enables sparse frame-by-frame updates to accelerate video inference in practice. We provide sparse implementations for all typical CNN layers and propagate sparse feature updates end-to-end - without accumulating errors over time. DeltaCNN is applicable to all convolutional neural networks without retraining. To the best of our knowledge, we are the first to significantly outperform the dense reference, cuDNN, in practical settings, achieving speedups of up to 7x with only marginal differences in accuracy.

READ FULL TEXT

page 6

page 7

research
10/18/2022

MotionDeltaCNN: Sparse CNN Inference of Frame Differences in Moving Camera Videos

Convolutional neural network inference on video input is computationally...
research
04/14/2017

CBinfer: Change-Based Inference for Convolutional Neural Networks on Video Data

Extracting per-frame features using convolutional neural networks for re...
research
11/23/2016

Deep Feature Flow for Video Recognition

Deep convolutional neutral networks have achieved great success on image...
research
11/20/2020

ConvTransformer: A Convolutional Transformer Network for Video Frame Synthesis

Deep Convolutional Neural Networks (CNNs) are powerful models that have ...
research
03/16/2018

EVA^2 : Exploiting Temporal Redundancy in Live Computer Vision

Hardware support for deep convolutional neural networks (CNNs) is critic...
research
02/21/2022

Dynamic Sampling Rate: Harnessing Frame Coherence in Graphics Applications for Energy-Efficient GPUs

In real-time rendering, a 3D scene is modelled with meshes of triangles ...
research
05/16/2018

A practical convolutional neural network as loop filter for intra frame

Loop filters are used in video coding to remove artifacts or improve per...

Please sign up or login with your details

Forgot password? Click here to reset