CBinfer: Exploiting Frame-to-Frame Locality for Faster Convolutional Network Inference on Video Streams

08/15/2018
by   Lukas Cavigelli, et al.
0

The last few years have brought advances in computer vision at an amazing pace, grounded on new findings in deep neural network construction and training as well as the availability of large labeled datasets. Applying these networks to images demands a high computational effort and pushes the use of state-of-the-art networks on real-time video data out of reach of embedded platforms. Many recent works focus on reducing network complexity for real-time inference on embedded computing platforms. We adopt an orthogonal viewpoint and propose a novel algorithm exploiting the spatio-temporal sparsity of pixel changes. This optimized inference procedure resulted in an average speed-up of 9.1x over cuDNN on the Tegra X2 platform at a negligible accuracy loss of <0.1 and no retraining of the network for a semantic segmentation application. Similarly, an average speed-up of 7.0x has been achieved for a pose detection DNN on static camera video surveillance data. These throughput gains combined with a lower power consumption result in an energy efficiency of 511 GOp/s/W compared to 70 GOp/s/W for the baseline.

READ FULL TEXT

page 2

page 3

page 6

page 7

page 8

page 9

page 12

page 13

research
04/14/2017

CBinfer: Change-Based Inference for Convolutional Neural Networks on Video Data

Extracting per-frame features using convolutional neural networks for re...
research
06/20/2022

Distortion-Aware Network Pruning and Feature Reuse for Real-time Video Segmentation

Real-time video segmentation is a crucial task for many real-world appli...
research
02/05/2019

Deep Convolutional Generative Adversarial Networks Based Flame Detection in Video

Real-time flame detection is crucial in video based surveillance systems...
research
03/16/2018

EVA^2 : Exploiting Temporal Redundancy in Live Computer Vision

Hardware support for deep convolutional neural networks (CNNs) is critic...
research
11/18/2018

Learning to infer: RL-based search for DNN primitive selection on Heterogeneous Embedded Systems

Deep Learning is increasingly being adopted by industry for computer vis...
research
12/01/2022

Efficient stereo matching on embedded GPUs with zero-means cross correlation

Mobile stereo-matching systems have become an important part of many app...

Please sign up or login with your details

Forgot password? Click here to reset