ALF: Autoencoder-based Low-rank Filter-sharing for Efficient Convolutional Neural Networks

07/27/2020
by   Alexander Frickenstein, et al.
8

Closing the gap between the hardware requirements of state-of-the-art convolutional neural networks and the limited resources constraining embedded applications is the next big challenge in deep learning research. The computational complexity and memory footprint of such neural networks are typically daunting for deployment in resource constrained environments. Model compression techniques, such as pruning, are emphasized among other optimization methods for solving this problem. Most existing techniques require domain expertise or result in irregular sparse representations, which increase the burden of deploying deep learning applications on embedded hardware accelerators. In this paper, we propose the autoencoder-based low-rank filter-sharing technique technique (ALF). When applied to various networks, ALF is compared to state-of-the-art pruning methods, demonstrating its efficient compression capabilities on theoretical metrics as well as on an accurate, deterministic hardware-model. In our experiments, ALF showed a reduction of 70% in network parameters, 61% in operations and 41% in execution time, with minimal loss in accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

research
03/22/2023

Low Rank Optimization for Efficient Deep Learning: Making A Balance between Compact Architecture and Fast Training

Deep neural networks have achieved great success in many data processing...
research
02/03/2018

Recent Advances in Efficient Computation of Deep Convolutional Neural Networks

Deep neural networks have evolved remarkably over the past few years and...
research
03/23/2018

Iterative Low-Rank Approximation for CNN Compression

Deep convolutional neural networks contain tens of millions of parameter...
research
05/14/2019

Network Pruning for Low-Rank Binary Indexing

Pruning is an efficient model compression technique to remove redundancy...
research
09/19/2018

Characterising Across-Stack Optimisations for Deep Convolutional Neural Networks

Convolutional Neural Networks (CNNs) are extremely computationally deman...
research
01/19/2018

EffNet: An Efficient Structure for Convolutional Neural Networks

With the ever increasing application of Convolutional Neural Networks to...
research
10/23/2017

A Survey of Model Compression and Acceleration for Deep Neural Networks

Deep convolutional neural networks (CNNs) have recently achieved great s...

Please sign up or login with your details

Forgot password? Click here to reset