DAC: Data-free Automatic Acceleration of Convolutional Networks

12/20/2018
by Xin Li, et al.

Deploying a deep learning model on mobile/IoT devices is a challenging task. The difficulty lies in the trade-off between computation speed and accuracy. A complex deep learning model with high accuracy runs slowly on resource-limited devices, while a light-weight model that runs much faster loses accuracy. In this paper, we propose a novel decomposition method, namely DAC, that is capable of factorizing an ordinary convolutional layer into two layers with far fewer parameters. DAC computes the corresponding weights for the newly generated layers directly from the weights of the original convolutional layer. Thus, no training (or fine-tuning) and no data are needed. The experimental results show that DAC reduces a large number of floating-point operations (FLOPs) while maintaining the high accuracy of the pre-trained model. If a 2% accuracy drop is acceptable, DAC saves 53% of FLOPs on the ImageNet dataset, 29% on the PASCAL VOC dataset, and 46% on the COCO dataset. Compared to other existing decomposition methods, DAC achieves better performance.
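The paper details DAC's exact factorization; as a rough illustration of the general idea it describes (replacing one convolution with two cheaper layers whose weights are computed directly from the original weights, with no training data), the NumPy sketch below factorizes a conv kernel with a truncated SVD. The function name, rank choice, and reshaping scheme are illustrative assumptions for this sketch, not DAC's channel-wise decomposition.

```python
import numpy as np

def lowrank_factorize_conv(W, rank):
    """Approximate a conv weight W of shape (C_out, C_in, k, k) by two layers:
       W1: (rank, C_in, k, k)  -- a k x k conv with only 'rank' output channels
       W2: (C_out, rank, 1, 1) -- a 1x1 conv that restores the C_out channels
    The composed layers reproduce the truncated-SVD approximation of W."""
    c_out, c_in, k, _ = W.shape
    M = W.reshape(c_out, c_in * k * k)               # flatten all but the output axis
    U, S, Vt = np.linalg.svd(M, full_matrices=False)
    U_r = U[:, :rank] * S[:rank]                     # (C_out, rank), singular values folded in
    V_r = Vt[:rank, :]                               # (rank, C_in*k*k)
    W1 = V_r.reshape(rank, c_in, k, k)               # spatial conv with fewer filters
    W2 = U_r.reshape(c_out, rank, 1, 1)              # 1x1 conv mixing channels back
    return W1, W2

# Example: a random 3x3 conv with 64 input / 128 output channels, factorized at rank 32.
W = np.random.randn(128, 64, 3, 3).astype(np.float32)
W1, W2 = lowrank_factorize_conv(W, rank=32)
print(f"params: {W.size} -> {W1.size + W2.size} "
      f"({(W1.size + W2.size) / W.size:.2%} of the original)")
```

In this toy example the two generated layers hold roughly 31% of the original layer's parameters; how much can be saved in practice, and at what accuracy cost, depends on the chosen rank and on the layer being decomposed.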

Related research

12/24/2022
Pruning On-the-Fly: A Recoverable Pruning Method without Fine-tuning
Most existing pruning works are resource-intensive, requiring retraining...

02/15/2023
TFormer: A Transmission-Friendly ViT Model for IoT Devices
Deploying high-performance vision transformer (ViT) models on ubiquitous...

08/16/2020
KutralNet: A Portable Deep Learning Model for Fire Recognition
Most of the automatic fire alarm systems detect the fire presence throug...

05/24/2019
Light-Weight RetinaNet for Object Detection
Object detection has gained great progress driven by the development of ...

03/22/2021
Channel Scaling: A Scale-and-Select Approach for Transfer Learning
Transfer learning with pre-trained neural networks is a common strategy ...

09/10/2019
Accelerating Training using Tensor Decomposition
Tensor decomposition is one of the well-known approaches to reduce the l...

10/15/2022
Variant Parallelism: Lightweight Deep Convolutional Models for Distributed Inference on IoT Devices
Two major techniques are commonly used to meet real-time inference limit...