CompactNet: Platform-Aware Automatic Optimization for Convolutional Neural Networks

05/28/2019
by   Weicheng Li, et al.
0

Convolutional Neural Network (CNN) based Deep Learning (DL) has achieved great progress in many real-life applications. Meanwhile, due to the complex model structures against strict latency and memory restriction, the implementation of CNN models on the resource-limited platforms is becoming more challenging. This work proposes a solution, called CompactNet[Project URL: <https://github.com/CompactNet/CompactNet>], which automatically optimizes a pre-trained CNN model on a specific resource-limited platform given a specific target of inference speedup. Guided by a simulator of the target platform, CompactNet progressively trims a pre-trained network by removing certain redundant filters until the target speedup is reached and generates an optimal platform-specific model while maintaining the accuracy. We evaluate our work on two platforms of a mobile ARM CPU and a machine learning accelerator NPU (Cambricon-1A ISA) on a Huawei Mate10 smartphone. For the state-of-the-art slim CNN model made for the embedded platform, MobileNetV2, CompactNet achieves up to a 1.8x kernel computation speedup with equal or even higher accuracy for image classification tasks on the Cifar-10 dataset.

READ FULL TEXT

page 1

page 4

research
04/09/2018

NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applications

This work proposes an automated algorithm, called NetAdapt, that adapts ...
research
06/26/2021

Image Classification with CondenseNeXt for ARM-Based Computing Platforms

In this paper, we demonstrate the implementation of our ultra-efficient ...
research
01/14/2020

A C Code Generator for Fast Inference and Simple Deployment of Convolutional Neural Networks on Resource Constrained Systems

Inference of Convolutional Neural Networks in time critical applications...
research
07/30/2018

ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design

Currently, the neural network architecture design is mostly guided by th...
research
06/14/2019

MediaPipe: A Framework for Building Perception Pipelines

Building applications that perceive the world around them is challenging...
research
05/09/2023

DietCNN: Multiplication-free Inference for Quantized CNNs

The rising demand for networked embedded systems with machine intelligen...
research
07/21/2022

Hardware-Efficient Template-Based Deep CNNs Accelerator Design

Acceleration of Convolutional Neural Network (CNN) on edge devices has r...

Please sign up or login with your details

Forgot password? Click here to reset