FixyNN: Efficient Hardware for Mobile Computer Vision via Transfer Learning

02/27/2019
by   Paul N. Whatmough, et al.
0

The computational demands of computer vision tasks based on state-of-the-art Convolutional Neural Network (CNN) image classification far exceed the energy budgets of mobile devices. This paper proposes FixyNN, which consists of a fixed-weight feature extractor that generates ubiquitous CNN features, and a conventional programmable CNN accelerator which processes a dataset-specific CNN. Image classification models for FixyNN are trained end-to-end via transfer learning, with the common feature extractor representing the transfered part, and the programmable part being learnt on the target dataset. Experimental results demonstrate FixyNN hardware can achieve very high energy efficiencies up to 26.6 TOPS/W (4.81 × better than iso-area programmable accelerator). Over a suite of six datasets we trained models via transfer learning with an accuracy loss of <1% resulting in up to 11.2 TOPS/W - nearly 2 × more efficient than a conventional programmable CNN accelerator of the same area.

READ FULL TEXT
research
12/04/2018

Energy Efficient Hardware for On-Device CNN Inference via Transfer Learning

On-device CNN inference for real-time computer vision applications can r...
research
06/04/2019

System Demo for Transfer Learning across Vision and Text using Domain Specific CNN Accelerator for On-Device NLP Applications

Power-efficient CNN Domain Specific Accelerator (CNN-DSA) chips are curr...
research
06/12/2019

Pay Attention to Convolution Filters: Towards Fast and Accurate Fine-Grained Transfer Learning

We propose an efficient transfer learning method for adapting ImageNet p...
research
01/23/2019

Programmable Neural Network Trojan for Pre-Trained Feature Extractor

Neural network (NN) trojaning attack is an emerging and important attack...
research
04/20/2023

Visual DNA: Representing and Comparing Images using Distributions of Neuron Activations

Selecting appropriate datasets is critical in modern computer vision. Ho...
research
02/27/2020

A Free-Energy Principle for Representation Learning

This paper employs a formal connection of machine learning with thermody...
research
05/09/2018

Performance evaluation over HW/SW co-design SoC memory transfers for a CNN accelerator

Many FPGAs vendors have recently included embedded processors in their d...

Please sign up or login with your details

Forgot password? Click here to reset