An In-Memory Analog Computing Co-Processor for Energy-Efficient CNN Inference on Mobile Devices

05/24/2021
by Mohammed Elbtity, et al.

In this paper, we develop an in-memory analog computing (IMAC) architecture that realizes both synaptic behavior and activation functions within non-volatile memory arrays. Spin-orbit torque magnetoresistive random-access memory (SOT-MRAM) devices are leveraged to realize sigmoidal neurons as well as binarized synapses. First, it is shown that the proposed IMAC architecture can be used to realize a multilayer perceptron (MLP) classifier achieving orders of magnitude performance improvement compared to previous mixed-signal and digital implementations. Next, a heterogeneous mixed-signal and mixed-precision CPU-IMAC architecture is proposed for convolutional neural network (CNN) inference on mobile processors, in which IMAC is designed as a co-processor that realizes the fully-connected (FC) layers, whereas the convolution layers are executed on the CPU. Architecture-level analytical models are developed to evaluate the performance and energy consumption of the CPU-IMAC architecture. Simulation results exhibit 6.5 LeNet and VGG CNN models, for MNIST and CIFAR-10 pattern recognition tasks, respectively.
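The FC-layer computation described in the abstract (binarized synapses feeding sigmoidal neurons) can be illustrated with a small software model. This is only a sketch under stated assumptions, not the authors' implementation: it assumes synapses binarize to +1/-1 by sign, it ignores all analog circuit effects, and every name in it (`binarize`, `fc_layer`, `mlp_forward`, `features`) is hypothetical. The `features` vector stands in for the output of convolution layers that the CPU would compute before handing off to the co-processor.

```python
import math
import random


def binarize(w):
    """Map a real-valued weight to a binarized synapse value (assumed +1/-1)."""
    return 1.0 if w >= 0.0 else -1.0


def sigmoid(x):
    """Ideal sigmoidal neuron; a real analog neuron would only approximate this."""
    return 1.0 / (1.0 + math.exp(-x))


def fc_layer(inputs, weights):
    """One fully-connected layer: binarized weighted sum, then sigmoid."""
    return [
        sigmoid(sum(binarize(w) * x for w, x in zip(row, inputs)))
        for row in weights
    ]


def mlp_forward(inputs, layers):
    """Run the inputs through each FC layer in turn."""
    activations = inputs
    for weights in layers:
        activations = fc_layer(activations, weights)
    return activations


# Hypothetical layer sizes and random weights, purely for illustration.
random.seed(0)
layer_sizes = [8, 4, 3]
layers = [
    [[random.uniform(-1.0, 1.0) for _ in range(n_in)] for _ in range(n_out)]
    for n_in, n_out in zip(layer_sizes, layer_sizes[1:])
]

# Placeholder for the feature vector produced by the CPU-side convolution layers.
features = [random.random() for _ in range(layer_sizes[0])]

outputs = mlp_forward(features, layers)
predicted_class = max(range(len(outputs)), key=outputs.__getitem__)
```

In this model the split between the two processors is just a function boundary: everything before `features` would run on the CPU, and `mlp_forward` represents the part the abstract assigns to the IMAC co-processor.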


research
12/04/2020

A Single-Cycle MLP Classifier Using Analog MRAM-based Neurons and Synapses

In this paper, spin-orbit torque (SOT) magnetoresistive random-access me...
research
04/18/2023

Heterogeneous Integration of In-Memory Analog Computing Architectures with Tensor Processing Units

Tensor processing units (TPUs), specialized hardware accelerators for ma...
research
06/26/2023

CIMulator: A Comprehensive Simulation Platform for Computing-In-Memory Circuit Macros with Low Bit-Width and Real Memory Materials

This paper presents a simulation platform, namely CIMulator, for quantif...
research
06/01/2020

SOT-MRAM based Sigmoidal Neuron for Neuromorphic Architectures

In this paper, the intrinsic physical characteristics of spin-orbit torq...
research
09/19/2017

An Analog Neural Network Computing Engine using CMOS-Compatible Charge-Trap-Transistor (CTT)

An analog neural network computing engine based on CMOS-compatible charg...
research
10/30/2018

A mixed signal architecture for convolutional neural networks

Deep neural network (DNN) accelerators with improved energy and delay ar...
research
09/19/2017

A Memristive Neural Network Computing Engine using CMOS-Compatible Charge-Trap-Transistor (CTT)

A memristive neural network computing engine based on CMOS-compatible ch...
