NeuroTrainer: An Intelligent Memory Module for Deep Learning Training

10/12/2017
by Duckhwan Kim, et al.

This paper presents NeuroTrainer, an intelligent memory module with in-memory accelerators that forms the building block of a scalable architecture for energy-efficient training of deep neural networks. The proposed architecture integrates a homogeneous computing substrate, composed of multiple processing engines, in the logic layer of a 3D memory module. NeuroTrainer uses a programmable data-flow-based execution model to optimize memory mapping and data reuse during the different phases of training. A programming model and supporting architecture use this flexible data flow to efficiently accelerate training of various types of DNNs. Cycle-level simulation and a synthesized design in 15nm FinFET show a power efficiency of 500 GFLOPS/W and nearly the same throughput across a wide range of DNNs, including convolutional, recurrent, multi-layer-perceptron, and mixed (CNN+RNN) networks.


