A Column Streaming-Based Convolution Engine and Mapping Algorithm for CNN-based Edge AI accelerators

09/15/2021
by   Weison Lin, et al.
0

Edge AI accelerators have been emerging as a solution for near customers' applications in areas such as unmanned aerial vehicles (UAVs), image recognition sensors, wearable devices, robotics, and remote sensing satellites. These applications not only require meeting performance targets but also meeting strict area and power constraints due to their portable mobility feature and limited power sources. As a result, a column streaming-based convolution engine has been proposed in this paper that includes column sets of processing elements design for flexibility in terms of the applicability for different CNN algorithms in edge AI accelerators. Comparing to a commercialized CNN accelerator, the key results reveal that the column streaming-based convolution engine requires similar execution cycles for processing a 227 x 227 feature map with avoiding zero-padding penalties.

READ FULL TEXT

page 2

page 4

research
04/27/2015

On-Board Vision Processing For Small UAVs: Time to Rethink Strategy

The ultimate research goal for unmanned aerial vehicles (UAVs) is to fac...
research
09/04/2023

SATAY: A Streaming Architecture Toolflow for Accelerating YOLO Models on FPGA Devices

AI has led to significant advancements in computer vision and image proc...
research
07/15/2023

PASS: Exploiting Post-Activation Sparsity in Streaming Architectures for CNN Acceleration

With the ever-growing popularity of Artificial Intelligence, there is an...
research
10/08/2022

AI and ML Accelerator Survey and Trends

This paper updates the survey of AI accelerators and processors from pas...
research
07/25/2023

Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights Generation

The unprecedented accuracy of convolutional neural networks (CNNs) acros...
research
05/10/2018

GANAX: A Unified MIMD-SIMD Acceleration for Generative Adversarial Networks

Generative Adversarial Networks (GANs) are one of the most recent deep l...
research
02/07/2021

CrossStack: A 3-D Reconfigurable RRAM Crossbar Inference Engine

Deep neural network inference accelerators are rapidly growing in import...

Please sign up or login with your details

Forgot password? Click here to reset