Hardware Architecture of Embedded Inference Accelerator and Analysis of Algorithms for Depthwise and Large-Kernel Convolutions

04/29/2021
by Tse-Wei Chen, et al.

To handle modern convolutional neural networks (CNNs) efficiently, a hardware architecture for a CNN inference accelerator is proposed that supports both depthwise convolutions and regular convolutions, which are essential building blocks for embedded computer vision algorithms. Unlike related works, the proposed architecture flexibly supports filter kernels of different sizes because it incurs no extra cost for intra-kernel parallelism, and it generates convolution results faster than the architectures of the related works. The experimental results show the importance of supporting depthwise convolutions and dilated convolutions with the proposed hardware architecture. In addition to depthwise convolutions with large kernels, a new structure called the DDC layer, which combines depthwise convolutions and dilated convolutions, is also analyzed in this paper. For face detection, the computational costs decrease by 30% and the model size decreases by 20%. For image classification, the accuracy is increased by 1% when 3×3 filters are replaced with 5×5 filters in depthwise convolutions.
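As a rough illustration of the two operations the abstract combines, the sketch below implements a plain depthwise convolution with an optional dilation factor and counts kernel weights for regular versus depthwise layers. It is a minimal pure-Python sketch, not the paper's DDC layer or its hardware dataflow, neither of which is specified here. The function names `depthwise_conv2d` and `weight_count`, the "valid" padding, the stride of 1, and the 32-channel example are all assumptions made for illustration.

```python
def depthwise_conv2d(x, kernels, dilation=1):
    """Depthwise convolution sketch (hypothetical helper, not from the paper).

    x:        list of C channels, each an H x W list of lists.
    kernels:  list of C kernels, each K x K; channel c uses only kernels[c].
    Assumes 'valid' padding and stride 1.
    """
    out = []
    for channel, kernel in zip(x, kernels):
        k = len(kernel)
        # Extent of the input window covered by the dilated kernel.
        span = dilation * (k - 1) + 1
        h, w = len(channel), len(channel[0])
        rows = []
        for i in range(h - span + 1):
            row = []
            for j in range(w - span + 1):
                acc = 0
                for u in range(k):
                    for v in range(k):
                        acc += kernel[u][v] * channel[i + u * dilation][j + v * dilation]
                row.append(acc)
            rows.append(row)
        out.append(rows)
    return out


def weight_count(channels_in, channels_out, k, depthwise):
    """Number of kernel weights, ignoring biases (illustrative only)."""
    if depthwise:
        return channels_in * k * k
    return channels_in * channels_out * k * k


# One 5x5 channel of ones filtered by a 3x3 kernel of ones.
ones_input = [[[1] * 5 for _ in range(5)]]
ones_kernel = [[[1] * 3 for _ in range(3)]]

dense = depthwise_conv2d(ones_input, ones_kernel, dilation=1)    # 3x3 output
dilated = depthwise_conv2d(ones_input, ones_kernel, dilation=2)  # 1x1 output

regular_3x3 = weight_count(32, 32, 3, depthwise=False)
depthwise_3x3 = weight_count(32, 32, 3, depthwise=True)
depthwise_5x5 = weight_count(32, 32, 5, depthwise=True)
```

With a dilation of 2, the 3×3 kernel covers a 5×5 window while still using nine weights, which is why dilation enlarges the field of view without adding parameters. In the assumed 32-channel example, a 5×5 depthwise layer still needs far fewer weights than a regular 3×3 layer.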


research
04/29/2021

CASSOD-Net: Cascaded and Separable Structures of Dilated Convolution for Embedded Vision Systems and Applications

The field of view (FOV) of convolutional neural networks is highly relat...
research
02/03/2020

DWM: A Decomposable Winograd Method for Convolution Acceleration

Winograd's minimal filtering algorithm has been widely used in Convoluti...
research
08/07/2018

Efficient Fusion of Sparse and Complementary Convolutions for Object Recognition and Detection

We propose a new method for exploiting sparsity in convolutional kernels...
research
05/02/2022

Efficient Accelerator for Dilated and Transposed Convolution with Decomposition

Hardware acceleration for dilated and transposed convolution enables rea...
research
06/17/2016

DecomposeMe: Simplifying ConvNets for End-to-End Learning

Deep learning and convolutional neural networks (ConvNets) have been suc...
research
04/03/2023

Tunable Convolutions with Parametric Multi-Loss Optimization

Behavior of neural networks is irremediably determined by the specific l...
research
04/17/2018

IGCV2: Interleaved Structured Sparse Convolutional Neural Networks

In this paper, we study the problem of designing efficient convolutional...
