CMSIS-NN: Efficient Neural Network Kernels for Arm Cortex-M CPUs

01/19/2018
by   Liangzhen Lai, et al.
0

Deep Neural Networks are becoming increasingly popular in always-on IoT edge devices performing data analytics right at the source, reducing latency as well as energy consumption for data communication. This paper presents CMSIS-NN, efficient kernels developed to maximize the performance and minimize the memory footprint of neural network (NN) applications on Arm Cortex-M processors targeted for intelligent IoT edge devices. Neural network inference based on CMSIS-NN kernels achieves 4.6X improvement in runtime/throughput and 4.9X improvement in energy efficiency.

READ FULL TEXT
research
11/09/2021

Ultra-Low Power Keyword Spotting at the Edge

Keyword spotting (KWS) has become an indispensable part of many intellig...
research
05/30/2019

Toward Runtime-Throttleable Neural Networks

As deep neural network (NN) methods have matured, there has been increas...
research
02/07/2023

LUT-NN: Towards Unified Neural Network Inference by Table Lookup

DNN inference requires huge effort of system development and resource co...
research
09/16/2023

Accelerating In-Browser Deep Learning Inference on Diverse Edge Clients through Just-in-Time Kernel Optimizations

Web applications are increasingly becoming the primary platform for AI s...
research
01/14/2021

Enabling Large Neural Networks on Tiny Microcontrollers with Swapping

Running neural networks (NNs) on microcontroller units (MCUs) is becomin...
research
06/12/2014

A Cascade Neural Network Architecture investigating Surface Plasmon Polaritons propagation for thin metals in OpenMP

Surface plasmon polaritons (SPPs) confined along metal-dielectric interf...
research
10/06/2021

Shifting Capsule Networks from the Cloud to the Deep Edge

Capsule networks (CapsNets) are an emerging trend in image processing. I...

Please sign up or login with your details

Forgot password? Click here to reset