Keyword Spotting System and Evaluation of Pruning and Quantization Methods on Low-power Edge Microcontrollers

08/04/2022
by   Jingyi Wang, et al.
2

Keyword spotting (KWS) is beneficial for voice-based user interactions with low-power devices at the edge. The edge devices are usually always-on, so edge computing brings bandwidth savings and privacy protection. The devices typically have limited memory spaces, computational performances, power and costs, for example, Cortex-M based microcontrollers. The challenge is to meet the high computation and low-latency requirements of deep learning on these devices. This paper firstly shows our small-footprint KWS system running on STM32F7 microcontroller with Cortex-M7 core @216MHz and 512KB static RAM. Our selected convolutional neural network (CNN) architecture has simplified number of operations for KWS to meet the constraint of edge devices. Our baseline system generates classification results for each 37ms including real-time audio feature extraction part. This paper further evaluates the actual performance for different pruning and quantization methods on microcontroller, including different granularity of sparsity, skipping zero weights, weight-prioritized loop order, and SIMD instruction. The result shows that for microcontrollers, there are considerable challenges for accelerate unstructured pruned models, and the structured pruning is more friendly than unstructured pruning. The result also verified that the performance improvement for quantization and SIMD instruction.

READ FULL TEXT
research
08/29/2019

An Ultra-Efficient Memristor-Based DNN Framework with Structured Weight Pruning and Quantization Using ADMM

The high computation and memory storage of large deep neural networks (D...
research
12/13/2021

Event Based Time-Vectors for auditory features extraction: a neuromorphic approach for low power audio recognition

In recent years tremendous efforts have been done to advance the state o...
research
08/13/2021

Pruning vs XNOR-Net: A Comprehensive Study of Deep Learning for Audio Classification on Edge-devices

Deep Learning has celebrated resounding successes in many application ar...
research
08/31/2021

Multistage Pruning of CNN Based ECG Classifiers for Edge Devices

Using smart wearable devices to monitor patients electrocardiogram (ECG)...
research
12/11/2021

Achieving Low Complexity Neural Decoders via Iterative Pruning

The advancement of deep learning has led to the development of neural de...
research
03/22/2018

A Quantization-Friendly Separable Convolution for MobileNets

As deep learning (DL) is being rapidly pushed to edge computing, researc...
research
03/25/2021

Prototype-based Personalized Pruning

Nowadays, as edge devices such as smartphones become prevalent, there ar...

Please sign up or login with your details

Forgot password? Click here to reset