Rethinking Generalization in American Sign Language Prediction for Edge Devices with Extremely Low Memory Footprint

11/27/2020
by   Aditya Jyoti Paul, et al.
2

Due to the boom in technical compute in the last few years, the world has seen massive advances in artificially intelligent systems solving diverse real-world problems. But a major roadblock in the ubiquitous acceptance of these models is their enormous computational complexity and memory footprint. Hence efficient architectures and training techniques are required for deployment on extremely low resource inference endpoints. This paper proposes an architecture for detection of alphabets in American Sign Language on an ARM Cortex-M7 microcontroller having just 496 KB of framebuffer RAM. Leveraging parameter quantization is a common technique that might cause varying drops in test accuracy. This paper proposes using interpolation as augmentation amongst other techniques as an efficient method of reducing this drop, which also helps the model generalize well to previously unseen noisy data. The proposed model is about 185 KB post-quantization and inference speed is 20 frames per second.

READ FULL TEXT

page 2

page 3

page 4

page 5

page 6

research
11/30/2020

A Tiny CNN Architecture for Medical Face Mask Detection for Resource-Constrained Endpoints

The world is going through one of the most dangerous pandemics of all ti...
research
08/25/2023

OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models

Large language models (LLMs) have revolutionized natural language proces...
research
07/05/2021

Q-SpiNN: A Framework for Quantizing Spiking Neural Networks

A prominent technique for reducing the memory footprint of Spiking Neura...
research
04/05/2022

Efficient Table-based Function Approximation on FPGAs using Interval Splitting and BRAM Instantiation

This paper proposes a novel approach for the generation of memory-effici...
research
03/27/2022

Bunched LPCNet2: Efficient Neural Vocoders Covering Devices from Cloud to Edge

Text-to-Speech (TTS) services that run on edge devices have many advanta...
research
01/12/2021

A character representation enhanced on-device Intent Classification

Intent classification is an important task in natural language understan...
research
06/05/2022

Learning Binarized Graph Representations with Multi-faceted Quantization Reinforcement for Top-K Recommendation

Learning vectorized embeddings is at the core of various recommender sys...

Please sign up or login with your details

Forgot password? Click here to reset