Visual Wake Words Dataset

06/12/2019
by Aakanksha Chowdhery, et al.

The emergence of Internet of Things (IoT) applications requires intelligence on the edge. Microcontrollers provide a low-cost compute platform for deploying intelligent IoT applications using machine learning at scale, but they have extremely limited on-chip memory and compute capability. To deploy computer vision on such devices, we need tiny vision models that fit within a few hundred kilobytes of memory, both in peak memory usage and in model size on device storage. To facilitate the development of microcontroller-friendly models, we present a new dataset, Visual Wake Words, that represents a common microcontroller vision use case: identifying whether or not a person is present in an image. It provides a realistic benchmark for tiny vision models. Within a limited memory footprint of 250 KB, several state-of-the-art mobile models achieve accuracy of 85-90%. We anticipate the proposed dataset will advance research on tiny vision models that can push the Pareto-optimal boundary in terms of accuracy versus memory usage for microcontroller applications.
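
To make the memory constraint concrete, here is a minimal sketch of how one might check a small model against the 250 KB budget mentioned above. It is not the authors' method. The layer shapes, the `Layer` type, the helper names, the 8-bit weights and activations, the reading of 1 KB as 1024 bytes, and the rule that peak usage is the largest input-plus-output activation pair in a simple layer chain are all assumptions made for illustration.

```python
from dataclasses import dataclass

# 250 KB budget from the abstract; 1 KB = 1024 bytes is an assumption here.
BUDGET_BYTES = 250 * 1024


@dataclass
class Layer:
    """Hypothetical description of one layer in a simple sequential model."""
    name: str
    in_elems: int   # number of input activation elements
    out_elems: int  # number of output activation elements
    params: int     # number of weights and biases


def model_size_bytes(layers, bytes_per_weight=1):
    """Storage needed for all parameters (assumes 8-bit weights by default)."""
    return sum(layer.params for layer in layers) * bytes_per_weight


def peak_activation_bytes(layers, bytes_per_act=1):
    """Rough peak working memory: the largest input + output buffer pair.

    Assumes a plain chain of layers where only the current layer's input
    and output must be resident at once (no skip connections).
    """
    return max(layer.in_elems + layer.out_elems for layer in layers) * bytes_per_act


def fits_budget(layers, budget=BUDGET_BYTES):
    """True if both model size and peak activations are within the budget."""
    return (model_size_bytes(layers) <= budget
            and peak_activation_bytes(layers) <= budget)


# Hypothetical tiny person/no-person classifier on a 96x96 grayscale image.
example_layers = [
    Layer("conv1", in_elems=96 * 96 * 1, out_elems=48 * 48 * 8,
          params=3 * 3 * 1 * 8 + 8),
    Layer("conv2", in_elems=48 * 48 * 8, out_elems=24 * 24 * 16,
          params=3 * 3 * 8 * 16 + 16),
    Layer("dense", in_elems=24 * 24 * 16, out_elems=2,
          params=24 * 24 * 16 * 2 + 2),
]

if __name__ == "__main__":
    print("model size (bytes):", model_size_bytes(example_layers))
    print("peak activations (bytes):", peak_activation_bytes(example_layers))
    print("fits 250 KB budget:", fits_budget(example_layers))
```

Checking the two quantities separately mirrors the abstract's distinction between peak usage and model size on device storage. A real deployment would also need to account for runtime overhead and the memory planner of the inference framework, which this sketch ignores.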

