Tiny Transformers for Environmental Sound Classification at the Edge

03/22/2021
by   David Elliott, et al.
0

With the growth of the Internet of Things and the rise of Big Data, data processing and machine learning applications are being moved to cheap and low size, weight, and power (SWaP) devices at the edge, often in the form of mobile phones, embedded systems, or microcontrollers. The field of Cyber-Physical Measurements and Signature Intelligence (MASINT) makes use of these devices to analyze and exploit data in ways not otherwise possible, which results in increased data quality, increased security, and decreased bandwidth. However, methods to train and deploy models at the edge are limited, and models with sufficient accuracy are often too large for the edge device. Therefore, there is a clear need for techniques to create efficient AI/ML at the edge. This work presents training techniques for audio models in the field of environmental sound classification at the edge. Specifically, we design and train Transformers to classify office sounds in audio clips. Results show that a BERT-based Transformer, trained on Mel spectrograms, can outperform a CNN using 99.85 feature extraction techniques designed for Transformers, using ESC-50 for evaluation, along with various augmentations. Our final model outperforms the state-of-the-art MFCC-based CNN on the office sounds dataset, using just over 6,000 parameters – small enough to run on a microcontroller.

READ FULL TEXT

page 1

page 8

page 12

research
12/09/2021

On The Effect Of Coding Artifacts On Acoustic Scene Classification

Previous DCASE challenges contributed to an increase in the performance ...
research
09/28/2022

Attacking Compressed Vision Transformers

Vision Transformers are increasingly embedded in industrial systems due ...
research
02/10/2022

A VM/Containerized Approach for Scaling TinyML Applications

Although deep neural networks are typically computationally expensive to...
research
10/21/2022

Feature Engineering and Classification Models for Partial Discharge in Power Transformers

To ensure reliability, power transformers are monitored for partial disc...
research
07/07/2022

Training Transformers Together

The infrastructure necessary for training state-of-the-art models is bec...
research
12/02/2019

Audiovisual Transformer Architectures for Large-Scale Classification and Synchronization of Weakly Labeled Audio Events

We tackle the task of environmental event classification by drawing insp...
research
01/24/2020

Small, Accurate, and Fast Vehicle Re-ID on the Edge: the SAFR Approach

We propose a Small, Accurate, and Fast Re-ID (SAFR) design for flexible ...

Please sign up or login with your details

Forgot password? Click here to reset