E-PANNs: Sound Recognition Using Efficient Pre-trained Audio Neural Networks

05/30/2023
by   Arshdeep Singh, et al.
0

Sounds carry an abundance of information about activities and events in our everyday environment, such as traffic noise, road works, music, or people talking. Recent machine learning methods, such as convolutional neural networks (CNNs), have been shown to be able to automatically recognize sound activities, a task known as audio tagging. One such method, pre-trained audio neural networks (PANNs), provides a neural network which has been pre-trained on over 500 sound classes from the publicly available AudioSet dataset, and can be used as a baseline or starting point for other tasks. However, the existing PANNs model has a high computational complexity and large storage requirement. This could limit the potential for deploying PANNs on resource-constrained devices, such as on-the-edge sound sensors, and could lead to high energy consumption if many such devices were deployed. In this paper, we reduce the computational complexity and memory requirement of the PANNs model by taking a pruning approach to eliminate redundant parameters from the PANNs model. The resulting Efficient PANNs (E-PANNs) model, which requires 36% less computations and 70% less memory, also slightly improves the sound recognition (audio tagging) performance. The code for the E-PANNs model has been released under an open source license.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/02/2018

DCASE 2018 Challenge baseline with convolutional neural networks

The Detection and Classification of Acoustic Scenes and Events (DCASE) i...
research
09/14/2019

musicnn: Pre-trained convolutional neural networks for music audio tagging

Pronounced as "musician", the musicnn library contains a set of pre-trai...
research
11/15/2018

Audio-based identification of beehive states

The absence of the queen in a beehive is a very strong indicator of the ...
research
07/23/2022

Low-complexity CNNs for Acoustic Scene Classification

This paper presents a low-complexity framework for acoustic scene classi...
research
10/14/2022

Description and analysis of novelties introduced in DCASE Task 4 2022 on the baseline system

The aim of the Detection and Classification of Acoustic Scenes and Event...
research
06/03/2021

ERANNs: Efficient Residual Audio Neural Networks for Audio Pattern Recognition

We present a new architecture of convolutional neural networks (CNNs) ba...
research
01/21/2019

Learning sound representations using trainable COPE feature extractors

Sound analysis research has mainly been focused on speech and music proc...

Please sign up or login with your details

Forgot password? Click here to reset