Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification

08/15/2016
by   Justin Salamon, et al.
0

The ability of deep convolutional neural networks (CNN) to learn discriminative spectro-temporal patterns makes them well suited to environmental sound classification. However, the relative scarcity of labeled data has impeded the exploitation of this family of high-capacity models. This study has two primary contributions: first, we propose a deep convolutional neural network architecture for environmental sound classification. Second, we propose the use of audio data augmentation for overcoming the problem of data scarcity and explore the influence of different augmentations on the performance of the proposed CNN architecture. Combined with data augmentation, the proposed model produces state-of-the-art results for environmental sound classification. We show that the improved performance stems from the combination of a deep, high-capacity model and an augmented training set: this combination outperforms both the proposed CNN without augmentation and a "shallow" dictionary learning model with augmentation. Finally, we examine the influence of each augmentation on the model's classification accuracy for each class, and observe that the accuracy for each class is influenced differently by each augmentation, suggesting that the performance of the model could be improved further by applying class-conditional data augmentation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/27/2019

Urban Sound Tagging using Convolutional Neural Networks

In this paper, we propose a framework for environmental sound classifica...
research
01/24/2019

Multi-stream Network With Temporal Attention For Environmental Sound Classification

Environmental sound classification systems often do not perform robustly...
research
08/28/2019

Environment Sound Classification using Multiple Feature Channels and Deep Convolutional Neural Networks

In this paper, we propose a model for the Environment Sound Classificati...
research
04/08/2019

Unsupervised Feature Learning for Environmental Sound Classification Using Cycle Consistent Generative Adversarial Network

In this paper we propose a novel environmental sound classification appr...
research
03/09/2022

Deep Convolutional Neural Network for Roadway Incident Surveillance Using Audio Data

Crash events identification and prediction plays a vital role in underst...
research
01/14/2016

Improved Relation Classification by Deep Recurrent Neural Networks with Data Augmentation

Nowadays, neural networks play an important role in the task of relation...
research
12/27/2019

A Multi-cascaded Model with Data Augmentation for Enhanced Paraphrase Detection in Short Texts

Paraphrase detection is an important task in text analytics with numerou...

Please sign up or login with your details

Forgot password? Click here to reset