Environment Sound Classification using Multiple Feature Channels and Deep Convolutional Neural Networks

08/28/2019
by   Jivitesh Sharma, et al.
8

In this paper, we propose a model for the Environment Sound Classification Task (ESC) that consists of multiple feature channels given as input to a Deep Convolutional Neural Network (CNN). The novelty of the paper lies in using multiple feature channels consisting of Mel-Frequency Cepstral Coefficients (MFCC), Gammatone Frequency Cepstral Coefficients (GFCC), the Constant Q-transform (CQT) and Chromagram. Such multiple features have never been used before for signal or audio processing. Also, we employ a deeper CNN (DCNN) compared to previous models, consisting of 2D separable convolutions working on time and feature domain separately. The model also consists of max pooling layers that downsample time and feature domain separately. We use some data augmentation techniques to further boost performance. Our model is able to achieve state-of-the-art performance on all three benchmark environment sound classification datasets, i.e. the UrbanSound8K (98.60 ESC-50 (95.50 single environment sound classification model is able to achieve state-of-the-art results on all three datasets and by a considerable margin over the previous models. For ESC-10 and ESC-50 datasets, the accuracy achieved by the proposed model is beyond human accuracy of 95.7

READ FULL TEXT

page 1

page 4

page 8

page 9

research
08/15/2016

Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification

The ability of deep convolutional neural networks (CNN) to learn discrim...
research
12/04/2017

Raw Waveform-based Audio Classification Using Sample-level CNN Architectures

Music, speech, and acoustic scene sound are often handled separately in ...
research
10/27/2018

Short-segment heart sound classification using an ensemble of deep convolutional neural networks

This paper proposes a framework based on deep convolutional neural netwo...
research
08/25/2018

Deep Convolutional Neural Network with Mixup for Environmental Sound Classification

Environmental sound classification (ESC) is an important and challenging...
research
09/27/2019

Urban Sound Tagging using Convolutional Neural Networks

In this paper, we propose a framework for environmental sound classifica...
research
03/09/2022

Deep Convolutional Neural Network for Roadway Incident Surveillance Using Audio Data

Crash events identification and prediction plays a vital role in underst...
research
06/15/2022

Investigating Multi-Feature Selection and Ensembling for Audio Classification

Deep Learning (DL) algorithms have shown impressive performance in diver...

Please sign up or login with your details

Forgot password? Click here to reset