SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification

10/30/2018
by   Sai Samarth R Phaye, et al.
0

Acoustic Scene Classification (ASC) is one of the core research problems in the field of Computational Sound Scene Analysis. In this work, we present SubSpectralNet, a novel model which captures discriminative features by incorporating frequency band-level differences to model soundscapes. Using mel-spectrograms, we propose the idea of using band-wise crops of the input time-frequency representations and train a convolutional neural network (CNN) on the same. We also propose a modification in the training method for more efficient learning of the CNN models. We first give a motivation for using sub-spectrograms by giving intuitive and statistical analyses and finally we develop a sub-spectrogram based CNN architecture for ASC. The system is evaluated on the public ASC development dataset provided for the "Detection and Classification of Acoustic Scenes and Events" (DCASE) 2018 Challenge. Our best model achieves an improvement of +14 respect to the DCASE 2018 baseline system. Code and figures are available at https://github.com/ssrp/SubSpectralNet

READ FULL TEXT
research
07/02/2019

Sub-band Convolutional Neural Networks for Small-footprint Spoken Term Classification

This paper proposes a Sub-band Convolutional Neural Network for spoken t...
research
02/14/2020

Acoustic Scene Classification Using Bilinear Pooling on Time-liked and Frequency-liked Convolution Neural Network

The current methodology in tackling Acoustic Scene Classification (ASC) ...
research
11/30/2020

Dynamic Image for 3D MRI Image Alzheimer's Disease Classification

We propose to apply a 2D CNN architecture to 3D MRI image Alzheimer's di...
research
07/16/2020

Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation

In this technical report, we present a joint effort of four groups, name...
research
05/17/2021

Sound Event Detection with Adaptive Frequency Selection

In this work, we present HIDACT, a novel network architecture for adapti...
research
03/31/2022

1-D CNN based Acoustic Scene Classification via Reducing Layer-wise Dimensionality

This paper presents an alternate representation framework to commonly us...

Please sign up or login with your details

Forgot password? Click here to reset