Receptive-field-regularized CNN variants for acoustic scene classification

09/05/2019
by   Khaled Koutini, et al.

Acoustic scene classification and related tasks have been dominated by Convolutional Neural Networks (CNNs). Top-performing CNNs mainly use audio spectrograms as input and borrow their architectural design primarily from computer vision. A recent study has shown that restricting the receptive field (RF) of CNNs in appropriate ways is crucial for their performance, robustness and generalization in audio tasks. One side effect of restricting the RF of CNNs is that more frequency information is lost. In this paper, we first perform a systematic investigation of different RF configurations for various CNN architectures on the DCASE 2019 Task 1.A dataset. Second, we introduce Frequency Aware CNNs to compensate for the lack of frequency information caused by the restricted RF, and experimentally determine if and in what RF ranges they yield additional improvement. The results of these investigations are several well-performing submissions to different tasks in the DCASE 2019 Challenge.
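
To make the notion of a restricted receptive field concrete, the following minimal sketch computes the RF along one axis (for example, frequency) of a plain stack of convolution and pooling layers, using the standard recurrence in which each layer widens the RF by (kernel - 1) times the accumulated stride. The function name `receptive_field` and both layer lists are hypothetical illustrations, not the architectures evaluated in the paper; dilation and padding effects are ignored.

```python
def receptive_field(layers):
    """Return the receptive field size along one axis.

    `layers` is a list of (kernel_size, stride) pairs, ordered from the
    input to the output of the network. Dilation is assumed to be 1.
    """
    rf = 1    # a single input bin sees only itself
    jump = 1  # distance, in input bins, between adjacent outputs
    for kernel, stride in layers:
        rf += (kernel - 1) * jump  # each layer widens the RF
        jump *= stride             # striding spreads later layers further
    return rf


# Hypothetical stacks: the second replaces most 3x3 filters with 1x1
# filters, which is one simple way to restrict the RF.
wide = [(3, 1), (3, 1), (2, 2), (3, 1), (3, 1)]
narrow = [(3, 1), (1, 1), (2, 2), (1, 1), (1, 1)]

print(receptive_field(wide))    # 14
print(receptive_field(narrow))  # 4
```

In this sketch, swapping 3x3 filters for 1x1 filters shrinks the RF from 14 to 4 input bins without changing the depth of the stack, which illustrates how the RF can be varied independently of the number of layers.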

research
05/26/2021

Receptive Field Regularization Techniques for Audio Classification and Tagging with Deep Convolutional Neural Networks

In this paper, we study the performance of variants of well-known Convol...
research
10/28/2019

Emotion and Theme Recognition in Music with Frequency-Aware RF-Regularized CNNs

We present CP-JKU submission to MediaEval 2019; a Receptive Field-(RF)-r...
research
07/03/2019

The Receptive Field as a Regularizer in Deep Convolutional Neural Networks for Acoustic Scene Classification

Convolutional Neural Networks (CNNs) have had great success in many mach...
research
09/15/2023

TF-SepNet: An Efficient 1D Kernel Design in CNNs for Low-Complexity Acoustic Scene Classification

Recent studies focus on developing efficient systems for acoustic scene ...
research
07/27/2020

Receptive-Field Regularized CNNs for Music Classification and Tagging

Convolutional Neural Networks (CNNs) have been successfully used in vari...
research
04/13/2022

Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation

Speech dereverberation is often an important requirement in robust speec...
research
02/02/2014

Collaborative Receptive Field Learning

The challenge of object categorization in images is largely due to arbit...
